1
|
Feuerriegel S, Frauen D, Melnychuk V, Schweisthal J, Hess K, Curth A, Bauer S, Kilbertus N, Kohane IS, van der Schaar M. Causal machine learning for predicting treatment outcomes. Nat Med 2024; 30:958-968. [PMID: 38641741 DOI: 10.1038/s41591-024-02902-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 03/04/2024] [Indexed: 04/21/2024]
Abstract
Causal machine learning (ML) offers flexible, data-driven methods for predicting treatment outcomes including efficacy and toxicity, thereby supporting the assessment and safety of drugs. A key benefit of causal ML is that it allows for estimating individualized treatment effects, so that clinical decision-making can be personalized to individual patient profiles. Causal ML can be used in combination with both clinical trial data and real-world data, such as clinical registries and electronic health records, but caution is needed to avoid biased or incorrect predictions. In this Perspective, we discuss the benefits of causal ML (relative to traditional statistical or ML approaches) and outline the key components and steps. Finally, we provide recommendations for the reliable use of causal ML and effective translation into the clinic.
Collapse
Affiliation(s)
- Stefan Feuerriegel
- LMU Munich, Munich, Germany.
- Munich Center for Machine Learning, Munich, Germany.
| | - Dennis Frauen
- LMU Munich, Munich, Germany
- Munich Center for Machine Learning, Munich, Germany
| | - Valentyn Melnychuk
- LMU Munich, Munich, Germany
- Munich Center for Machine Learning, Munich, Germany
| | - Jonas Schweisthal
- LMU Munich, Munich, Germany
- Munich Center for Machine Learning, Munich, Germany
| | - Konstantin Hess
- LMU Munich, Munich, Germany
- Munich Center for Machine Learning, Munich, Germany
| | - Alicia Curth
- Department of Applied Mathematics & Theoretical Physics, University of Cambridge, Cambridge, UK
| | - Stefan Bauer
- School of Computation, Information and Technology, TU Munich, Munich, Germany
- Helmholtz Munich, Munich, Germany
| | - Niki Kilbertus
- Munich Center for Machine Learning, Munich, Germany
- School of Computation, Information and Technology, TU Munich, Munich, Germany
- Helmholtz Munich, Munich, Germany
| | - Isaac S Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Mihaela van der Schaar
- Cambridge Centre for AI in Medicine, University of Cambridge, Cambridge, UK
- The Alan Turing Institute, London, UK
| |
Collapse
|
2
|
Balagopalan A, Baldini I, Celi LA, Gichoya J, McCoy LG, Naumann T, Shalit U, van der Schaar M, Wagstaff KL. Machine learning for healthcare that matters: Reorienting from technical novelty to equitable impact. PLOS Digit Health 2024; 3:e0000474. [PMID: 38620047 PMCID: PMC11018283 DOI: 10.1371/journal.pdig.0000474] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Accepted: 02/18/2024] [Indexed: 04/17/2024]
Abstract
Despite significant technical advances in machine learning (ML) over the past several years, the tangible impact of this technology in healthcare has been limited. This is due not only to the particular complexities of healthcare, but also due to structural issues in the machine learning for healthcare (MLHC) community which broadly reward technical novelty over tangible, equitable impact. We structure our work as a healthcare-focused echo of the 2012 paper "Machine Learning that Matters", which highlighted such structural issues in the ML community at large, and offered a series of clearly defined "Impact Challenges" to which the field should orient itself. Drawing on the expertise of a diverse and international group of authors, we engage in a narrative review and examine issues in the research background environment, training processes, evaluation metrics, and deployment protocols which act to limit the real-world applicability of MLHC. Broadly, we seek to distinguish between machine learning ON healthcare data and machine learning FOR healthcare-the former of which sees healthcare as merely a source of interesting technical challenges, and the latter of which regards ML as a tool in service of meeting tangible clinical needs. We offer specific recommendations for a series of stakeholders in the field, from ML researchers and clinicians, to the institutions in which they work, and the governments which regulate their data access.
Collapse
Affiliation(s)
- Aparna Balagopalan
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology; Cambridge, Massachusetts, United States of America
| | - Ioana Baldini
- IBM Research; Yorktown Heights, New York, United States of America
| | - Leo Anthony Celi
- Laboratory for Computational Physiology, Massachusetts Institute of Technology; Cambridge, Massachusetts, United States of America
- Division of Pulmonary, Critical Care and Sleep Medicine, Beth Israel Deaconess Medical Center; Boston, Massachusetts, United States of America
- Department of Biostatistics, Harvard T.H. Chan School of Public Health; Boston, Massachusetts, United States of America
| | - Judy Gichoya
- Department of Radiology and Imaging Sciences, School of Medicine, Emory University; Atlanta, Georgia, United States of America
| | - Liam G. McCoy
- Division of Neurology, Department of Medicine, University of Alberta; Edmonton, Alberta, Canada
| | - Tristan Naumann
- Microsoft Research; Redmond, Washington, United States of America
| | - Uri Shalit
- The Faculty of Data and Decision Sciences, Technion; Haifa, Israel
| | - Mihaela van der Schaar
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge; Cambridge, United Kingdom
- The Alan Turing Institute; London, United Kingdom
| | | |
Collapse
|
3
|
Bellot A, van der Schaar M. Linear Deconfounded Score Method: Scoring DAGs With Dense Unobserved Confounding. IEEE Trans Neural Netw Learn Syst 2024; 35:4948-4962. [PMID: 38285579 DOI: 10.1109/tnnls.2024.3352657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/31/2024]
Abstract
This article deals with the discovery of causal relations from a combination of observational data and qualitative assumptions about the nature of causality in the presence of unmeasured confounding. We focus on applications where unobserved variables are known to have a widespread effect on many of the observed ones, which makes the problem particularly difficult for constraint-based methods, because most pairs of variables are conditionally dependent given any other subset, rendering the causal effect unidentifiable. In this article, we show that under the principle of independent mechanisms, unobserved confounding in this setting leaves a statistical footprint in the observed data distribution that allows for disentangling spurious and causal effects. Using this insight, we demonstrate that a sparse linear Gaussian directed acyclic graph (DAG) among observed variables may be recovered approximately and propose a simple adjusted score-based causal discovery algorithm that may be implemented with general-purpose solvers and scales to high-dimensional problems. We find, in addition, that despite the conditions we pose to guarantee causal recovery, performance in practice is robust to large deviations in model assumptions, and extensions to nonlinear structural models are possible.
Collapse
|
4
|
Curth A, Peck RW, McKinney E, Weatherall J, van der Schaar M. Using Machine Learning to Individualize Treatment Effect Estimation: Challenges and Opportunities. Clin Pharmacol Ther 2024; 115:710-719. [PMID: 38124482 DOI: 10.1002/cpt.3159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 12/07/2023] [Indexed: 12/23/2023]
Abstract
The use of data from randomized clinical trials to justify treatment decisions for real-world patients is the current state of the art. It relies on the assumption that average treatment effects from the trial can be extrapolated to patients with personal and/or disease characteristics different from those treated in the trial. Yet, because of heterogeneity of treatment effects between patients and between the trial population and real-world patients, this assumption may not be correct for many patients. Using machine learning to estimate the expected conditional average treatment effect (CATE) in individual patients from observational data offers the potential for more accurate estimation of the expected treatment effects in each patient based on their observed characteristics. In this review, we discuss some of the challenges and opportunities for machine learning to estimate CATE, including ensuring identification assumptions are met, managing covariate shift, and learning without access to the true label of interest. We also discuss the potential applications as well as future work and collaborations needed to further improve identification and utilization of CATE estimates to increase patient benefit.
Collapse
Affiliation(s)
- Alicia Curth
- Department of Applied Mathematics & Theoretical Physics, University of Cambridge, Cambridge, UK
| | - Richard W Peck
- Department of Pharmacology & Therapeutics, University of Liverpool, Liverpool, UK
- Roche Pharma Research & Early Development (pRED), Roche Innovation Center, Basel, Switzerland
| | - Eoin McKinney
- Cambridge Institute for Immunotherapy & Infectious Disease, Jeffrey Cheah Biomedical Center, Cambridge Biomedical Campus, Addenbrooke's Hospital, Cambridge, UK
- Cambridge Centre for AI in Medicine, Cambridge, UK
| | - James Weatherall
- AstraZeneca R&D Data Science and Artificial Intelligence, Cambridge, UK
| | - Mihaela van der Schaar
- Department of Applied Mathematics & Theoretical Physics, University of Cambridge, Cambridge, UK
- Cambridge Centre for AI in Medicine, Cambridge, UK
- The Alan Turing Institute, London, UK
| |
Collapse
|
5
|
Heremans ERM, Seedat N, Buyse B, Testelmans D, van der Schaar M, De Vos M. U-PASS: An uncertainty-guided deep learning pipeline for automated sleep staging. Comput Biol Med 2024; 171:108205. [PMID: 38401452 DOI: 10.1016/j.compbiomed.2024.108205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 02/16/2024] [Accepted: 02/20/2024] [Indexed: 02/26/2024]
Abstract
With the increasing prevalence of machine learning in critical fields like healthcare, ensuring the safety and reliability of these systems is crucial. Estimating uncertainty plays a vital role in enhancing reliability by identifying areas of high and low confidence and reducing the risk of errors. This study introduces U-PASS, a specialized human-centered machine learning pipeline tailored for clinical applications, which effectively communicates uncertainty to clinical experts and collaborates with them to improve predictions. U-PASS incorporates uncertainty estimation at every stage of the process, including data acquisition, training, and model deployment. Training is divided into a supervised pre-training step and a semi-supervised recording-wise finetuning step. We apply U-PASS to the challenging task of sleep staging and demonstrate that it systematically improves performance at every stage. By optimizing the training dataset, actively seeking feedback from domain experts for informative samples, and deferring the most uncertain samples to experts, U-PASS achieves an impressive expert-level accuracy of 85% on a challenging clinical dataset of elderly sleep apnea patients. This represents a significant improvement over the starting point at 75% accuracy. The largest improvement gain is due to the deferral of uncertain epochs to a sleep expert. U-PASS presents a promising AI approach to incorporating uncertainty estimation in machine learning pipelines, improving their reliability and unlocking their potential in clinical settings.
Collapse
Affiliation(s)
- Elisabeth R M Heremans
- KU Leuven, Department of Electrical Engineering (ESAT), STADIUS Center for Dynamical Systems, Signal Processing and Data Analytics, Kasteelpark Arenberg 10, B-3001 Leuven, Belgium.
| | | | - Bertien Buyse
- UZ Leuven, Department of Pneumology, Herestraat 49, B-3000 Leuven, Belgium
| | - Dries Testelmans
- UZ Leuven, Department of Pneumology, Herestraat 49, B-3000 Leuven, Belgium
| | | | - Maarten De Vos
- KU Leuven, Department of Electrical Engineering (ESAT), STADIUS Center for Dynamical Systems, Signal Processing and Data Analytics, Kasteelpark Arenberg 10, B-3001 Leuven, Belgium.
| |
Collapse
|
6
|
Giddings R, Joseph A, Callender T, Janes SM, van der Schaar M, Sheringham J, Navani N. Factors influencing clinician and patient interaction with machine learning-based risk prediction models: a systematic review. Lancet Digit Health 2024; 6:e131-e144. [PMID: 38278615 DOI: 10.1016/s2589-7500(23)00241-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 10/20/2023] [Accepted: 11/14/2023] [Indexed: 01/28/2024]
Abstract
Machine learning (ML)-based risk prediction models hold the potential to support the health-care setting in several ways; however, use of such models is scarce. We aimed to review health-care professional (HCP) and patient perceptions of ML risk prediction models in published literature, to inform future risk prediction model development. Following database and citation searches, we identified 41 articles suitable for inclusion. Article quality varied with qualitative studies performing strongest. Overall, perceptions of ML risk prediction models were positive. HCPs and patients considered that models have the potential to add benefit in the health-care setting. However, reservations remain; for example, concerns regarding data quality for model development and fears of unintended consequences following ML model use. We identified that public views regarding these models might be more negative than HCPs and that concerns (eg, extra demands on workload) were not always borne out in practice. Conclusions are tempered by the low number of patient and public studies, the absence of participant ethnic diversity, and variation in article quality. We identified gaps in knowledge (particularly views from under-represented groups) and optimum methods for model explanation and alerts, which require future research.
Collapse
Affiliation(s)
- Rebecca Giddings
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK.
| | - Anabel Joseph
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
| | - Thomas Callender
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
| | - Sam M Janes
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
| | - Mihaela van der Schaar
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK; The Alan Turing Institute, London, UK
| | - Jessica Sheringham
- Department of Applied Health Research, University College London, London, UK
| | - Neal Navani
- Lungs for Living Research Centre, UCL Respiratory, University College London, London, UK
| |
Collapse
|
7
|
Islam N, van der Schaar M. Use of generative artificial intelligence in medical research. BMJ 2024; 384:q119. [PMID: 38296355 DOI: 10.1136/bmj.q119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/08/2024]
Affiliation(s)
- Nazrul Islam
- Faculty of Medicine, University of Southampton, Southampton, UK
| | | |
Collapse
|
8
|
Laferrière-Langlois P, Imrie F, Geraldo MA, Wingert T, Lahrichi N, van der Schaar M, Cannesson M. Novel Preoperative Risk Stratification Using Digital Phenotyping Applying a Scalable Machine-Learning Approach. Anesth Analg 2023:00000539-990000000-00672. [PMID: 38051671 DOI: 10.1213/ane.0000000000006753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/07/2023]
Abstract
BACKGROUND Classification of perioperative risk is important for patient care, resource allocation, and guiding shared decision-making. Using discriminative features from the electronic health record (EHR), machine-learning algorithms can create digital phenotypes among heterogenous populations, representing distinct patient subpopulations grouped by shared characteristics, from which we can personalize care, anticipate clinical care trajectories, and explore therapies. We hypothesized that digital phenotypes in preoperative settings are associated with postoperative adverse events including in-hospital and 30-day mortality, 30-day surgical redo, intensive care unit (ICU) admission, and hospital length of stay (LOS). METHODS We identified all laminectomies, colectomies, and thoracic surgeries performed over a 9-year period from a large hospital system. Seventy-seven readily extractable preoperative features were first selected from clinical consensus, including demographics, medical history, and lab results. Three surgery-specific datasets were built and split into derivation and validation cohorts using chronological occurrence. Consensus k -means clustering was performed independently on each derivation cohort, from which phenotypes' characteristics were explored. Cluster assignments were used to train a random forest model to assign patient phenotypes in validation cohorts. We reconducted descriptive analyses on validation cohorts to confirm the similarity of patient characteristics with derivation cohorts, and quantified the association of each phenotype with postoperative adverse events by using the area under receiver operating characteristic curve (AUROC). We compared our approach to American Society of Anesthesiologists (ASA) alone and investigated a combination of our phenotypes with the ASA score. RESULTS A total of 7251 patients met inclusion criteria, of which 2770 were held out in a validation dataset based on chronological occurrence. Using segmentation metrics and clinical consensus, 3 distinct phenotypes were created for each surgery. The main features used for segmentation included urgency of the procedure, preoperative LOS, age, and comorbidities. The most relevant characteristics varied for each of the 3 surgeries. Low-risk phenotype alpha was the most common (2039 of 2770, 74%), while high-risk phenotype gamma was the rarest (302 of 2770, 11%). Adverse outcomes progressively increased from phenotypes alpha to gamma, including 30-day mortality (0.3%, 2.1%, and 6.0%, respectively), in-hospital mortality (0.2%, 2.3%, and 7.3%), and prolonged hospital LOS (3.4%, 22.1%, and 25.8%). When combined with the ASA score, digital phenotypes achieved higher AUROC than the ASA score alone (hospital mortality: 0.91 vs 0.84; prolonged hospitalization: 0.80 vs 0.71). CONCLUSIONS For 3 frequently performed surgeries, we identified 3 digital phenotypes. The typical profiles of each phenotype were described and could be used to anticipate adverse postoperative events.
Collapse
Affiliation(s)
- Pascal Laferrière-Langlois
- From the Department of Anesthesiology and Perioperative Medicine, UCLA David Geffen School of Medicine, Los Angeles, California
- Department of Mathematics and Industrial Engineering, Polytechnique Montreal, Montréal, Québec, Canada
- Maisonneuve-Rosemont Hospital Research Center, Montréal, Québec, Canada
- Department of Anesthesiology and Pain Medicine, Maisonneuve-Rosemont Hospital, Centre intégré universitaire de santé et service sociaux de l'Est de L'Ile de Montréal, Montréal, Québec, Canada
| | - Fergus Imrie
- Department of Electrical and Computer Engineering, University of California in Los Angeles, Los Angeles, California
| | - Marc-Andre Geraldo
- Department of Mathematics and Industrial Engineering, Polytechnique Montreal, Montréal, Québec, Canada
- Maisonneuve-Rosemont Hospital Research Center, Montréal, Québec, Canada
| | - Theodora Wingert
- From the Department of Anesthesiology and Perioperative Medicine, UCLA David Geffen School of Medicine, Los Angeles, California
| | - Nadia Lahrichi
- Department of Mathematics and Industrial Engineering, Polytechnique Montreal, Montréal, Québec, Canada
| | - Mihaela van der Schaar
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom
- The Alan Turing Institute, London, United Kingdom
| | - Maxime Cannesson
- From the Department of Anesthesiology and Perioperative Medicine, UCLA David Geffen School of Medicine, Los Angeles, California
| |
Collapse
|
9
|
Stankevičiūtė K, Woillard JB, Peck RW, Marquet P, van der Schaar M. Bridging the Worlds of Pharmacometrics and Machine Learning. Clin Pharmacokinet 2023; 62:1551-1565. [PMID: 37803104 DOI: 10.1007/s40262-023-01310-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/19/2023] [Indexed: 10/08/2023]
Abstract
Precision medicine requires individualized modeling of disease and drug dynamics, with machine learning-based computational techniques gaining increasing popularity. The complexity of either field, however, makes current pharmacological problems opaque to machine learning practitioners, and state-of-the-art machine learning methods inaccessible to pharmacometricians. To help bridge the two worlds, we provide an introduction to current problems and techniques in pharmacometrics that ranges from pharmacokinetic and pharmacodynamic modeling to pharmacometric simulations, model-informed precision dosing, and systems pharmacology, and review some of the machine learning approaches to address them. We hope this would facilitate collaboration between experts, with complementary strengths of principled pharmacometric modeling and flexibility of machine learning leading to synergistic effects in pharmacological applications.
Collapse
Affiliation(s)
- Kamilė Stankevičiūtė
- Department of Computer Science and Technology, University of Cambridge, 15 JJ Thomson Avenue, Cambridge, CB3 0FD, UK
| | - Jean-Baptiste Woillard
- INSERM U1248 P&T, University of Limoges, 2 rue du Pr Descottes, 87000, Limoges, France.
- Department of Pharmacology and Toxicology, CHU Limoges, Limoges, France.
| | - Richard W Peck
- Department of Pharmacology and Therapeutics, University of Liverpool, Liverpool, UK
- Pharma Research and Development, Roche Innovation Center, Basel, Switzerland
| | - Pierre Marquet
- INSERM U1248 P&T, University of Limoges, 2 rue du Pr Descottes, 87000, Limoges, France
- Department of Pharmacology and Toxicology, CHU Limoges, Limoges, France
| | - Mihaela van der Schaar
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
- The Alan Turing Institute, London, UK
| |
Collapse
|
10
|
Callender T, Imrie F, Cebere B, Pashayan N, Navani N, van der Schaar M, Janes SM. Assessing eligibility for lung cancer screening using parsimonious ensemble machine learning models: A development and validation study. PLoS Med 2023; 20:e1004287. [PMID: 37788223 PMCID: PMC10547178 DOI: 10.1371/journal.pmed.1004287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 08/29/2023] [Indexed: 10/05/2023] Open
Abstract
BACKGROUND Risk-based screening for lung cancer is currently being considered in several countries; however, the optimal approach to determine eligibility remains unclear. Ensemble machine learning could support the development of highly parsimonious prediction models that maintain the performance of more complex models while maximising simplicity and generalisability, supporting the widespread adoption of personalised screening. In this work, we aimed to develop and validate ensemble machine learning models to determine eligibility for risk-based lung cancer screening. METHODS AND FINDINGS For model development, we used data from 216,714 ever-smokers recruited between 2006 and 2010 to the UK Biobank prospective cohort and 26,616 high-risk ever-smokers recruited between 2002 and 2004 to the control arm of the US National Lung Screening (NLST) randomised controlled trial. The NLST trial randomised high-risk smokers from 33 US centres with at least a 30 pack-year smoking history and fewer than 15 quit-years to annual CT or chest radiography screening for lung cancer. We externally validated our models among 49,593 participants in the chest radiography arm and all 80,659 ever-smoking participants in the US Prostate, Lung, Colorectal and Ovarian (PLCO) Screening Trial. The PLCO trial, recruiting from 1993 to 2001, analysed the impact of chest radiography or no chest radiography for lung cancer screening. We primarily validated in the PLCO chest radiography arm such that we could benchmark against comparator models developed within the PLCO control arm. Models were developed to predict the risk of 2 outcomes within 5 years from baseline: diagnosis of lung cancer and death from lung cancer. We assessed model discrimination (area under the receiver operating curve, AUC), calibration (calibration curves and expected/observed ratio), overall performance (Brier scores), and net benefit with decision curve analysis. Models predicting lung cancer death (UCL-D) and incidence (UCL-I) using 3 variables-age, smoking duration, and pack-years-achieved or exceeded parity in discrimination, overall performance, and net benefit with comparators currently in use, despite requiring only one-quarter of the predictors. In external validation in the PLCO trial, UCL-D had an AUC of 0.803 (95% CI: 0.783, 0.824) and was well calibrated with an expected/observed (E/O) ratio of 1.05 (95% CI: 0.95, 1.19). UCL-I had an AUC of 0.787 (95% CI: 0.771, 0.802), an E/O ratio of 1.0 (95% CI: 0.92, 1.07). The sensitivity of UCL-D was 85.5% and UCL-I was 83.9%, at 5-year risk thresholds of 0.68% and 1.17%, respectively, 7.9% and 6.2% higher than the USPSTF-2021 criteria at the same specificity. The main limitation of this study is that the models have not been validated outside of UK and US cohorts. CONCLUSIONS We present parsimonious ensemble machine learning models to predict the risk of lung cancer in ever-smokers, demonstrating a novel approach that could simplify the implementation of risk-based lung cancer screening in multiple settings.
Collapse
Affiliation(s)
- Thomas Callender
- Department of Respiratory Medicine, University College London, London, United Kingdom
| | - Fergus Imrie
- Department of Electrical and Computer Engineering, University of California, Los Angeles, California, United States of America
| | - Bogdan Cebere
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom
| | - Nora Pashayan
- Department of Applied Health Research, University College London, London, United Kingdom
| | - Neal Navani
- Department of Respiratory Medicine, University College London, London, United Kingdom
| | - Mihaela van der Schaar
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom
- Cambridge Centre for AI in Medicine, University of Cambridge, Cambridge, United Kingdom
- Alan Turing Institute, London, United Kingdom
| | - Sam M. Janes
- Department of Respiratory Medicine, University College London, London, United Kingdom
| |
Collapse
|
11
|
Abroshan M, Yip KH, Tekin C, van der Schaar M. Conservative Policy Construction Using Variational Autoencoders for Logged Data With Missing Values. IEEE Trans Neural Netw Learn Syst 2023; 34:6368-6378. [PMID: 35007201 DOI: 10.1109/tnnls.2021.3136385] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
In high-stakes applications of data-driven decision-making such as healthcare, it is of paramount importance to learn a policy that maximizes the reward while avoiding potentially dangerous actions when there is uncertainty. There are two main challenges usually associated with this problem. First, learning through online exploration is not possible due to the critical nature of such applications. Therefore, we need to resort to observational datasets with no counterfactuals. Second, such datasets are usually imperfect, additionally cursed with missing values in the attributes of features. In this article, we consider the problem of constructing personalized policies using logged data when there are missing values in the attributes of features in both training and test data. The goal is to recommend an action (treatment) when ~ X , a degraded version of X with missing values, is observed. We consider three strategies for dealing with missingness. In particular, we introduce the conservative strategy where the policy is designed to safely handle the uncertainty due to missingness. In order to implement this strategy, we need to estimate posterior distribution p(X| ~ X) and use a variational autoencoder to achieve this. In particular, our method is based on partial variational autoencoders (PVAEs) that are designed to capture the underlying structure of features with missing values.
Collapse
|
12
|
Bergami M, Manfrini O, Nava S, Caramori G, Yoon J, Badimon L, Cenko E, David A, Demiri I, Dorobantu M, Fabin N, Gheorghe‐Fronea O, Jankovic R, Kedev S, Ladjevic N, Lasica R, Loncar G, Mancuso G, Mendieta G, Miličić D, Mjehović P, Pašalić M, Petrović M, Poposka L, Scarpone M, Stefanovic M, van der Schaar M, Vasiljevic Z, Vavlukis M, Vega Pittao ML, Vukomanovic V, Zdravkovic M, Bugiardini R. Relationship Between Azithromycin and Cardiovascular Outcomes in Unvaccinated Patients With COVID-19 and Preexisting Cardiovascular Disease. J Am Heart Assoc 2023; 12:e028939. [PMID: 37449568 PMCID: PMC10382084 DOI: 10.1161/jaha.122.028939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 05/05/2023] [Indexed: 07/18/2023]
Abstract
Background Empiric antimicrobial therapy with azithromycin is highly used in patients admitted to the hospital with COVID-19, despite prior research suggesting that azithromycin may be associated with increased risk of cardiovascular events. Methods and Results This study was conducted using data from the ISACS-COVID-19 (International Survey of Acute Coronavirus Syndromes-COVID-19) registry. Patients with a confirmed diagnosis of SARS-CoV-2 infection were eligible for inclusion. The study included 793 patients exposed to azithromycin within 24 hours from hospital admission and 2141 patients who received only standard care. The primary exposure was cardiovascular disease (CVD). Main outcome measures were 30-day mortality and acute heart failure (AHF). Among 2934 patients, 1066 (36.4%) had preexisting CVD. A total of 617 (21.0%) died, and 253 (8.6%) had AHF. Azithromycin therapy was consistently associated with an increased risk of AHF in patients with preexisting CVD (risk ratio [RR], 1.48 [95% CI, 1.06-2.06]). Receiving azithromycin versus standard care was not significantly associated with death (RR, 0.94 [95% CI, 0.69-1.28]). By contrast, we found significantly reduced odds of death (RR, 0.57 [95% CI, 0.42-0.79]) and no significant increase in AHF (RR, 1.23 [95% CI, 0.75-2.04]) in patients without prior CVD. The relative risks of death from the 2 subgroups were significantly different from each other (Pinteraction=0.01). Statistically significant association was observed between AHF and death (odds ratio, 2.28 [95% CI, 1.34-3.90]). Conclusions These findings suggest that azithromycin use in patients with COVID-19 and prior history of CVD is significantly associated with an increased risk of AHF and all-cause 30-day mortality. Registration URL: https://www.clinicaltrials.gov; Unique identifier: NCT05188612.
Collapse
Affiliation(s)
- Maria Bergami
- Department of Medical and Surgical SciencesUniversity of BolognaBolognaItaly
| | - Olivia Manfrini
- Department of Medical and Surgical SciencesUniversity of BolognaBolognaItaly
- IRCCS Azienda Ospedaliero‐Universitaria di Bologna Sant’Orsola HospitalBolognaItaly
| | - Stefano Nava
- Department of Medical and Surgical SciencesUniversity of BolognaBolognaItaly
- IRCCS Azienda Ospedaliero‐Universitaria di BolognaRespiratory and Critical Care UnitBolognaItaly
| | - Gaetano Caramori
- Pneumologia, Dipartimento di Scienze Biomediche, Odontoiatriche e delle Immagini Morfologiche e Funzionali (BIOMORF)University of MessinaMessinaItaly
| | | | - Lina Badimon
- Cardiovascular Research Program ICCCIR‐IIB Sant Pau, Hospital de la Santa Creu i Sant Pau, CiberCV‐Institute Carlos IIIBarcelonaSpain
| | - Edina Cenko
- Department of Medical and Surgical SciencesUniversity of BolognaBolognaItaly
| | - Antonio David
- Department of Human Pathology of the Adult and Evolutive Age “Gaetano Barresi”, Division of Anesthesia and Critical CareUniversity of MessinaMessinaItaly
| | - Ilir Demiri
- University Clinic of Infectious DiseasesUniversity "Ss. Cyril and Methodius"SkopjeNorth Macedonia
| | - Maria Dorobantu
- "Carol Davila" University of Medicine and PharmacyBucharestRomania
| | - Natalia Fabin
- Department of Medical and Surgical SciencesUniversity of BolognaBolognaItaly
| | | | | | - Sasko Kedev
- University Clinic for CardiologySkopjeRepublic of North Macedonia
- Faculty of MedicineSs. Cyril and Methodius University in SkopjeSkopjeRepublic of North Macedonia
| | - Nebojsa Ladjevic
- Faculty of MedicineUniversity of Belgrade, University Clinical centre of SerbiaBelgradeSerbia
| | - Ratko Lasica
- Clinical Center of SerbiaUniversity of BelgradeBelgradeSerbia
| | - Goran Loncar
- Institute for Cardiovascular Diseases DedinjeBelgradeSerbia
| | - Giuseppe Mancuso
- Medical Microbiology, Department of Human PathologyUniversity of MessinaMessinaItaly
| | - Guiomar Mendieta
- Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC)MadridSpain
- Servicio de Cardiología, Institut Clínic Cardiovascular, Hospital Clínic de BarcelonaBarcelonaSpain
- Department for Cardiovascular DiseasesUniversity Hospital Center Zagreb, University of ZagrebZagrebCroatia
| | - Davor Miličić
- Institute of Cardiovascular Diseases of Vojvodina, Sremska Kamenica, Faculty of Medicine Novi SadUniversity of Novi SadNovi SadSerbia
| | - Petra Mjehović
- Institute of Cardiovascular Diseases of Vojvodina, Sremska Kamenica, Faculty of Medicine Novi SadUniversity of Novi SadNovi SadSerbia
| | - Marijan Pašalić
- Institute of Cardiovascular Diseases of Vojvodina, Sremska Kamenica, Faculty of Medicine Novi SadUniversity of Novi SadNovi SadSerbia
| | - Milovan Petrović
- Department of Electrical and Computer EngineeringUniversity of CaliforniaCALos AngelesUSA
| | - Lidija Poposka
- University Clinic for CardiologySkopjeRepublic of North Macedonia
- Faculty of MedicineSs. Cyril and Methodius University in SkopjeSkopjeRepublic of North Macedonia
| | - Marialuisa Scarpone
- Department of Medical and Surgical SciencesUniversity of BolognaBolognaItaly
| | - Milena Stefanovic
- University Clinic of Infectious DiseasesUniversity "Ss. Cyril and Methodius"SkopjeNorth Macedonia
| | - Mihaela van der Schaar
- Cambridge Centre for Artificial Intelligence in Medicine, Department of Applied Mathematics and Theoretical Physics and Department of Population HealthUniversity of CambridgeCambridgeUnited Kingdom
- Medical FacultyUniversity of BelgradeBelgradeSerbia
| | | | - Marija Vavlukis
- University Clinic for CardiologySkopjeRepublic of North Macedonia
- Faculty of MedicineSs. Cyril and Methodius University in SkopjeSkopjeRepublic of North Macedonia
| | - Maria Laura Vega Pittao
- Department of Medical and Surgical SciencesUniversity of BolognaBolognaItaly
- IRCCS Azienda Ospedaliero‐Universitaria di BolognaRespiratory and Critical Care UnitBolognaItaly
| | - Vladan Vukomanovic
- Faculty of MedicineUniversity of Belgrade, Clinical Hospital Center Bezanijska kosaBelgradeSerbia
| | - Marija Zdravkovic
- Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS)BarcelonaSpain
| | - Raffaele Bugiardini
- Department of Medical and Surgical SciencesUniversity of BolognaBolognaItaly
| |
Collapse
|
13
|
Imrie F, Cebere B, McKinney EF, van der Schaar M. AutoPrognosis 2.0: Democratizing diagnostic and prognostic modeling in healthcare with automated machine learning. PLOS Digit Health 2023; 2:e0000276. [PMID: 37347752 DOI: 10.1371/journal.pdig.0000276] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 05/17/2023] [Indexed: 06/24/2023]
Abstract
Diagnostic and prognostic models are increasingly important in medicine and inform many clinical decisions. Recently, machine learning approaches have shown improvement over conventional modeling techniques by better capturing complex interactions between patient covariates in a data-driven manner. However, the use of machine learning introduces technical and practical challenges that have thus far restricted widespread adoption of such techniques in clinical settings. To address these challenges and empower healthcare professionals, we present an open-source machine learning framework, AutoPrognosis 2.0, to facilitate the development of diagnostic and prognostic models. AutoPrognosis leverages state-of-the-art advances in automated machine learning to develop optimized machine learning pipelines, incorporates model explainability tools, and enables deployment of clinical demonstrators, without requiring significant technical expertise. To demonstrate AutoPrognosis 2.0, we provide an illustrative application where we construct a prognostic risk score for diabetes using the UK Biobank, a prospective study of 502,467 individuals. The models produced by our automated framework achieve greater discrimination for diabetes than expert clinical risk scores. We have implemented our risk score as a web-based decision support tool, which can be publicly accessed by patients and clinicians. By open-sourcing our framework as a tool for the community, we aim to provide clinicians and other medical practitioners with an accessible resource to develop new risk scores, personalized diagnostics, and prognostics using machine learning techniques. Software: https://github.com/vanderschaarlab/AutoPrognosis.
Collapse
Affiliation(s)
- Fergus Imrie
- Department of Electrical and Computer Engineering, University of California, Los Angeles, California, United States of America
| | - Bogdan Cebere
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom
| | - Eoin F McKinney
- Department of Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Mihaela van der Schaar
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom
- The Alan Turing Institute, London, United Kingdom
| |
Collapse
|
14
|
Callender T, van der Schaar M. Automated machine learning as a partner in predictive modelling. Lancet Digit Health 2023; 5:e254-e256. [PMID: 37100541 DOI: 10.1016/s2589-7500(23)00054-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Accepted: 02/23/2023] [Indexed: 04/28/2023]
Affiliation(s)
- Thomas Callender
- Department of Respiratory Medicine, University College London, London WC1E 6JF, UK.
| | - Mihaela van der Schaar
- Department of Applied Mathematics and Theoretical Physics and Cambridge Centre for AI in Medicine, University of Cambridge, Cambridge, UK
| |
Collapse
|
15
|
Chan A, Peck R, Gibbs M, van der Schaar M. Synthetic Model Combination: A new machine learning method for pharmacometric model ensembling. CPT Pharmacometrics Syst Pharmacol 2023. [PMID: 37042155 DOI: 10.1002/psp4.12965] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 02/20/2023] [Accepted: 03/20/2023] [Indexed: 04/13/2023] Open
Abstract
When aiming to make predictions over targets in the pharmacological setting, a data-focussed approach aims to learn models based on a collection of labelled examples. Unfortunately, data sharing is not always possible, and this can result in many different models trained on disparate populations, leading to the natural question of how best to use and combine them when making a new prediction. Previous work has focused on global model selection or ensembling, with the result of a single final model across the feature space. Machine learning models perform notoriously poorly on data outside their training domain however due to a problem known as covariate shift, and so we argue that when ensembling models the weightings for individual instances must reflect their respective domains - in other words models that are more likely to have seen information on that instance should have more attention paid to them. We introduce a method for such an instance-wise ensembling of models called Synthetic Model Combination (SMC), including a novel representation learning step for handling sparse high-dimensional domains. We demonstrate the use of SMC on an example with dosing predictions for Vancomycin, although emphasise the applicability of the method to any scenario involving the use of multiple models.
Collapse
Affiliation(s)
- Alexander Chan
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, UK
| | - Richard Peck
- Pharma Research and Development (pRED), Roche Innovation Center, Basel, Switzerland; and Department of Pharmacology & Therapeutics, University of Liverpool, Liverpool, UK
| | - Megan Gibbs
- Clinical Pharmacology and Quantitative Pharmacology, Clinical Pharmacology and Safety Sciences, R&D, AstraZeneca, Gaithersburg, Maryland, USA
| | | |
Collapse
|
16
|
Shah AA, Devana SK, Lee C, Olson TE, Upfill-Brown A, Sheppard WL, Lord EL, Shamie AN, van der Schaar M, SooHoo NF, Park DY. Development and External Validation of a Risk Calculator for Prediction of Major Complications and Readmission After Anterior Cervical Discectomy and Fusion. Spine (Phila Pa 1976) 2023; 48:460-467. [PMID: 36730869 PMCID: PMC10023283 DOI: 10.1097/brs.0000000000004531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 10/22/2022] [Indexed: 02/04/2023]
Abstract
STUDY DESIGN A retrospective, case-control study. OBJECTIVE We aim to build a risk calculator predicting major perioperative complications after anterior cervical fusion. In addition, we aim to externally validate this calculator with an institutional cohort of patients who underwent anterior cervical discectomy and fusion (ACDF). SUMMARY OF BACKGROUND DATA The average age and proportion of patients with at least one comorbidity undergoing ACDF have increased in recent years. Given the increased morbidity and cost associated with perioperative complications and unplanned readmission, accurate risk stratification of patients undergoing ACDF is of great clinical utility. METHODS This is a retrospective cohort study of adults who underwent anterior cervical fusion at any nonfederal California hospital between 2015 and 2017. The primary outcome was major perioperative complication or 30-day readmission. We built standard and ensemble machine learning models for risk prediction, assessing discrimination, and calibration. The best-performing model was validated on an external cohort comprised of consecutive adult patients who underwent ACDF at our institution between 2013 and 2020. RESULTS A total of 23,184 patients were included in this study; there were 1886 cases of major complication or readmissions. The ensemble model was well calibrated and demonstrated an area under the receiver operating characteristic curve of 0.728. The variables most important for the ensemble model include male sex, medical comorbidities, history of complications, and teaching hospital status. The ensemble model was evaluated on the validation cohort (n=260) with an area under the receiver operating characteristic curve of 0.802. The ensemble algorithm was used to build a web-based risk calculator. CONCLUSION We report derivation and external validation of an ensemble algorithm for prediction of major perioperative complications and 30-day readmission after anterior cervical fusion. This model has excellent discrimination and is well calibrated when tested on a contemporaneous external cohort of ACDF cases.
Collapse
Affiliation(s)
- Akash A. Shah
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Sai K. Devana
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Changhee Lee
- Department of Artificial Intelligence, Chung-Ang University, Seoul, South Korea
| | - Thomas E. Olson
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Alexander Upfill-Brown
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - William L. Sheppard
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Elizabeth L. Lord
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Arya N. Shamie
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Mihaela van der Schaar
- Department of Electrical and Computer Engineering, University of California, Los Angeles, CA
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom
| | - Nelson F. SooHoo
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Don Y. Park
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| |
Collapse
|
17
|
Alaa AM, Harris AL, van der Schaar M. Matters Arising: PREDICT underestimates survival of patients with HER2-positive early-stage breast cancer. NPJ Breast Cancer 2023; 9:13. [PMID: 36928829 PMCID: PMC10020558 DOI: 10.1038/s41523-023-00514-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Accepted: 02/08/2023] [Indexed: 03/18/2023] Open
Affiliation(s)
- Ahmed M Alaa
- University of California, Berkeley, Berkeley, CA, USA. .,University of California, San Francisco, San Francisco, CA, USA.
| | | | - Mihaela van der Schaar
- University of Cambridge, Cambridge, UK.,University of California, Los Angeles, Los Angeles, CA, USA
| |
Collapse
|
18
|
Johnson M, Patel M, Phipps A, van der Schaar M, Boulton D, Gibbs M. The potential and pitfalls of artificial intelligence in clinical pharmacology. CPT Pharmacometrics Syst Pharmacol 2023; 12:279-284. [PMID: 36717763 PMCID: PMC10014043 DOI: 10.1002/psp4.12902] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 11/30/2022] [Accepted: 12/05/2022] [Indexed: 02/01/2023] Open
Affiliation(s)
- Martin Johnson
- Clinical Pharmacology and Quantitative Pharmacology, Clinical Pharmacology and Safety Science, R&D, AstraZeneca, Cambridge, UK
| | - Mishal Patel
- Clinical Pharmacology and Quantitative Pharmacology, Artificial Intelligence & Data Analytics, R&D, AstraZeneca, Cambridge, UK
| | - Alex Phipps
- Clinical Pharmacology and Quantitative Pharmacology, Clinical Pharmacology and Safety Science, R&D, AstraZeneca, Cambridge, UK
| | - Mihaela van der Schaar
- Cambridge Centre for Artificial Intelligence in Medicine, Department of Applied Mathematics and Theoretical Physics and Department of Population Health, University of Cambridge, Cambridge, UK
| | - Dave Boulton
- Clinical Pharmacology and Quantitative Pharmacology, Clinical Pharmacology & Safety Sciences, R&D, AstraZeneca, Gaithersburg, Maryland, USA
| | - Megan Gibbs
- Clinical Pharmacology and Quantitative Pharmacology, Clinical Pharmacology & Safety Sciences, R&D, AstraZeneca, Gaithersburg, Maryland, USA
| |
Collapse
|
19
|
Shah AA, Devana SK, Lee C, Bugarin A, Hong MK, Upfill-Brown A, Blumstein G, Lord EL, Shamie AN, van der Schaar M, SooHoo NF, Park DY. Corrigendum to "A Risk Calculator for the Prediction of C5 Nerve Root Palsy After Instrumented Cervical Fusion" [World Neurosurgery 166 (2022), e703-e710]. World Neurosurg 2023; 173:282. [PMID: 36914515 DOI: 10.1016/j.wneu.2023.02.054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
Affiliation(s)
- Akash A Shah
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA.
| | - Sai K Devana
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Changhee Lee
- Department of Artificial Intelligence, Chung-Ang University, Seoul, South Korea
| | - Amador Bugarin
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Michelle K Hong
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Alexander Upfill-Brown
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Gideon Blumstein
- Department of Orthopaedic Surgery, Rush University Medical Center, Chicago, Illinois, USA
| | - Elizabeth L Lord
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Arya N Shamie
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Mihaela van der Schaar
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom; Department of Electrical & Computer Engineering, UCLA, Los Angeles, California, USA
| | - Nelson F SooHoo
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Don Y Park
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| |
Collapse
|
20
|
Bugiardini R, Nava S, Caramori G, Yoon J, Badimon L, Bergami M, Cenko E, David A, Demiri I, Dorobantu M, Fronea O, Jankovic R, Kedev S, Ladjevic N, Lasica R, Loncar G, Mancuso G, Mendieta G, Miličić D, Mjehović P, Pašalić M, Petrović M, Poposka L, Scarpone M, Stefanovic M, van der Schaar M, Vasiljevic Z, Vavlukis M, Vega Pittao ML, Vukomanovic V, Zdravkovic M, Manfrini O. Sex differences and disparities in cardiovascular outcomes of COVID-19. Cardiovasc Res 2023; 119:1190-1201. [PMID: 36651866 DOI: 10.1093/cvr/cvad011] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Revised: 08/24/2022] [Accepted: 11/20/2022] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND Previous analyses on sex differences in case fatality rates at population-level data had limited adjustment for key patient clinical characteristics thought to be associated with COVID-19 outcomes. We aimed to estimate the risk of specific organ dysfunctions and mortality in women and men. METHODS AND RESULTS This retrospective cross-sectional study included 17 hospitals within 5 European countries participating in the International Survey of Acute Coronavirus Syndromes (ISACS) COVID-19(NCT05188612). Participants were individuals hospitalized with positive SARS-CoV-2 from March 2020 to February 2022. Risk-adjusted ratios(RR) of in-hospital mortality, acute respiratory failure(ARF), acute heart failure(AHF), and acute kidney injury(AKI) were calculated for women versus men. Estimates were evaluated by inverse probability of weighting and logistic regression models. The overall care cohort included 4,499 patients with COVID-19 associated hospitalizations. Of these, 1,524(33.9%) were admitted to ICU, and 1,117(24.8%) died during hospitalization. Compared with men, women were less likely to be admitted to ICU (RR:0.80; 95%CI: 0.71-0.91). In general wards (GW) and ICU cohorts, the adjusted women-to-men RRs for in-hospital mortality were of 1.13(95%CI: 0.90-1.42) and 0.86(95%CI: 0.70-1.05; pinteraction=0.04). Development of AHF, AKI and ARF was associated with increased mortality risk (ORs: 2.27; 95%CI; 1.73-2.98,3.85; 95%CI:3.21-4.63 and 3.95; 95%CI:3.04-5.14, respectively). The adjusted RRs for AKI and ARF were comparable among women and men regardless of intensity of care. By contrast, female sex was associated with higher odds for AHF in GW, but not in ICU (RRs:1.25; 95%CI0.94-1.67 versus 0.83; 95%CI:0.59-1.16, pinteraction=0.04). CONCLUSIONS Women in GW were at increased risk of AHF and in-hospital mortality for COVID-19 compared with men. For patients receiving ICU care, fatal complications including AHF and mortality appeared to be independent of sex. Equitable access to COVID-19 ICU care is needed to minimize the unfavourable outcome of women presenting with COVID-19 related complications.
Collapse
Affiliation(s)
- Raffaele Bugiardini
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Stefano Nava
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy.,Respiratory and Critical Care Unit, IRCCS Azienda Ospedaliero-Universitaria di Bologna, St Orsola University Hospital, Bologna, Italy
| | - Gaetano Caramori
- Pneumologia, Dipartimento di Scienze Biomediche, Odontoiatriche e delle Immagini Morfologiche e Funzionali (BIOMORF), University of Messina, Italy
| | | | - Lina Badimon
- Cardiovascular Research Program ICCC, IR-IIB Sant Pau, Hospital de la Santa Creu i Sant Pau, CiberCV-Institute Carlos III, Barcelona, Spain
| | - Maria Bergami
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Edina Cenko
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Antonio David
- Unit of Emergency Medicine - A.O.U. Policlinico G. Martino, Messina, Italy
| | - Ilir Demiri
- University Clinic of Infectious Diseases, University "Ss. Cyril and Methodius", Skopje, North Macedonia
| | - Maria Dorobantu
- Emergency Clinical Hospital of Bucharest, "Carol Davila" University of Medicine and Pharmacy, Bucharest, Romania
| | - Oana Fronea
- Emergency Clinical Hospital of Bucharest, "Carol Davila" University of Medicine and Pharmacy, Bucharest, Romania
| | | | - Sasko Kedev
- University Clinic of Cardiology, Medical Faculty, University "Ss. Cyril and Methodius", Skopje, Macedonia
| | - Nebojsa Ladjevic
- Clinic for Anaesthesia, Covid Hospital Batajnica, Belgrade, Serbia
| | - Ratko Lasica
- Clinical Center of Serbia, University of Belgrade, Belgrade, Serbia
| | - Goran Loncar
- Institute for Cardiovascular Diseases Dedinje, Belgrade, Serbia
| | - Giuseppe Mancuso
- Medical Microbiology, Department of Human Pathology, University of Messina, Italy
| | - Guiomar Mendieta
- Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), Madrid, Spain
| | - Davor Miličić
- Department for Cardiovascular Diseases, University Hospital Center Zagreb, University of Zagreb, Zagreb, Croatia
| | - Petra Mjehović
- Department for Cardiovascular Diseases, University Hospital Center Zagreb, University of Zagreb, Zagreb, Croatia
| | - Marijan Pašalić
- Department for Cardiovascular Diseases, University Hospital Center Zagreb, University of Zagreb, Zagreb, Croatia
| | - Milovan Petrović
- Institute for Cardiovascular Diseases Sremska Kamenica, Novi Sad, Serbia
| | - Lidija Poposka
- University Clinic of Cardiology, Medical Faculty, University "Ss. Cyril and Methodius", Skopje, Macedonia
| | - Marialuisa Scarpone
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Milena Stefanovic
- University Clinic of Infectious Diseases, University "Ss. Cyril and Methodius", Skopje, North Macedonia
| | - Mihaela van der Schaar
- Department of Electrical and Computer Engineering, University of California, Los Angeles.,Cambridge Centre for Artificial Intelligence in Medicine, Department of Applied Mathematics and Theoretical Physics and Department of Population Health, University of Cambridge, Cambridge, United Kingdom
| | | | - Marija Vavlukis
- University Clinic of Cardiology, Medical Faculty, University "Ss. Cyril and Methodius", Skopje, Macedonia
| | - Maria Laura Vega Pittao
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | | | - Marija Zdravkovic
- University Hospital Medical Center Bezanijska Kosa, Belgrade, Medical Faculty, University of Belgrade, Serbia
| | - Olivia Manfrini
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| |
Collapse
|
21
|
Qin Y, Alaa A, Floto A, van der Schaar M. External validity of machine learning-based prognostic scores for cystic fibrosis: A retrospective study using the UK and Canadian registries. PLOS Digit Health 2023; 2:e0000179. [PMID: 36812602 PMCID: PMC9931238 DOI: 10.1371/journal.pdig.0000179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 12/08/2022] [Indexed: 01/14/2023]
Abstract
Precise and timely referral for lung transplantation is critical for the survival of cystic fibrosis patients with terminal illness. While machine learning (ML) models have been shown to achieve significant improvement in prognostic accuracy over current referral guidelines, the external validity of these models and their resulting referral policies has not been fully investigated. Here, we studied the external validity of machine learning-based prognostic models using annual follow-up data from the UK and Canadian Cystic Fibrosis Registries. Using a state-of-the-art automated ML framework, we derived a model for predicting poor clinical outcomes in patients enrolled in the UK registry, and conducted external validation of the derived model using the Canadian Cystic Fibrosis Registry. In particular, we studied the effect of (1) natural variations in patient characteristics across populations and (2) differences in clinical practice on the external validity of ML-based prognostic scores. Overall, decrease in prognostic accuracy on the external validation set (AUCROC: 0.88, 95% CI 0.88-0.88) was observed compared to the internal validation accuracy (AUCROC: 0.91, 95% CI 0.90-0.92). Based on our ML model, analysis on feature contributions and risk strata revealed that, while external validation of ML models exhibited high precision on average, both factors (1) and (2) can undermine the external validity of ML models in patient subgroups with moderate risk for poor outcomes. A significant boost in prognostic power (F1 score) from 0.33 (95% CI 0.31-0.35) to 0.45 (95% CI 0.45-0.45) was observed in external validation when variations in these subgroups were accounted in our model. Our study highlighted the significance of external validation of ML models for cystic fibrosis prognostication. The uncovered insights on key risk factors and patient subgroups can be used to guide the cross-population adaptation of ML-based models and inspire new research on applying transfer learning methods for fine-tuning ML models to cope with regional variations in clinical care.
Collapse
Affiliation(s)
- Yuchao Qin
- University of Cambridge, Cambridge, United Kingdom
- * E-mail:
| | - Ahmed Alaa
- University of California Berkeley, Berkeley, California, United States of America
- University of California San Francisco, San Francisco, California, United States of America
| | - Andres Floto
- University of Cambridge, Cambridge, United Kingdom
| | - Mihaela van der Schaar
- University of Cambridge, Cambridge, United Kingdom
- Alan Turing Institute, London, United Kingdom
- University of California Los Angeles, Los Angeles, California, United States of America
| |
Collapse
|
22
|
Shah AA, Devana SK, Lee C, Bugarin A, Hong MK, Upfill-Brown A, Blumstein G, Lord EL, Shamie AN, van der Schaar M, SooHoo NF, Park DY. A Risk Calculator for the Prediction of C5 Nerve Root Palsy After Instrumented Cervical Fusion. World Neurosurg 2022; 166:e703-e710. [PMID: 35872129 PMCID: PMC10410645 DOI: 10.1016/j.wneu.2022.07.082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 07/17/2022] [Indexed: 12/15/2022]
Abstract
BACKGROUND C5 palsy is a common postoperative complication after cervical fusion and is associated with increased health care costs and diminished quality of life. Accurate prediction of C5 palsy may allow for appropriate preoperative counseling and risk stratification. We primarily aim to develop an algorithm for the prediction of C5 palsy after instrumented cervical fusion and identify novel features for risk prediction. Additionally, we aim to build a risk calculator to provide the risk of C5 palsy. METHODS We identified adult patients who underwent instrumented cervical fusion at a tertiary care medical center between 2013 and 2020. The primary outcome was postoperative C5 palsy. We developed ensemble machine learning, standard machine learning, and logistic regression models predicting the risk of C5 palsy-assessing discrimination and calibration. Additionally, a web-based risk calculator was built with the best-performing model. RESULTS A total of 1024 patients were included, with 52 cases of C5 palsy. The ensemble model was well-calibrated and demonstrated excellent discrimination with an area under the receiver-operating characteristic curve of 0.773. The following features were the most important for ensemble model performance: diabetes mellitus, bipolar disorder, C5 or C4 level, surgical approach, preoperative non-motor neurologic symptoms, degenerative disease, number of fused levels, and age. CONCLUSIONS We report a risk calculator that generates patient-specific C5 palsy risk after instrumented cervical fusion. Individualized risk prediction for patients may facilitate improved preoperative patient counseling and risk stratification as well as potential intraoperative mitigating measures. This tool may also aid in addressing potentially modifiable risk factors such as diabetes and obesity.
Collapse
Affiliation(s)
- Akash A Shah
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA.
| | - Sai K Devana
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Changhee Lee
- Department of Artificial Intelligence, Chung-Ang University, Seoul, South Korea
| | - Amador Bugarin
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Michelle K Hong
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Alexander Upfill-Brown
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Gideon Blumstein
- Department of Orthopaedic Surgery, Rush University Medical Center, Chicago, Illinois, USA
| | - Elizabeth L Lord
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Arya N Shamie
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Mihaela van der Schaar
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom; Department of Electrical & Computer Engineering, UCLA, Los Angeles, California, USA
| | - Nelson F SooHoo
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| | - Don Y Park
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, California, USA
| |
Collapse
|
23
|
Shah AA, Devana SK, Lee C, Bugarin A, Lord EL, Shamie AN, Park DY, van der Schaar M, SooHoo NF. Machine learning-driven identification of novel patient factors for prediction of major complications after posterior cervical spinal fusion. Eur Spine J 2022; 31:1952-1959. [PMID: 34392418 PMCID: PMC8844303 DOI: 10.1007/s00586-021-06961-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Revised: 06/23/2021] [Accepted: 08/08/2021] [Indexed: 01/20/2023]
Abstract
PURPOSE Posterior cervical fusion is associated with increased rates of complications and readmission when compared to anterior fusion. Machine learning (ML) models for risk stratification of patients undergoing posterior cervical fusion remain limited. We aim to develop a novel ensemble ML algorithm for prediction of major perioperative complications and readmission after posterior cervical fusion and identify factors important to model performance. METHODS This is a retrospective cohort study of adults who underwent posterior cervical fusion at non-federal California hospitals between 2015 and 2017. The primary outcome was readmission or major complication. We developed an ensemble model predicting complication risk using an automated ML framework. We compared performance with standard ML models and logistic regression (LR), ranking contribution of included variables to model performance. RESULTS Of the included 6822 patients, 18.8% suffered a major complication or readmission. The ensemble model demonstrated slightly superior predictive performance compared to LR and standard ML models. The most important features to performance include sex, malignancy, pneumonia, stroke, and teaching hospital status. Seven of the ten most important features for the ensemble model were markedly less important for LR. CONCLUSION We report an ensemble ML model for prediction of major complications and readmission after posterior cervical fusion with a modest risk prediction advantage compared to LR and benchmark ML models. Notably, the features most important to the ensemble are markedly different from those for LR, suggesting that advanced ML methods may identify novel prognostic factors for adverse outcomes after posterior cervical fusion.
Collapse
Affiliation(s)
- Akash A Shah
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, 90095, USA.
| | - Sai K Devana
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, 90095, USA
| | - Changhee Lee
- Department of Electrical and Computer Engineering, University of California, Los Angeles, CA, USA
| | - Amador Bugarin
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, 90095, USA
| | - Elizabeth L Lord
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, 90095, USA
| | - Arya N Shamie
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, 90095, USA
| | - Don Y Park
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, 90095, USA
| | - Mihaela van der Schaar
- Department of Electrical and Computer Engineering, University of California, Los Angeles, CA, USA
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| | - Nelson F SooHoo
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, 90095, USA
| |
Collapse
|
24
|
Bugiardini R, Yoon J, Mendieta G, Kedev S, Zdravkovic M, Vasiljevic Z, Miličić D, Manfrini O, van der Schaar M, Gale CP, Bergami M, Badimon L, Cenko E. Reduced Heart Failure and Mortality in Patients Receiving Statin Therapy Before Initial Acute Coronary Syndrome. J Am Coll Cardiol 2022; 79:2021-2033. [PMID: 35589164 DOI: 10.1016/j.jacc.2022.03.354] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 02/09/2022] [Accepted: 03/10/2022] [Indexed: 10/18/2022]
Abstract
BACKGROUND There is uncertainty regarding the impact of statins on the risk of atherosclerotic cardiovascular disease (ASCVD) and its major complication, acute heart failure (AHF). OBJECTIVES The aim of this study was to investigate whether previous statin therapy translates into lower AHF events and improved survival from AHF among patients presenting with an acute coronary syndrome (ACS) as a first manifestation of ASCVD. METHODS Data were drawn from the International Survey of Acute Coronary Syndromes Archives. The study participants consisted of 14,542 Caucasian patients presenting with ACS without previous ASCVD events. Statin users before the index event were compared with nonusers by using inverse probability weighting models. Estimates were compared by test of interaction on the log scale. Main outcome measures were the incidence of AHF according to Killip class and the rate of 30-day all-cause mortality in patients presenting with AHF. RESULTS Previous statin therapy was associated with a significantly decreased rate of AHF on admission (4.3% absolute risk reduction; risk ratio [RR]: 0.72; 95% CI: 0.62-0.83) regardless of younger (40-75 years) or older age (interaction P = 0.27) and sex (interaction P = 0.22). Moreover, previous statin therapy predicted a lower risk of 30-day mortality in the subset of patients presenting with AHF on admission (5.2 % absolute risk reduction; RR: 0.71; 95% CI: 0.50-0.99). CONCLUSIONS Among adults presenting with ACS as a first manifestation of ASCVD, previous statin therapy is associated with a reduced risk of AHF and improved survival from AHF. (International Survey of Acute Coronary Syndromes [ISACS] Archives; NCT04008173).
Collapse
Affiliation(s)
- Raffaele Bugiardini
- Department of Experimental, Diagnostic, and Specialty Medicine, University of Bologna, Bologna, Italy.
| | - Jinsung Yoon
- Google Cloud AI, Sunnyvale, California, USA; Department of Electrical and Computer Engineering, University of California, Los Angeles, Los Angeles, California, USA
| | - Guiomar Mendieta
- Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), Madrid, Spain
| | - Sasko Kedev
- University Clinic of Cardiology, Faculty of Medicine Ss. Cyril and Methodius University, Skopje, Macedonia
| | - Marija Zdravkovic
- University Hospital Medical Center Bezanijska Kosa, Belgrade, Serbia
| | | | - Davor Miličić
- Department of Cardiovascular Diseases, University Hospital Center Zagreb, University of Zagreb, Zagreb, Croatia
| | - Olivia Manfrini
- Department of Experimental, Diagnostic, and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Mihaela van der Schaar
- Department of Electrical and Computer Engineering, University of California, Los Angeles, Los Angeles, California, USA; Cambridge Centre for Artificial Intelligence in Medicine, Department of Applied Mathematics and Theoretical Physics and Department of Population Health, University of Cambridge, Cambridge, United Kingdom
| | - Chris P Gale
- Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds, United Kingdom
| | - Maria Bergami
- Department of Experimental, Diagnostic, and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Lina Badimon
- Cardiovascular Research Program ICCC, IR-IIB Sant Pau, Hospital de la Santa Creu i Sant Pau, CiberCV-Institute Carlos III, Barcelona, Spain
| | - Edina Cenko
- Department of Experimental, Diagnostic, and Specialty Medicine, University of Bologna, Bologna, Italy
| |
Collapse
|
25
|
Devana SK, Shah AA, Lee C, Jensen AR, Cheung E, van der Schaar M, SooHoo NF. Development of a Machine Learning Algorithm for Prediction of Complications and Unplanned Readmission Following Primary Anatomic Total Shoulder Replacements. J Shoulder Elb Arthroplast 2022; 6:24715492221075444. [PMID: 35669619 PMCID: PMC9163721 DOI: 10.1177/24715492221075444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 12/23/2021] [Accepted: 01/05/2022] [Indexed: 11/16/2022] Open
Abstract
Background The demand and incidence of anatomic total shoulder arthroplasty (aTSA) procedures is projected to increase substantially over the next decade. There is a paucity of accurate risk prediction models which would be of great utility in minimizing morbidity and costs associated with major post-operative complications. Machine learning is a powerful predictive modeling tool and has become increasingly popular, especially in orthopedics. We aimed to build a ML model for prediction of major complications and readmission following primary aTSA. Methods A large California administrative database was retrospectively reviewed for all adults undergoing primary aTSA between 2015 to 2017. The primary outcome was any major complication or readmission following aTSA. A wide scope of standard ML benchmarks, including Logistic regression (LR), XGBoost, Gradient boosting, AdaBoost and Random Forest were employed to determine their power to predict outcomes. Additionally, important patient features to the prediction models were indentified. Results There were a total of 10,302 aTSAs with 598 (5.8%) having at least one major post-operative complication or readmission. XGBoost had the highest discriminative power (area under receiver operating curve AUROC of 0.689) of the 5 ML benchmarks with an area under precision recall curve AURPC of 0.207. History of implant complication, severe chronic kidney disease, teaching hospital status, coronary artery disease and male sex were the most important features for the performance of XGBoost. In addition, XGBoost identified teaching hospital status and male sex as markedly more important predictors of outcomes compared to LR models. Conclusion We report a well calibrated XGBoost ML algorithm for predicting major complications and 30-day readmission following aTSA. History of prior implant complication was the most important patient feature for XGBoost performance, a novel patient feature that surgeons should consider when counseling patients.
Collapse
Affiliation(s)
- Sai K Devana
- David Geffen School of Medicine UCLA, Los Angeles, CA
| | - Akash A Shah
- David Geffen School of Medicine UCLA, Los Angeles, CA
| | | | | | - Edward Cheung
- David Geffen School of Medicine UCLA, Los Angeles, CA
| | | | | |
Collapse
|
26
|
Devana SK, Shah AA, Lee C, Gudapati V, Jensen AR, Cheung E, Solorzano C, van der Schaar M, SooHoo NF. Development of a Machine Learning Algorithm for Prediction of Complications and Unplanned Readmission Following Reverse Total Shoulder Arthroplasty. J Shoulder Elb Arthroplast 2022; 5:24715492211038172. [PMID: 35330785 PMCID: PMC8938598 DOI: 10.1177/24715492211038172] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Revised: 06/21/2021] [Accepted: 07/20/2021] [Indexed: 11/22/2022] Open
Abstract
Background Reverse total shoulder arthroplasty (rTSA) offers tremendous promise for the treatment of complex pathologies beyond the scope of anatomic total shoulder arthroplasty but is associated with a higher rate of major postoperative complications. We aimed to design and validate a machine learning (ML) model to predict major postoperative complications or readmission following rTSA. Methods We retrospectively reviewed California's Office of Statewide Health Planning and Development database for patients who underwent rTSA between 2015 and 2017. We implemented logistic regression (LR), extreme gradient boosting (XGBoost), gradient boosting machines, adaptive boosting, and random forest classifiers in Python and trained these models using 64 binary, continuous, and discrete variables to predict the occurrence of at least one major postoperative complication or readmission following primary rTSA. Models were validated using the standard metrics of area under the receiver operating characteristic (AUROC) curve, area under the precision–recall curve (AUPRC), and Brier scores. The key factors for the top-performing model were determined. Results Of 2799 rTSAs performed during the study period, 152 patients (5%) had at least 1 major postoperative complication or 30-day readmission. XGBoost had the highest AUROC and AUPRC of 0.681 and 0.129, respectively. The key predictive features in this model were patients with a history of implant complications, protein-calorie malnutrition, and a higher number of comorbidities. Conclusion Our study reports an ML model for the prediction of major complications or 30-day readmission following rTSA. XGBoost outperformed traditional LR models and also identified key predictive features of complications and readmission.
Collapse
Affiliation(s)
- Sai K Devana
- David Geffen School of Medicine UCLA, Los Angeles, CA, USA
| | - Akash A Shah
- David Geffen School of Medicine UCLA, Los Angeles, CA, USA
| | | | - Varun Gudapati
- David Geffen School of Medicine UCLA, Los Angeles, CA, USA
| | | | - Edward Cheung
- David Geffen School of Medicine UCLA, Los Angeles, CA, USA
| | | | | | | |
Collapse
|
27
|
Simovic S, Davidovic G, Yoon J, Kedev S, Zdravkovic M, Vasiljevic Z, Milicic D, Manfrini O, Schaar MVD, Gale CP, Bergami M, Badimon L, Cenko E, Bugiardini R. IS A FAMILY HISTORY OF CORONARY HEART DISEASE AN INDEPENDENT CARDIOVASCULAR RISK FACTOR? J Am Coll Cardiol 2022. [DOI: 10.1016/s0735-1097(22)02064-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
28
|
Bergami M, Cenko E, Yoon J, Mendieta G, Kedev S, Zdravkovic M, Vasiljevic Z, Miličić D, Manfrini O, van der Schaar M, Gale CP, Badimon L, Bugiardini R. Statins for primary prevention among elderly men and women. Cardiovasc Res 2021; 118:3000-3009. [PMID: 34864917 DOI: 10.1093/cvr/cvab348] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Accepted: 11/24/2021] [Indexed: 11/14/2022] Open
Abstract
AIMS We undertook a propensity match-weighted cohort study to investigate whether statin treatment recommendations for statins translate into improved cardiovascular (CV) outcomes in the current routine clinical care of the elderly. METHODS AND RESULTS We included in our analysis (ISACS Archives -NCT04008173) a total of 5,619 Caucasian patients with no known prior history of CV disease who presented to hospital with a first manifestation of CV disease with age of 65 years or older. The risk of ST segment elevation myocardial infarction (STEMI) was much lower in statin users than in nonusers in both patients aged 65 to 75 years (14.7% absolute risk reduction; relative risk [RR]: 0.55, 95% CI 0.45 to 0.66) and those aged 76 years and older (13.3% absolute risk reduction; RR: 0.58, 95% CI 0.46 to 0.72). Estimates were similar in patients with and without history of hypercholesterolemia (interaction test; p value= 0.2408). Proportional reductions in STEMI diminished with female sex in the old (p for interaction = 0.002), but not in the very old age (p for interaction = 0.26). We also observed a remarkable reduction in the risk of 30- day mortality from STEMI with statin therapy in both age groups (10.2% absolute risk reduction; RR: 0.39; 95%CI 0.23-0.68 for patients aged 76 or over and 3.8% absolute risk reduction; RR 0.37; 95%CI 0.17-0.82 for patients aged 65 to 75 years old; interaction test, p value = 0.4570). CONCLUSIONS Preventive statin therapy in the elderly reduces the risk of STEMI with benefits in mortality from STEMI, irrespective of the presence of a history of hypercholesterolemia. This effect persists after the age of 76 years. Benefits are less pronounced in women. Randomized clinical trials may contribute to more definitively determine the role of statin therapy in the elderly. TRANSLATIONAL PERSPECTIVE In this register-based cohort study with match propensity-based design of patients without known prior history of CV disease, we compared statin users versus nonusers in two age groups: 65 to 75 years and 76 years and older. Statin use was associated with a 13% absolute reduction in the risk of ST segment elevation myocardial infarction (STEMI) in patients 76 years and older irrespective of the presence of a history of hypercholesterolemia. Statin use was also significantly related to a 10.2% reduction in 30-day mortality from STEMI. Estimates were similar in patients aged 65 to 75 years. Benefits were less pronounced in women. This study demonstrates that preventive statin therapy is broadly effective at reducing the risk of major cardiovascular events and mortality in the elderly. Results may inform future research and current guidelines.
Collapse
Affiliation(s)
- Maria Bergami
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Edina Cenko
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Jinsung Yoon
- Google Cloud AI, Sunnyvale, California, USA.,Department of Electrical and Computer Engineering, University of California, Los Angeles
| | - Guiomar Mendieta
- Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), Madrid, Spain
| | - Sasko Kedev
- University Clinic of Cardiology, Medical Faculty, University "Ss. Cyril and Methodius", Skopje, Macedonia
| | - Marija Zdravkovic
- University Clinical Hospital Center Bezanijska Kosa, Faculty of Medicine, University of Belgrade, Serbia
| | | | - Davor Miličić
- Department for Cardiovascular Diseases, University Hospital Center Zagreb, University of Zagreb, Zagreb, Croatia
| | - Olivia Manfrini
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Mihaela van der Schaar
- Department of Electrical and Computer Engineering, University of California, Los Angeles.,Cambridge Centre for Artificial Intelligence in Medicine, Department of Applied Mathematics and Theoretical Physics and Department of Population Health, University of Cambridge, Cambridge, United Kingdom
| | - Chris P Gale
- Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds, United Kingdom
| | - Lina Badimon
- Cardiovascular Research Program ICCC, IR-IIB Sant Pau, Hospital de la Santa Creu i Sant Pau, CiberCV-Institute Carlos III, Barcelona, Spain
| | - Raffaele Bugiardini
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| |
Collapse
|
29
|
Baqui P, Marra V, Alaa AM, Bica I, Ercole A, van der Schaar M. Comparing COVID-19 risk factors in Brazil using machine learning: the importance of socioeconomic, demographic and structural factors. Sci Rep 2021; 11:15591. [PMID: 34341397 PMCID: PMC8329284 DOI: 10.1038/s41598-021-95004-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Accepted: 06/22/2021] [Indexed: 12/24/2022] Open
Abstract
The COVID-19 pandemic continues to have a devastating impact on Brazil. Brazil's social, health and economic crises are aggravated by strong societal inequities and persisting political disarray. This complex scenario motivates careful study of the clinical, socioeconomic, demographic and structural factors contributing to increased risk of mortality from SARS-CoV-2 in Brazil specifically. We consider the Brazilian SIVEP-Gripe catalog, a very rich respiratory infection dataset which allows us to estimate the importance of several non-laboratorial and socio-geographic factors on COVID-19 mortality. We analyze the catalog using machine learning algorithms to account for likely complex interdependence between metrics. The XGBoost algorithm achieved excellent performance, producing an AUC-ROC of 0.813 (95% CI 0.810-0.817), and outperforming logistic regression. Using our model we found that, in Brazil, socioeconomic, geographical and structural factors are more important than individual comorbidities. Particularly important factors were: The state of residence and its development index; the distance to the hospital (especially for rural and less developed areas); the level of education; hospital funding model and strain. Ethnicity is also confirmed to be more important than comorbidities but less than the aforementioned factors. In conclusion, socioeconomic and structural factors are as important as biological factors in determining the outcome of COVID-19. This has important consequences for policy making, especially on vaccination/non-pharmacological preventative measures, hospital management and healthcare network organization.
Collapse
Affiliation(s)
- Pedro Baqui
- Núcleo de Astrofísica e Cosmologia, Universidade Federal do Espírito Santo, Vitória, ES, Brazil
| | - Valerio Marra
- Núcleo de Astrofísica e Cosmologia, Universidade Federal do Espírito Santo, Vitória, ES, Brazil
- Departamento de Física, Universidade Federal do Espírito Santo, Vitória, ES, Brazil
| | - Ahmed M Alaa
- Department of Electrical and Computer Engineering, University of California Los Angeles, Los Angeles, CA, USA
| | - Ioana Bica
- Department of Engineering Science, University of Oxford, Oxford, UK
- The Alan Turing Institute, London, UK
| | - Ari Ercole
- Department of Medicine, University of Cambridge, Cambridge, UK
- Cambridge Centre for Artificial Intelligence in Medicine, Cambridge, UK
| | - Mihaela van der Schaar
- Department of Electrical and Computer Engineering, University of California Los Angeles, Los Angeles, CA, USA.
- The Alan Turing Institute, London, UK.
- Cambridge Centre for Artificial Intelligence in Medicine, Cambridge, UK.
- Department of Applied Mathematics and Theoretical Physics and Department of Population Health, University of Cambridge, Cambridge, UK.
| |
Collapse
|
30
|
Devana SK, Shah AA, Lee C, Roney AR, van der Schaar M, SooHoo NF. A Novel, Potentially Universal Machine Learning Algorithm to Predict Complications in Total Knee Arthroplasty. Arthroplast Today 2021; 10:135-143. [PMID: 34401416 PMCID: PMC8349766 DOI: 10.1016/j.artd.2021.06.020] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Revised: 06/23/2021] [Accepted: 06/25/2021] [Indexed: 11/26/2022] Open
Abstract
BACKGROUND There remains a lack of accurate and validated outcome-prediction models in total knee arthroplasty (TKA). While machine learning (ML) is a powerful predictive tool, determining the proper algorithm to apply across diverse data sets is challenging. AutoPrognosis (AP) is a novel method that uses automated ML framework to incorporate the best performing stages of prognostic modeling into a single well-calibrated algorithm. We aimed to compare various ML methods to AP in predictive performance of complications after TKA. METHODS Thirty-eight preoperative patient demographics and clinical features from all primary TKAs performed at California-licensed hospitals between 2015 and 2017 were evaluated as predictors of major complications after TKA. Traditional logistic regression (LR), various other ML methods (XGBoost, Gradient Boosting, AdaBoost, and Random Forest), and AP were used for model building to determine discriminative power (area under receiver operating curve), calibration (Brier score), and feature importance. RESULTS Between 2015 and 2017, there were a total of 156,750 TKAs with 1109 (0.7%) total major complications. AP had the highest discriminative performance with area under receiver operating curve 0.679 compared with LR, XGBoost, Gradient Boosting, AdaBoost, and Random Forest (0.617, 0.601, 0.662, 0.657, and 0.545, respectively). AP (Brier score 0.007) had similar calibration as the other ML methods (0.006, 0.006, 0.022, 0.007, and 0.008, respectively). The variables that are most important for AP differ from those that are most important for LR. CONCLUSION Compared to conventional ML algorithms, AP has superior discriminative ability with similar calibration and suggests nonlinear relationships between variables in outcomes of TKA.
Collapse
Affiliation(s)
- Sai K. Devana
- Department of Orthopaedic Surgery, University of California, Los Angeles, USA
| | - Akash A. Shah
- Department of Orthopaedic Surgery, University of California, Los Angeles, USA
| | - Changhee Lee
- Department of Electrical and Computer Engineering, University of California, Los Angeles, USA
| | - Andrew R. Roney
- Department of Orthopaedic Surgery, University of California, Los Angeles, USA
| | - Mihaela van der Schaar
- Department of Electrical and Computer Engineering, University of California, Los Angeles, USA
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, London, UK
- The Alan Turing Institute, London, UK
| | - Nelson F. SooHoo
- Department of Orthopaedic Surgery, University of California, Los Angeles, USA
| |
Collapse
|
31
|
Abstract
Chronic diseases evolve slowly throughout a patient's lifetime creating heterogeneous progression patterns that make clinical outcomes remarkably varied across individual patients. A tool capable of identifying temporal phenotypes based on the patients different progression patterns and clinical outcomes would allow clinicians to better forecast disease progression by recognizing a group of similar past patients, and to better design treatment guidelines that are tailored to specific phenotypes. To build such a tool, we propose a deep learning approach, which we refer to as outcome-oriented deep temporal phenotyping (ODTP), to identify temporal phenotypes of disease progression considering what type of clinical outcomes will occur and when based on the longitudinal observations. More specifically, we model clinical outcomes throughout a patient's longitudinal observations via time-to-event (TTE) processes whose conditional intensity functions are estimated as non-linear functions using a recurrent neural network. Temporal phenotyping of disease progression is carried out by our novel loss function that is specifically designed to learn discrete latent representations that best characterize the underlying TTE processes. The key insight here is that learning such discrete representations groups progression patterns considering the similarity in expected clinical outcomes, and thus naturally provides outcome-oriented temporal phenotypes. We demonstrate the power of ODTP by applying it to a real-world heterogeneous cohort of 11 779 stage III breast cancer patients from the U.K. National Cancer Registration and Analysis Service. The experiments show that ODTP identifies temporal phenotypes that are strongly associated with the future clinical outcomes and achieves significant gain on the homogeneity and heterogeneity measures over existing methods. Furthermore, we are able to identify the key driving factors that lead to transitions between phenotypes which can be translated into actionable information to support better clinical decision-making.
Collapse
|
32
|
Bugiardini R, Cenko E, Yoon J, van der Schaar M, Kedev S, Gale CP, Vasiljevic Z, Bergami M, Miličić D, Zdravkovic M, Krljanac G, Badimon L, Manfrini O. Concerns about the use of digoxin in acute coronary syndromes. Eur Heart J Cardiovasc Pharmacother 2021; 8:474-482. [PMID: 34251454 DOI: 10.1093/ehjcvp/pvab055] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 06/16/2021] [Accepted: 07/09/2021] [Indexed: 12/21/2022]
Abstract
AIMS The use of digitalis has been plagued by controversy since its initial use. We aimed to determine the relationship between digoxin use and outcomes in hospitalized patients with acute coronary syndromes (ACSs) complicated by heart failure (HF) accounting for sex difference and prior heart diseases. METHODS AND RESULTS Of the 25,187 patients presenting with acute HF (Killip class ≥ 2) in the International Survey of Acute Coronary Syndromes (ISACS)-Archives (NCT04008173) registry, 4,722 (18.7%) received digoxin on hospital admission. The main outcome measure was all cause 30-day mortality. Estimates were evaluated by inverse probability of treatment weighting models. Women who received digoxin had a higher rate of death than women who did not receive it (33.8% vs. 29.2%; relative risk [RR] ratio:1.24;95% confidence interval [CI]: 1.12-1.37). Similar odds for mortality with digoxin were observed in men (28.5% vs. 24.9%; RR ratio 1.20; 95% CI:1.10-1.32). Comparable results were obtained in patients with no prior coronary heart disease (RR ratios:1.26; 95% CI: 1.10 to 1.45 in women and RR:1.21; 95% CI: 1.06 to 1.39 in men) and those in sinus rhythm at admission (RR ratios:1.34; 95% CI 1.15 to 1.54 in women and 1.26; 95% CI 1.10 to 1.45 in men). CONCLUSION Digoxin therapy is associated with an increased risk of early death among women and men with ACS complicated by HF. This finding highlights the need for re-examination of digoxin use in the clinical setting of ACS.
Collapse
Affiliation(s)
- Raffaele Bugiardini
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Edina Cenko
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Jinsung Yoon
- Google Cloud AI, Sunnyvale, California, USA.,Department of Electrical and Computer Engineering, University of California, Los Angeles
| | - Mihaela van der Schaar
- Department of Electrical and Computer Engineering, University of California, Los Angeles.,Cambridge Centre for Artificial Intelligence in Medicine, Department of Applied Mathematics and Theoretical Physics and Department of Population Health, University of Cambridge, Cambridge, United Kingdom
| | - Sasko Kedev
- University Clinic of Cardiology, Medical Faculty, University "Ss. Cyril and Methodius", Skopje, Macedonia
| | - Chris P Gale
- Clinical and Population Sciences Department, Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, Leeds, UK
| | | | - Maria Bergami
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Davor Miličić
- Department for Cardiovascular Diseases, University Hospital Centre Zagreb, University of Zagreb, Zagreb, Croatia
| | - Marija Zdravkovic
- University Clinical Hospital Center Bezanijska Kosa, Faculty of Medicine, University of Belgrade, Serbia
| | - Gordana Krljanac
- Cardiology Department, Clinical Centre of Serbia, Medical Faculty, University of Belgrade, Serbia
| | - Lina Badimon
- Cardiovascular Research Program ICCC, IR-IIB Sant Pau, Hospital de la Santa Creu i Sant Pau, CiberCV-Institute Carlos III, Barcelona, Spain
| | - Olivia Manfrini
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| |
Collapse
|
33
|
Vasiljevic Z, Scarpone M, Bergami M, Yoon J, van der Schaar M, Krljanac G, Asanin M, Davidovic G, Simovic S, Manfrini O, Mickovski-Katalina N, Badimon L, Cenko E, Bugiardini R. Smoking and sex differences in first manifestation of cardiovascular disease. Atherosclerosis 2021; 330:43-51. [PMID: 34233252 DOI: 10.1016/j.atherosclerosis.2021.06.909] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 06/22/2021] [Accepted: 06/24/2021] [Indexed: 01/11/2023]
Abstract
BACKGROUND AND AIMS An increasing proportion of women believe that smoking few cigarettes daily substantially reduces their risk of developing cardiovascular (CV) related disorders. The effect of low intensity smoking is still largely understudied. We investigated the relation among sex, age, cigarette smoking and ST segment elevation myocardial infarction (STEMI) as initial manifestation of CV disease. METHODS We analyzed data of 50,713 acute coronary syndrome patients with no prior manifestation of CV disease from the ISACS-Archives (NCT04008173) registry. We compared the rates of STEMI in current smokers (n = 11,530) versus nonsmokers (n = 39,183). RESULTS In the young middle age group (<60 years), there was evidence of a more harmful effect in women compared with men (RR ratios: 1.90; 95% CI: 1.69-2.14 versus 1.68; 95% CI: 1.56-1.80). This association persisted even in women who smoked 1 to 10 packs per year (RR ratios: 2.02; 95% CI: 1.65 to 2.48 versus 1.38; 95% CI: 1.22 to 1.57). In the older group, rates of STEMI were similar for women and men (RR ratios: 1.36; 95% CI: 1.22-1.53 versus 1.39; 95% CI: 1.28-1.50). STEMI was associated with a twofold higher 30-day mortality rate in young middle age women compared with men of the same age (odds ratios, 5.54; 95% CI, 3.83-8.03 vs. 2.93; 95% CI, 2.33-3.69). CONCLUSIONS Low intensity smoking provides inadequate protection in young - middle age women as they still have a substantially higher rate of STEMI and related mortality compared with men even smoking less than 10 packs per year. This finding is worrying as more young - middle age women are smoking, and rates of smoking among young-middle age men continue to fall.
Collapse
Affiliation(s)
| | - Marialuisa Scarpone
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Maria Bergami
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Jinsung Yoon
- Google Cloud AI, Sunnyvale, CA, USA; Department of Electrical and Computer Engineering, University of California, Los Angeles, USA
| | - Mihaela van der Schaar
- Department of Electrical and Computer Engineering, University of California, Los Angeles, USA; Cambridge Centre for Artificial Intelligence in Medicine, Department of Applied Mathematics and Theoretical Physics and Department of Population Health, University of Cambridge, Cambridge, United Kingdom
| | - Gordana Krljanac
- Cardiology Department, Clinical Centre of Serbia, Medical Faculty, University of Belgrade, Serbia
| | - Milika Asanin
- Cardiology Department, Clinical Centre of Serbia, Medical Faculty, University of Belgrade, Serbia
| | - Goran Davidovic
- Department of Internal Medicine, Faculty of Medical Sciences, University of Kragujevac, Kragujevac, Serbia; Clinic for Cardiology, University Clinical Center Kragujevac, Kragujevac, Serbia
| | - Stefan Simovic
- Department of Internal Medicine, Faculty of Medical Sciences, University of Kragujevac, Kragujevac, Serbia; Clinic for Cardiology, University Clinical Center Kragujevac, Kragujevac, Serbia
| | - Olivia Manfrini
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Natasa Mickovski-Katalina
- Institute of Public Health of Serbia "Dr Milan Jovanović Batut", Center for Prevention and Control of Diseases, Department for Prevention and Control of Non-communicable Disease, Belgrade, Serbia
| | - Lina Badimon
- Cardiovascular Research Program ICCC, IR-IIB Sant Pau, Hospital de la Santa Creu i Sant Pau, CiberCV-Institute Carlos III, Barcelona, Spain
| | - Edina Cenko
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy
| | - Raffaele Bugiardini
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Bologna, Italy.
| |
Collapse
|
34
|
Alaa AM, Gurdasani D, Harris AL, Rashbass J, van der Schaar M. Machine learning to guide the use of adjuvant therapies for breast cancer. NAT MACH INTELL 2021. [DOI: 10.1038/s42256-021-00353-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
35
|
Abroshan M, Alaa AM, Rayner O, van der Schaar M. Opportunities for machine learning to transform care for people with cystic fibrosis. J Cyst Fibros 2021; 19:6-8. [PMID: 32000972 DOI: 10.1016/j.jcf.2020.01.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Affiliation(s)
- Mahed Abroshan
- The Department of Applied Mathematics and Theoretical Physics and The Department of Public Health, University of Cambridge, United Kingdom
| | - Ahmed M Alaa
- Department of Electrical and Computer Engineering, University of California Los Angeles, Los Angeles, California, USA
| | - Oli Rayner
- Person with CF, Plymouth, United Kingdom
| | - Mihaela van der Schaar
- The Department of Applied Mathematics and Theoretical Physics and The Department of Public Health, University of Cambridge, United Kingdom; Department of Electrical and Computer Engineering, University of California Los Angeles, Los Angeles, California, USA.
| |
Collapse
|
36
|
Thoral PJ, Peppink JM, Driessen RH, Sijbrands EJG, Kompanje EJO, Kaplan L, Bailey H, Kesecioglu J, Cecconi M, Churpek M, Clermont G, van der Schaar M, Ercole A, Girbes ARJ, Elbers PWG. Sharing ICU Patient Data Responsibly Under the Society of Critical Care Medicine/European Society of Intensive Care Medicine Joint Data Science Collaboration: The Amsterdam University Medical Centers Database (AmsterdamUMCdb) Example. Crit Care Med 2021; 49:e563-e577. [PMID: 33625129 PMCID: PMC8132908 DOI: 10.1097/ccm.0000000000004916] [Citation(s) in RCA: 74] [Impact Index Per Article: 24.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES Critical care medicine is a natural environment for machine learning approaches to improve outcomes for critically ill patients as admissions to ICUs generate vast amounts of data. However, technical, legal, ethical, and privacy concerns have so far limited the critical care medicine community from making these data readily available. The Society of Critical Care Medicine and the European Society of Intensive Care Medicine have identified ICU patient data sharing as one of the priorities under their Joint Data Science Collaboration. To encourage ICUs worldwide to share their patient data responsibly, we now describe the development and release of Amsterdam University Medical Centers Database (AmsterdamUMCdb), the first freely available critical care database in full compliance with privacy laws from both the United States and Europe, as an example of the feasibility of sharing complex critical care data. SETTING University hospital ICU. SUBJECTS Data from ICU patients admitted between 2003 and 2016. INTERVENTIONS We used a risk-based deidentification strategy to maintain data utility while preserving privacy. In addition, we implemented contractual and governance processes, and a communication strategy. Patient organizations, supporting hospitals, and experts on ethics and privacy audited these processes and the database. MEASUREMENTS AND MAIN RESULTS AmsterdamUMCdb contains approximately 1 billion clinical data points from 23,106 admissions of 20,109 patients. The privacy audit concluded that reidentification is not reasonably likely, and AmsterdamUMCdb can therefore be considered as anonymous information, both in the context of the U.S. Health Insurance Portability and Accountability Act and the European General Data Protection Regulation. The ethics audit concluded that responsible data sharing imposes minimal burden, whereas the potential benefit is tremendous. CONCLUSIONS Technical, legal, ethical, and privacy challenges related to responsible data sharing can be addressed using a multidisciplinary approach. A risk-based deidentification strategy, that complies with both U.S. and European privacy regulations, should be the preferred approach to releasing ICU patient data. This supports the shared Society of Critical Care Medicine and European Society of Intensive Care Medicine vision to improve critical care outcomes through scientific inquiry of vast and combined ICU datasets.
Collapse
Affiliation(s)
- Patrick J Thoral
- Department of Intensive Care Medicine, Amsterdam Medical Data Science (AMDS), Amsterdam Cardiovascular Sciences (ACS), Amsterdam Infection and Immunity Institute (AI&II), Amsterdam UMC, Vrije Universiteit, Universiteit van Amsterdam, Amsterdam, The Netherlands
| | - Jan M Peppink
- Department of Intensive Care Medicine, Amsterdam Medical Data Science (AMDS), Amsterdam Cardiovascular Sciences (ACS), Amsterdam Infection and Immunity Institute (AI&II), Amsterdam UMC, Vrije Universiteit, Universiteit van Amsterdam, Amsterdam, The Netherlands
| | - Ronald H Driessen
- Department of Intensive Care Medicine, Amsterdam Medical Data Science (AMDS), Amsterdam Cardiovascular Sciences (ACS), Amsterdam Infection and Immunity Institute (AI&II), Amsterdam UMC, Vrije Universiteit, Universiteit van Amsterdam, Amsterdam, The Netherlands
| | | | - Erwin J O Kompanje
- Department of Intensive Care Medicine, Erasmus MC, Rotterdam, The Netherlands
| | - Lewis Kaplan
- Division of Trauma, Surgical Critical Care and Emergency Surgery, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
- Executive Committee, Society of Critical Care Medicine, Mount Prospect, IL
| | - Heatherlee Bailey
- Department of Emergency Medicine, Durham VA Medical Center, Durham, NC
- Executive Committee, Society of Critical Care Medicine, Mount Prospect, IL
| | - Jozef Kesecioglu
- Department of Intensive Care Medicine, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
- Executive Committee, European Society of Intensive Care Medicine, Brussels, Belgium
| | - Maurizio Cecconi
- Executive Committee, European Society of Intensive Care Medicine, Brussels, Belgium
- Department of Anaesthesia and Intensive Care, Humanitas Research Hospital, Humanitas University, Milan, Italy
| | - Matthew Churpek
- Department of Medicine, University of Wisconsin, Madison, WI
| | - Gilles Clermont
- Department of Critical Care Medicine, CRISMA Laboratory, University of Pittsburgh, Pittsburgh, PA
| | - Mihaela van der Schaar
- University of Cambridge, Cambridge, United Kingdom
- Alan Turing Institute, London, United Kingdom
| | - Ari Ercole
- Division of Anaesthesia, University of Cambridge, Cambridge, United Kingdom
- Data Science Section, European Society of Intensive Care Medicine, Brussels, Belgium
| | - Armand R J Girbes
- Department of Intensive Care Medicine, Amsterdam Medical Data Science (AMDS), Amsterdam Cardiovascular Sciences (ACS), Amsterdam Infection and Immunity Institute (AI&II), Amsterdam UMC, Vrije Universiteit, Universiteit van Amsterdam, Amsterdam, The Netherlands
- Executive Committee, European Society of Intensive Care Medicine, Brussels, Belgium
| | - Paul W G Elbers
- Department of Intensive Care Medicine, Amsterdam Medical Data Science (AMDS), Amsterdam Cardiovascular Sciences (ACS), Amsterdam Infection and Immunity Institute (AI&II), Amsterdam UMC, Vrije Universiteit, Universiteit van Amsterdam, Amsterdam, The Netherlands
- Data Science Section, European Society of Intensive Care Medicine, Brussels, Belgium
| |
Collapse
|
37
|
Shah AA, Devana SK, Lee C, Kianian R, van der Schaar M, SooHoo NF. Development of a Novel, Potentially Universal Machine Learning Algorithm for Prediction of Complications After Total Hip Arthroplasty. J Arthroplasty 2021; 36:1655-1662.e1. [PMID: 33478891 PMCID: PMC10371358 DOI: 10.1016/j.arth.2020.12.040] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 11/19/2020] [Accepted: 12/22/2020] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND As the prevalence of hip osteoarthritis increases, the number of total hip arthroplasty (THA) procedures performed is also projected to increase. Accurately risk-stratifying patients who undergo THA would be of great utility, given the significant cost and morbidity associated with developing perioperative complications. We aim to develop a novel machine learning (ML)-based ensemble algorithm for the prediction of major complications after THA, as well as compare its performance against standard benchmark ML methods. METHODS This is a retrospective cohort study of 89,986 adults who underwent primary THA at any California-licensed hospital between 2015 and 2017. The primary outcome was major complications (eg infection, venous thromboembolism, cardiac complication, pulmonary complication). We developed a model predicting complication risk using AutoPrognosis, an automated ML framework that configures the optimally performing ensemble of ML-based prognostic models. We compared our model with logistic regression and standard benchmark ML models, assessing discrimination and calibration. RESULTS There were 545 patients who had major complications (0.61%). Our novel algorithm was well-calibrated and improved risk prediction compared to logistic regression, as well as outperformed the other four standard benchmark ML algorithms. The variables most important for AutoPrognosis (eg malnutrition, dementia, cancer) differ from those that are most important for logistic regression (eg chronic atherosclerosis, renal failure, chronic obstructive pulmonary disease). CONCLUSION We report a novel ensemble ML algorithm for the prediction of major complications after THA. It demonstrates superior risk prediction compared to logistic regression and other standard ML benchmark algorithms. By providing accurate prognostic information, this algorithm may facilitate more informed preoperative shared decision-making.
Collapse
Affiliation(s)
- Akash A Shah
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Sai K Devana
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Changhee Lee
- Department of Electrical and Computer Engineering, University of California, Los Angeles, CA
| | - Reza Kianian
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| | - Mihaela van der Schaar
- Department of Electrical and Computer Engineering, University of California, Los Angeles, CA; Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom
| | - Nelson F SooHoo
- Department of Orthopaedic Surgery, David Geffen School of Medicine at UCLA, Los Angeles, CA
| |
Collapse
|
38
|
Cenko E, Bergami M, Yoon J, Fabin N, van der Schaar M, Manfrini O, Vasiljevic Z, Zdravkovic M, Vavlukis M, Kedev S, Milicic D, Badimon L, Bugiardini R. STATINS AND SEVERITY OF CLINICAL MANIFESTATIONS AMONG WOMEN AND MEN WITH INCIDENT CORONARY HEART DISEASE. J Am Coll Cardiol 2021. [DOI: 10.1016/s0735-1097(21)01521-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
39
|
Lee C, Light A, Alaa A, Thurtle D, van der Schaar M, Gnanapragasam VJ. Application of a novel machine learning framework for predicting non-metastatic prostate cancer-specific mortality in men using the Surveillance, Epidemiology, and End Results (SEER) database. Lancet Digit Health 2021; 3:e158-e165. [PMID: 33549512 DOI: 10.1016/s2589-7500(20)30314-9] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2020] [Revised: 12/03/2020] [Accepted: 12/10/2020] [Indexed: 12/23/2022]
Abstract
BACKGROUND Accurate prognostication is crucial in treatment decisions made for men diagnosed with non-metastatic prostate cancer. Current models rely on prespecified variables, which limits their performance. We aimed to investigate a novel machine learning approach to develop an improved prognostic model for predicting 10-year prostate cancer-specific mortality and compare its performance with existing validated models. METHODS We derived and tested a machine learning-based model using Survival Quilts, an algorithm that automatically selects and tunes ensembles of survival models using clinicopathological variables. Our study involved a US population-based cohort of 171 942 men diagnosed with non-metastatic prostate cancer between Jan 1, 2000, and Dec 31, 2016, from the prospectively maintained Surveillance, Epidemiology, and End Results (SEER) Program. The primary outcome was prediction of 10-year prostate cancer-specific mortality. Model discrimination was assessed using the concordance index (c-index), and calibration was assessed using Brier scores. The Survival Quilts model was compared with nine other prognostic models in clinical use, and decision curve analysis was done. FINDINGS 647 151 men with prostate cancer were enrolled into the SEER database, of whom 171 942 were included in this study. Discrimination improved with greater granularity, and multivariable models outperformed tier-based models. The Survival Quilts model showed good discrimination (c-index 0·829, 95% CI 0·820-0·838) for 10-year prostate cancer-specific mortality, which was similar to the top-ranked multivariable models: PREDICT Prostate (0·820, 0·811-0·829) and Memorial Sloan Kettering Cancer Center (MSKCC) nomogram (0·787, 0·776-0·798). All three multivariable models showed good calibration with low Brier scores (Survival Quilts 0·036, 95% CI 0·035-0·037; PREDICT Prostate 0·036, 0·035-0·037; MSKCC 0·037, 0·035-0·039). Of the tier-based systems, the Cancer of the Prostate Risk Assessment model (c-index 0·782, 95% CI 0·771-0·793) and Cambridge Prognostic Groups model (0·779, 0·767-0·791) showed higher discrimination for predicting 10-year prostate cancer-specific mortality. c-indices for models from the National Comprehensive Cancer Care Network, Genitourinary Radiation Oncologists of Canada, American Urological Association, European Association of Urology, and National Institute for Health and Care Excellence ranged from 0·711 (0·701-0·721) to 0·761 (0·750-0·772). Discrimination for the Survival Quilts model was maintained when stratified by age and ethnicity. Decision curve analysis showed an incremental net benefit from the Survival Quilts model compared with the MSKCC and PREDICT Prostate models currently used in practice. INTERPRETATION A novel machine learning-based approach produced a prognostic model, Survival Quilts, with discrimination for 10-year prostate cancer-specific mortality similar to the top-ranked prognostic models, using only standard clinicopathological variables. Future integration of additional data will likely improve model performance and accuracy for personalised prognostics. FUNDING None.
Collapse
Affiliation(s)
- Changhee Lee
- Department of Electrical and Computer Engineering, University of California, Los Angeles, CA, USA
| | - Alexander Light
- Department of Surgery, Division of Urology, University of Cambridge, Cambridge, UK; Department of Urology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | - Ahmed Alaa
- Department of Electrical and Computer Engineering, University of California, Los Angeles, CA, USA
| | - David Thurtle
- Department of Surgery, Division of Urology, University of Cambridge, Cambridge, UK; Department of Urology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | - Mihaela van der Schaar
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK; Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Vincent J Gnanapragasam
- Department of Surgery, Division of Urology, University of Cambridge, Cambridge, UK; Cambridge Urology Translational Research and Clinical Trials Office, Cambridge Biomedical Campus, University of Cambridge, Cambridge, UK; Department of Urology, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK.
| |
Collapse
|
40
|
Qian Z, Alaa AM, van der Schaar M. CPAS: the UK's national machine learning-based hospital capacity planning system for COVID-19. Mach Learn 2020; 110:15-35. [PMID: 33250568 PMCID: PMC7685302 DOI: 10.1007/s10994-020-05921-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 09/05/2020] [Accepted: 09/26/2020] [Indexed: 12/20/2022]
Abstract
The coronavirus disease 2019 (COVID-19) global pandemic poses the threat of overwhelming healthcare systems with unprecedented demands for intensive care resources. Managing these demands cannot be effectively conducted without a nationwide collective effort that relies on data to forecast hospital demands on the national, regional, hospital and individual levels. To this end, we developed the COVID-19 Capacity Planning and Analysis System (CPAS)—a machine learning-based system for hospital resource planning that we have successfully deployed at individual hospitals and across regions in the UK in coordination with NHS Digital. In this paper, we discuss the main challenges of deploying a machine learning-based decision support system at national scale, and explain how CPAS addresses these challenges by (1) defining the appropriate learning problem, (2) combining bottom-up and top-down analytical approaches, (3) using state-of-the-art machine learning algorithms, (4) integrating heterogeneous data sources, and (5) presenting the result with an interactive and transparent interface. CPAS is one of the first machine learning-based systems to be deployed in hospitals on a national scale to address the COVID-19 pandemic—we conclude the paper with a summary of the lessons learned from this experience.
Collapse
Affiliation(s)
| | | | - Mihaela van der Schaar
- University of Cambridge, Cambridge, UK
- University of California, Los Angeles, USA
- The Alan Turing Institute, London, UK
| |
Collapse
|
41
|
Alaa A, Qian Z, Rashbass J, Benger J, van der Schaar M. Retrospective cohort study of admission timing and mortality following COVID-19 infection in England. BMJ Open 2020; 10:e042712. [PMID: 33234660 PMCID: PMC7684820 DOI: 10.1136/bmjopen-2020-042712] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Revised: 10/27/2020] [Accepted: 11/10/2020] [Indexed: 12/15/2022] Open
Abstract
OBJECTIVES We investigated whether the timing of hospital admission is associated with the risk of mortality for patients with COVID-19 in England, and the factors associated with a longer interval between symptom onset and hospital admission. DESIGN Retrospective observational cohort study of data collected by the COVID-19 Hospitalisation in England Surveillance System (CHESS). Data were analysed using multivariate regression analysis. SETTING Acute hospital trusts in England that submit data to CHESS routinely. PARTICIPANTS Of 14 150 patients included in CHESS until 13 May 2020, 401 lacked a confirmed diagnosis of COVID-19 and 7666 lacked a recorded date of symptom onset. This left 6083 individuals, of whom 15 were excluded because the time between symptom onset and hospital admission exceeded 3 months. The study cohort therefore comprised 6068 unique individuals. MAIN OUTCOME MEASURES All-cause mortality during the study period. RESULTS Timing of hospital admission was an independent predictor of mortality following adjustment for age, sex, comorbidities, ethnicity and obesity. Each additional day between symptom onset and hospital admission was associated with a 1% increase in mortality risk (HR 1.01; p<0.005). Healthcare workers were most likely to have an increased interval between symptom onset and hospital admission, as were people from Black, Asian and minority ethnic (BAME) backgrounds, and patients with obesity. CONCLUSION The timing of hospital admission is associated with mortality in patients with COVID-19. Healthcare workers and individuals from a BAME background are at greater risk of later admission, which may contribute to reports of poorer outcomes in these groups. Strategies to identify and admit patients with high-risk and those showing signs of deterioration in a timely way may reduce the consequent mortality from COVID-19, and should be explored.
Collapse
Affiliation(s)
- Ahmed Alaa
- University of California, Los Angeles, California, USA
| | - Zhaozhi Qian
- Centre for Mathematical Sciences, Cambridge University, Cambridge, UK
| | | | | | | |
Collapse
|
42
|
Abstract
State-switching models such as hidden Markov models or Markov-switching regression models are routinely applied to analyse sequences of observations that are driven by underlying non-observable states. Coupled state-switching models extend these approaches to address the case of multiple observation sequences whose underlying state variables interact. In this article, we provide an overview of the modelling techniques related to coupling in state-switching models, thereby forming a rich and flexible statistical framework particularly useful for modelling correlated time series. Simulation experiments demonstrate the relevance of being able to account for an asynchronous evolution as well as interactions between the underlying latent processes. The models are further illustrated using two case studies related to (a) interactions between a dolphin mother and her calf as inferred from movement data and (b) electronic health record data collected on 696 patients within an intensive care unit.
Collapse
Affiliation(s)
| | | | - Mihaela van der Schaar
- University of Cambridge, Cambridge, UK
- The Alan Turing Institute, London, UK
- University of California, Los Angeles, California, USA
| | - Ruth King
- The Alan Turing Institute, London, UK
- University of Edinburgh, Edinburgh, UK
| | | |
Collapse
|
43
|
Bugiardini R, Pavasović S, Yoon J, van der Schaar M, Kedev S, Vavlukis M, Vasiljevic Z, Bergami M, Miličić D, Manfrini O, Cenko E, Badimon L. Aspirin for primary prevention of ST segment elevation myocardial infarction in persons with diabetes and multiple risk factors. EClinicalMedicine 2020; 27:100548. [PMID: 33150322 PMCID: PMC7599315 DOI: 10.1016/j.eclinm.2020.100548] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/20/2020] [Revised: 08/12/2020] [Accepted: 08/28/2020] [Indexed: 01/31/2023] Open
Abstract
BACKGROUND Controversy exists as to whether low-dose aspirin use may give benefit in primary prevention of cardiovascular (CV) events. We hypothesized that the benefits of aspirin are underevaluated. METHODS We investigated 12,123 Caucasian patients presenting to hospital with acute coronary syndromes as first manifestation of CV disease from 2010 to 2019 in the ISACS-TC multicenter registry (ClinicalTrials.gov, NCT01218776). Individual risk of ST segment elevation myocardial infarction (STEMI) and its association with 30-day mortality was quantified using inverse probability of treatment weighting models matching for concomitant medications. Estimates were compared by test of interaction on the log scale. FINDINGS The risk of STEMI was lower in the aspirin users (absolute reduction: 6·8%; OR: 0·73; 95%CI: 0·65-0·82) regardless of sex (p for interaction=0·1962) or age (p for interaction=0·1209). Benefits of aspirin were seen in patients with hypertension, hypercholesterolemia, and in smokers. In contrast, aspirin failed to demonstrate a significant risk reduction in STEMI among diabetic patients (OR:1·10;95%CI:0·89-1·35) with a significant interaction (p: <0·0001) when compared with controls (OR:0·64,95%CI:0·56-0·73). Stratification of diabetes in risk categories revealed benefits (p interaction=0·0864) only in patients with concomitant hypertension and hypercholesterolemia (OR:0·87, 95% CI:0·65-1·15), but not in smokers. STEMI was strongly related to 30-day mortality (OR:1·93; 95%CI:1·59-2·35). INTERPRETATION Low-dose aspirin reduces the risk of STEMI as initial manifestation of CV disease with potential benefit in mortality. Patients with diabetes derive substantial benefit from aspirin only in the presence of multiple risk factors. In the era of precision medicine, a more tailored strategy is required. FUNDING None.
Collapse
Affiliation(s)
- Raffaele Bugiardini
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Policlinico Sant'Orsola Malpighi, Padiglione 11, Via Massarenti 9, 40138 Bologna, Italy
- Corresponding author.
| | - Saša Pavasović
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Policlinico Sant'Orsola Malpighi, Padiglione 11, Via Massarenti 9, 40138 Bologna, Italy
- Department for Cardiovascular Diseases, University Hospital Centre Zagreb, University of Zagreb, Zagreb, Croatia
| | | | - Mihaela van der Schaar
- Cambridge Centre for Artificial Intelligence in Medicine, Department of Applied Mathematics and Theoretical Physics and Department of Population Health, University of Cambridge, Cambridge, United Kingdom
| | - Sasko Kedev
- University Clinic of Cardiology, Medical Faculty, University "Ss. Cyril and Methodius", Skopje, Macedonia
| | - Marija Vavlukis
- University Clinic of Cardiology, Medical Faculty, University "Ss. Cyril and Methodius", Skopje, Macedonia
| | | | - Maria Bergami
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Policlinico Sant'Orsola Malpighi, Padiglione 11, Via Massarenti 9, 40138 Bologna, Italy
| | - Davor Miličić
- Department for Cardiovascular Diseases, University Hospital Centre Zagreb, University of Zagreb, Zagreb, Croatia
| | - Olivia Manfrini
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Policlinico Sant'Orsola Malpighi, Padiglione 11, Via Massarenti 9, 40138 Bologna, Italy
| | - Edina Cenko
- Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Policlinico Sant'Orsola Malpighi, Padiglione 11, Via Massarenti 9, 40138 Bologna, Italy
| | - Lina Badimon
- Cardiovascular Research Program ICCC, IR-IIB Sant Pau, Hospital de la Santa Creu i Sant Pau, CiberCV-Institute Carlos III, Barcelona, Spain
| |
Collapse
|
44
|
Manfrini O, Yoon J, van der Schaar M, Kedev S, Vavlukis M, Stankovic G, Scarpone M, Miličić D, Vasiljevic Z, Badimon L, Cenko E, Bugiardini R. Sex Differences in Modifiable Risk Factors and Severity of Coronary Artery Disease. J Am Heart Assoc 2020; 9:e017235. [PMID: 32981423 PMCID: PMC7792418 DOI: 10.1161/jaha.120.017235] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Background It is still unknown whether traditional risk factors may have a sex‐specific impact on coronary artery disease (CAD) burden. Methods and Results We identified 14 793 patients who underwent coronary angiography for acute coronary syndromes in the ISACS‐TC (International Survey of Acute Coronary Syndromes in Transitional Countries; ClinicalTrials.gov, NCT01218776) registry from 2010 to 2019. The main outcome measure was the association between traditional risk factors and severity of CAD and its relationship with 30‐day mortality. Relative risk (RR) ratios and 95% CIs were calculated from the ratio of the absolute risks of women versus men using inverse probability of weighting. Estimates were compared by test of interaction on the log scale. Severity of CAD was categorized as obstructive (≥50% stenosis) versus nonobstructive CAD. The RR ratio for obstructive CAD in women versus men among people without diabetes mellitus was 0.49 (95% CI, 0.41–0.60) and among those with diabetes mellitus was 0.89 (95% CI, 0.62–1.29), with an interaction by diabetes mellitus status of P =0.002. Exposure to smoking shifted the RR ratios from 0.50 (95% CI, 0.41–0.61) in nonsmokers to 0.75 (95% CI, 0.54–1.03) in current smokers, with an interaction by smoking status of P=0.018. There were no significant sex‐related interactions with hypercholesterolemia and hypertension. Women with obstructive CAD had higher 30‐day mortality rates than men (RR, 1.75; 95% CI, 1.48–2.07). No sex differences in mortality were observed in patients with nonobstructive CAD. Conclusions Obstructive CAD in women signifies a higher risk for mortality compared with men. Current smoking and diabetes mellitus disproportionally increase the risk of obstructive CAD in women. Achieving the goal of improving cardiovascular health in women still requires intensive efforts toward further implementation of lifestyle and treatment interventions. Registration URL: https://www.clinicaltrials.gov; Unique identifier: NCT01218776.
Collapse
Affiliation(s)
- Olivia Manfrini
- Department of Experimental, Diagnostic and Specialty Medicine University of Bologna Bologna Italy
| | - Jinsung Yoon
- Department of Electrical and Computer Engineering University of California Los Angeles CA
| | - Mihaela van der Schaar
- Cambridge Centre for Artificial Intelligence in Medicine Department of Applied Mathematics and Theoretical Physics and Department of Population Health University of Cambridge Cambridge United Kingdom
| | - Sasko Kedev
- University Clinic of Cardiology Medical Faculty University "Ss. Cyril and Methodius" Skopje Macedonia
| | - Marija Vavlukis
- University Clinic of Cardiology Medical Faculty University "Ss. Cyril and Methodius" Skopje Macedonia
| | - Goran Stankovic
- Clinic of Cardiology University Clinical Centre of Serbia Belgrade Serbia.,Medical Faculty University of Belgrade Serbia
| | - Marialuisa Scarpone
- Department of Experimental, Diagnostic and Specialty Medicine University of Bologna Bologna Italy
| | - Davor Miličić
- Department for Cardiovascular Diseases University Hospital Center Zagreb University of Zagreb Croatia
| | | | - Lina Badimon
- Cardiovascular Research Program ICCC, IR-IIBSant Pau, Hospital de la Santa Creu i Sant Pau, CiberCV-Institute Carlos III Barcelona Spain
| | - Edina Cenko
- Department of Experimental, Diagnostic and Specialty Medicine University of Bologna Bologna Italy
| | - Raffaele Bugiardini
- Department of Experimental, Diagnostic and Specialty Medicine University of Bologna Bologna Italy
| |
Collapse
|
45
|
Zame WR, Bica I, Shen C, Curth A, Lee HS, Bailey S, Weatherall J, Wright D, Bretz F, van der Schaar M. Machine learning for clinical trials in the era of COVID-19. Stat Biopharm Res 2020; 12:506-517. [PMID: 34191983 PMCID: PMC8011491 DOI: 10.1080/19466315.2020.1797867] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2020] [Revised: 06/18/2020] [Accepted: 07/03/2020] [Indexed: 12/18/2022]
Abstract
The world is in the midst of a pandemic. We still know little about the disease COVID-19 or about the virus (SARS-CoV-2) that causes it. We do not have a vaccine or a treatment (aside from managing symptoms). We do not know if recovery from COVID-19 produces immunity, and if so for how long, hence we do not know if "herd immunity" will eventually reduce the risk or if a successful vaccine can be developed - and this knowledge may be a long time coming. In the meantime, the COVID-19 pandemic is presenting enormous challenges to medical research, and to clinical trials in particular. This paper identifies some of those challenges and suggests ways in which machine learning can help in response to those challenges. We identify three areas of challenge: ongoing clinical trials for non-COVID-19 drugs; clinical trials for repurposing drugs to treat COVID-19, and clinical trials for new drugs to treat COVID-19. Within each of these areas, we identify aspects for which we believe machine learning can provide invaluable assistance.
Collapse
Affiliation(s)
- William R. Zame
- Department of Economics and Mathematics, UCLA, Los Angeles, CA, USA
| | - Ioana Bica
- University of Oxford, Oxford, UK
- The Alan Turing Institute, London, UK
| | - Cong Shen
- Department of Electrical and Computer Engineering, University of Virginia, Charlottesville, VA, USA
| | | | - Hyun-Suk Lee
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, UK
| | | | | | | | - Frank Bretz
- Novartis Pharma AG, Basel, Switzerland
- Section for Medical Statistics, Medical University of Vienna, Vienna, Austria
| | - Mihaela van der Schaar
- The Alan Turing Institute, London, UK
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, UK
- Department of Electrical and Computer Engineering, UCLA, Los Angeles, CA, USA
| |
Collapse
|
46
|
Baqui P, Bica I, Marra V, Ercole A, van der Schaar M. Ethnic and regional variations in hospital mortality from COVID-19 in Brazil: a cross-sectional observational study. Lancet Glob Health 2020; 8:e1018-e1026. [PMID: 32622400 PMCID: PMC7332269 DOI: 10.1016/s2214-109x(20)30285-0] [Citation(s) in RCA: 319] [Impact Index Per Article: 79.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Revised: 06/15/2020] [Accepted: 06/16/2020] [Indexed: 12/13/2022]
Abstract
BACKGROUND Brazil ranks second worldwide in total number of COVID-19 cases and deaths. Understanding the possible socioeconomic and ethnic health inequities is particularly important given the diverse population and fragile political and economic situation. We aimed to characterise the COVID-19 pandemic in Brazil and assess variations in mortality according to region, ethnicity, comorbidities, and symptoms. METHODS We conducted a cross-sectional observational study of COVID-19 hospital mortality using data from the SIVEP-Gripe (Sistema de Informação de Vigilância Epidemiológica da Gripe) dataset to characterise the COVID-19 pandemic in Brazil. In the study, we included hospitalised patients who had a positive RT-PCR test for severe acute respiratory syndrome coronavirus 2 and who had ethnicity information in the dataset. Ethnicity of participants was classified according to the five categories used by the Brazilian Institute of Geography and Statistics: Branco (White), Preto (Black), Amarelo (East Asian), Indígeno (Indigenous), or Pardo (mixed ethnicity). We assessed regional variations in patients with COVID-19 admitted to hospital by state and by two socioeconomically grouped regions (north and central-south). We used mixed-effects Cox regression survival analysis to estimate the effects of ethnicity and comorbidity at an individual level in the context of regional variation. FINDINGS Of 99 557 patients in the SIVEP-Gripe dataset, we included 11 321 patients in our study. 9278 (82·0%) of these patients were from the central-south region, and 2043 (18·0%) were from the north region. Compared with White Brazilians, Pardo and Black Brazilians with COVID-19 who were admitted to hospital had significantly higher risk of mortality (hazard ratio [HR] 1·45, 95% CI 1·33-1·58 for Pardo Brazilians; 1·32, 1·15-1·52 for Black Brazilians). Pardo ethnicity was the second most important risk factor (after age) for death. Comorbidities were more common in Brazilians admitted to hospital in the north region than in the central-south, with similar proportions between the various ethnic groups. States in the north had higher HRs compared with those of the central-south, except for Rio de Janeiro, which had a much higher HR than that of the other central-south states. INTERPRETATION We found evidence of two distinct but associated effects: increased mortality in the north region (regional effect) and in the Pardo and Black populations (ethnicity effect). We speculate that the regional effect is driven by increasing comorbidity burden in regions with lower levels of socioeconomic development. The ethnicity effect might be related to differences in susceptibility to COVID-19 and access to health care (including intensive care) across ethnicities. Our analysis supports an urgent effort on the part of Brazilian authorities to consider how the national response to COVID-19 can better protect Pardo and Black Brazilians, as well as the population of poorer states, from their higher risk of dying of COVID-19. FUNDING None.
Collapse
Affiliation(s)
- Pedro Baqui
- Núcleo de Astrofísica e Cosmologia, Universidade Federal do Espírito Santo, Vitória, ES, Brazil
| | - Ioana Bica
- Department of Engineering Science, University of Oxford, Oxford, UK; The Alan Turing Institute, London, UK
| | - Valerio Marra
- Núcleo de Astrofísica e Cosmologia, Universidade Federal do Espírito Santo, Vitória, ES, Brazil; Departamento de Física, Universidade Federal do Espírito Santo, Vitória, ES, Brazil.
| | - Ari Ercole
- Department of Medicine, University of Cambridge, Cambridge, UK; Cambridge Centre for Artificial Intelligence in Medicine, Cambridge, UK
| | - Mihaela van der Schaar
- The Alan Turing Institute, London, UK; Cambridge Centre for Artificial Intelligence in Medicine, Cambridge, UK; Department of Applied Mathematics and Theoretical Physics and Department of Population Health, University of Cambridge, Cambridge, UK; Department of Electrical and Computer Engineering, University of California Los Angeles, Los Angeles, CA, USA
| |
Collapse
|
47
|
Yoon J, Drumright LN, van der Schaar M. Anonymization Through Data Synthesis Using Generative Adversarial Networks (ADS-GAN). IEEE J Biomed Health Inform 2020; 24:2378-2388. [PMID: 32167919 DOI: 10.1109/jbhi.2020.2980262] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The medical and machine learning communities are relying on the promise of artificial intelligence (AI) to transform medicine through enabling more accurate decisions and personalized treatment. However, progress is slow. Legal and ethical issues around unconsented patient data and privacy is one of the limiting factors in data sharing, resulting in a significant barrier in accessing routinely collected electronic health records (EHR) by the machine learning community. We propose a novel framework for generating synthetic data that closely approximates the joint distribution of variables in an original EHR dataset, providing a readily accessible, legally and ethically appropriate solution to support more open data sharing, enabling the development of AI solutions. In order to address issues around lack of clarity in defining sufficient anonymization, we created a quantifiable, mathematical definition for "identifiability". We used a conditional generative adversarial networks (GAN) framework to generate synthetic data while minimize patient identifiability that is defined based on the probability of re-identification given the combination of all data on any individual patient. We compared models fitted to our synthetically generated data to those fitted to the real data across four independent datasets to evaluate similarity in model performance, while assessing the extent to which original observations can be identified from the synthetic data. Our model, ADS-GAN, consistently outperformed state-of-the-art methods, and demonstrated reliability in the joint distributions. We propose that this method could be used to develop datasets that can be made publicly available while considerably lowering the risk of breaching patient confidentiality.
Collapse
|
48
|
Bugiardini R, Yoon J, Kedev S, Stankovic G, Vasiljevic Z, Miličić D, Manfrini O, van der Schaar M, Gale CP, Badimon L, Cenko E. Prior Beta-Blocker Therapy for Hypertension and Sex-Based Differences in Heart Failure Among Patients With Incident Coronary Heart Disease. Hypertension 2020; 76:819-826. [PMID: 32654558 DOI: 10.1161/hypertensionaha.120.15323] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The usefulness of β-blockers has been questioned for patients who have hypertension without a prior manifestation of coronary heart disease or heart failure. In addition, sex-based differences in the efficacy of β-blockers for prevention of heart failure during acute myocardial ischemia have never been evaluated. We explored whether the effect of β-blocker therapy varied according to the sex among patients with hypertension who have no prior history of cardiovascular disease. Data were drawn from the ISACS (International Survey of Acute Coronary Syndromes)-Archives. The study population consisted of 13 764 patients presenting with acute coronary syndromes. There were 2590 patients in whom hypertension was treated previously with β-blocker (954 women and 1636 men). Primary outcome measure was the incidence of heart failure according to Killip class classification. Subsidiary analyses were conducted to estimate the association between heart failure and all-cause mortality at 30 days. Outcome rates were assessed using the inverse probability of treatment weighting and logistic regression models. Estimates were compared by test of interaction on the log scale. Among patients taking β-blockers before admission, there was an absolute difference of 4.6% between women and men in the rate of heart failure (Killip ≥2) at hospital presentation (21.3% versus 16.7%; relative risk ratio, 1.35 [95% CI, 1.10-1.65]). On the opposite, the rate of heart failure was approximately similar among women and men who did not receive β-blockers (17.2% versus 16.1%; relative risk ratio, 1.09 [95% CI, 0.97-1.21]). The test of interaction identified a significant (P=0.034) association between sex and β-blocker therapy. Heart failure was predictive of mortality at 30-day either in women (odds ratio, 7.54 [95% CI, 5.78-9.83]) or men (odds ratio, 9.62 [95% CI, 7.67-12.07]). In conclusion, β-blockers use may be an acute precipitant of heart failure in new-onset coronary heart disease among women, but not men. Heart failure increases the risk of death. Registration URL: https://www.clinicaltrials.gov. Unique identifier: NCT04008173.
Collapse
Affiliation(s)
- Raffaele Bugiardini
- From the Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Italy (R.B., O.M., E.C.)
| | - Jinsung Yoon
- Electrical Engineering Department, University of California, UCLA, Los Angeles (J.Y.)
| | - Sasko Kedev
- University Clinic of Cardiology, Medical Faculty, University "Ss. Cyril and Methodius," Skopje, Macedonia (S.K.)
| | - Goran Stankovic
- Department of Cardiology, Clinical Center of Serbia, and Faculty of Medicine, University of Belgrade (G.S.), University of Belgrade, Serbia
| | | | - Davor Miličić
- Department for Cardiovascular Diseases, University Hospital Center Zagreb, University of Zagreb, Croatia (D.M.)
| | - Olivia Manfrini
- From the Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Italy (R.B., O.M., E.C.)
| | | | - Chris P Gale
- Leeds Institute of Cardiovascular and Metabolic Medicine, University of Leeds, United Kingdom (C.P.G.)
| | - Lina Badimon
- Cardiovascular Research Institute (ICCC), CiberCV-Institute Carlos III, IIB-Sant Pau, Hospital de la Santa Creu i Sant Pau, Autonomous University of Barcelona, Spain (L.B.)
| | - Edina Cenko
- From the Department of Experimental, Diagnostic and Specialty Medicine, University of Bologna, Italy (R.B., O.M., E.C.)
| |
Collapse
|
49
|
Rahbar H, Hippe DS, Alaa A, Cheeney SH, van der Schaar M, Partridge SC, Lee CI. The Value of Patient and Tumor Factors in Predicting Preoperative Breast MRI Outcomes. Radiol Imaging Cancer 2020; 2:e190099. [PMID: 32803166 DOI: 10.1148/rycan.2020190099] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Revised: 02/12/2020] [Accepted: 03/16/2020] [Indexed: 12/17/2022]
Abstract
Purpose To identify patient and tumor features that predict true-positive, false-positive, and negative breast preoperative MRI outcomes. Materials and Methods Using a breast MRI database from a large regional cancer center, the authors retrospectively identified all women with unilateral breast cancer who underwent preoperative MRI from January 2005 to February 2015. A total of 1396 women with complete data were included. Patient features (ie, age, breast density) and index tumor features (ie, type, grade, hormone receptor, human epidermal growth factor receptor type 2/neu, Ki-67) were extracted and compared with preoperative MRI outcomes (ie, true positive, false positive, negative) using univariate (ie, Fisher exact) and multivariate machine learning approaches (ie, least absolute shrinkage and selection operator, AutoPrognosis). Overall prediction performance was summarized using the area under the receiver operating characteristic curve (AUC), calculated using internal validation techniques (bootstrap and cross-validation) to account for model training. Results At the examination level, 181 additional cancers were identified among 1396 total preoperative MRI examinations (median patient age, 56 years; range, 25-94 years), resulting in a positive predictive value for biopsy of 43% (181 true-positive findings of 419 core-needle biopsies). In univariate analysis, no patient or tumor feature was associated with a true-positive outcome (P > .05), although greater mammographic density (P = .022) and younger age (< 50 years, P = .025) were associated with false-positive examinations. Machine learning approaches provided weak performance for predicting true-positive, false-positive, and negative examinations (AUC range, 0.50-0.57). Conclusion Commonly used patient and tumor factors driving expert opinion for the use of preoperative MRI provide limited predictive value for determining preoperative MRI outcomes in women. Supplemental material is available for this article. © RSNA, 2020See also the commentary by Grimm in this issue.
Collapse
Affiliation(s)
- Habib Rahbar
- Department of Radiology, University of Washington School of Medicine, Seattle Cancer Care Alliance, 1144 Eastlake Ave E, Room LG2-211, Seattle, WA 98109 (H.R., D.S.H., S.H.C., S.C.P., C.I.L.); Department of Electrical and Computer Engineering, University of California, Los Angeles, Los Angeles, Calif (A.A., M.v.d.S.); and Departments of Applied Mathematics and Theoretical Physics and Public Health, University of Cambridge, Cambridge, England (M.v.d.S.)
| | - Daniel S Hippe
- Department of Radiology, University of Washington School of Medicine, Seattle Cancer Care Alliance, 1144 Eastlake Ave E, Room LG2-211, Seattle, WA 98109 (H.R., D.S.H., S.H.C., S.C.P., C.I.L.); Department of Electrical and Computer Engineering, University of California, Los Angeles, Los Angeles, Calif (A.A., M.v.d.S.); and Departments of Applied Mathematics and Theoretical Physics and Public Health, University of Cambridge, Cambridge, England (M.v.d.S.)
| | - Ahmed Alaa
- Department of Radiology, University of Washington School of Medicine, Seattle Cancer Care Alliance, 1144 Eastlake Ave E, Room LG2-211, Seattle, WA 98109 (H.R., D.S.H., S.H.C., S.C.P., C.I.L.); Department of Electrical and Computer Engineering, University of California, Los Angeles, Los Angeles, Calif (A.A., M.v.d.S.); and Departments of Applied Mathematics and Theoretical Physics and Public Health, University of Cambridge, Cambridge, England (M.v.d.S.)
| | - Safia H Cheeney
- Department of Radiology, University of Washington School of Medicine, Seattle Cancer Care Alliance, 1144 Eastlake Ave E, Room LG2-211, Seattle, WA 98109 (H.R., D.S.H., S.H.C., S.C.P., C.I.L.); Department of Electrical and Computer Engineering, University of California, Los Angeles, Los Angeles, Calif (A.A., M.v.d.S.); and Departments of Applied Mathematics and Theoretical Physics and Public Health, University of Cambridge, Cambridge, England (M.v.d.S.)
| | - Mihaela van der Schaar
- Department of Radiology, University of Washington School of Medicine, Seattle Cancer Care Alliance, 1144 Eastlake Ave E, Room LG2-211, Seattle, WA 98109 (H.R., D.S.H., S.H.C., S.C.P., C.I.L.); Department of Electrical and Computer Engineering, University of California, Los Angeles, Los Angeles, Calif (A.A., M.v.d.S.); and Departments of Applied Mathematics and Theoretical Physics and Public Health, University of Cambridge, Cambridge, England (M.v.d.S.)
| | - Savannah C Partridge
- Department of Radiology, University of Washington School of Medicine, Seattle Cancer Care Alliance, 1144 Eastlake Ave E, Room LG2-211, Seattle, WA 98109 (H.R., D.S.H., S.H.C., S.C.P., C.I.L.); Department of Electrical and Computer Engineering, University of California, Los Angeles, Los Angeles, Calif (A.A., M.v.d.S.); and Departments of Applied Mathematics and Theoretical Physics and Public Health, University of Cambridge, Cambridge, England (M.v.d.S.)
| | - Christoph I Lee
- Department of Radiology, University of Washington School of Medicine, Seattle Cancer Care Alliance, 1144 Eastlake Ave E, Room LG2-211, Seattle, WA 98109 (H.R., D.S.H., S.H.C., S.C.P., C.I.L.); Department of Electrical and Computer Engineering, University of California, Los Angeles, Los Angeles, Calif (A.A., M.v.d.S.); and Departments of Applied Mathematics and Theoretical Physics and Public Health, University of Cambridge, Cambridge, England (M.v.d.S.)
| |
Collapse
|
50
|
Ruan Y, Bellot A, Moysova Z, Tan GD, Lumb A, Davies J, van der Schaar M, Rea R. Predicting the Risk of Inpatient Hypoglycemia With Machine Learning Using Electronic Health Records. Diabetes Care 2020; 43:1504-1511. [PMID: 32350021 DOI: 10.2337/dc19-1743] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Accepted: 04/04/2020] [Indexed: 02/03/2023]
Abstract
OBJECTIVE We analyzed data from inpatients with diabetes admitted to a large university hospital to predict the risk of hypoglycemia through the use of machine learning algorithms. RESEARCH DESIGN AND METHODS Four years of data were extracted from a hospital electronic health record system. This included laboratory and point-of-care blood glucose (BG) values to identify biochemical and clinically significant hypoglycemic episodes (BG ≤3.9 and ≤2.9 mmol/L, respectively). We used patient demographics, administered medications, vital signs, laboratory results, and procedures performed during the hospital stays to inform the model. Two iterations of the data set included the doses of insulin administered and the past history of inpatient hypoglycemia. Eighteen different prediction models were compared using the area under the receiver operating characteristic curve (AUROC) through a 10-fold cross validation. RESULTS We analyzed data obtained from 17,658 inpatients with diabetes who underwent 32,758 admissions between July 2014 and August 2018. The predictive factors from the logistic regression model included people undergoing procedures, weight, type of diabetes, oxygen saturation level, use of medications (insulin, sulfonylurea, and metformin), and albumin levels. The machine learning model with the best performance was the XGBoost model (AUROC 0.96). This outperformed the logistic regression model, which had an AUROC of 0.75 for the estimation of the risk of clinically significant hypoglycemia. CONCLUSIONS Advanced machine learning models are superior to logistic regression models in predicting the risk of hypoglycemia in inpatients with diabetes. Trials of such models should be conducted in real time to evaluate their utility to reduce inpatient hypoglycemia.
Collapse
Affiliation(s)
- Yue Ruan
- Oxford Centre for Diabetes, Endocrinology and Metabolism, Oxford University Hospitals National Health Service Foundation Trust, Oxford, U.K.,Oxford National Institute for Health Research Biomedical Research Centre, Oxford, U.K.,Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Oxford, U.K
| | - Alexis Bellot
- Department of Mathematics, University of Cambridge, Cambridge, U.K.,Alan Turing Institute, London, U.K
| | - Zuzana Moysova
- Big Data Institute, University of Oxford Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, U.K
| | - Garry D Tan
- Oxford Centre for Diabetes, Endocrinology and Metabolism, Oxford University Hospitals National Health Service Foundation Trust, Oxford, U.K.,Oxford National Institute for Health Research Biomedical Research Centre, Oxford, U.K
| | - Alistair Lumb
- Oxford Centre for Diabetes, Endocrinology and Metabolism, Oxford University Hospitals National Health Service Foundation Trust, Oxford, U.K.,Oxford National Institute for Health Research Biomedical Research Centre, Oxford, U.K
| | - Jim Davies
- Big Data Institute, University of Oxford Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, U.K
| | - Mihaela van der Schaar
- Department of Mathematics, University of Cambridge, Cambridge, U.K.,Alan Turing Institute, London, U.K
| | - Rustam Rea
- Oxford Centre for Diabetes, Endocrinology and Metabolism, Oxford University Hospitals National Health Service Foundation Trust, Oxford, U.K. .,Oxford National Institute for Health Research Biomedical Research Centre, Oxford, U.K
| |
Collapse
|