1
|
Petch J, Nelson W, Wu M, Ghassemi M, Benz A, Fatemi M, Di S, Carnicelli A, Granger C, Giugliano R, Hong H, Patel M, Wallentin L, Eikelboom J, Connolly SJ. Optimizing warfarin dosing for patients with atrial fibrillation using machine learning. Sci Rep 2024; 14:4516. [PMID: 38402362 PMCID: PMC10894214 DOI: 10.1038/s41598-024-55110-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Accepted: 02/20/2024] [Indexed: 02/26/2024] Open
Abstract
While novel oral anticoagulants are increasingly used to reduce risk of stroke in patients with atrial fibrillation, vitamin K antagonists such as warfarin continue to be used extensively for stroke prevention across the world. While effective in reducing the risk of strokes, the complex pharmacodynamics of warfarin make it difficult to use clinically, with many patients experiencing under- and/or over- anticoagulation. In this study we employed a novel implementation of deep reinforcement learning to provide clinical decision support to optimize time in therapeutic International Normalized Ratio (INR) range. We used a novel semi-Markov decision process formulation of the Batch-Constrained deep Q-learning algorithm to develop a reinforcement learning model to dynamically recommend optimal warfarin dosing to achieve INR of 2.0-3.0 for patients with atrial fibrillation. The model was developed using data from 22,502 patients in the warfarin treated groups of the pivotal randomized clinical trials of edoxaban (ENGAGE AF-TIMI 48), apixaban (ARISTOTLE) and rivaroxaban (ROCKET AF). The model was externally validated on data from 5730 warfarin-treated patients in a fourth trial of dabigatran (RE-LY) using multilevel regression models to estimate the relationship between center-level algorithm consistent dosing, time in therapeutic INR range (TTR), and a composite clinical outcome of stroke, systemic embolism or major hemorrhage. External validation showed a positive association between center-level algorithm-consistent dosing and TTR (R2 = 0.56). Each 10% increase in algorithm-consistent dosing at the center level independently predicted a 6.78% improvement in TTR (95% CI 6.29, 7.28; p < 0.001) and a 11% decrease in the composite clinical outcome (HR 0.89; 95% CI 0.81, 1.00; p = 0.015). These results were comparable to those of a rules-based clinical algorithm used for benchmarking, for which each 10% increase in algorithm-consistent dosing independently predicted a 6.10% increase in TTR (95% CI 5.67, 6.54, p < 0.001) and a 10% decrease in the composite outcome (HR 0.90; 95% CI 0.83, 0.98, p = 0.018). Our findings suggest that a deep reinforcement learning algorithm can optimize time in therapeutic range for patients taking warfarin. A digital clinical decision support system to promote algorithm-consistent warfarin dosing could optimize time in therapeutic range and improve clinical outcomes in atrial fibrillation globally.
Collapse
Affiliation(s)
- Jeremy Petch
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada.
- Population Health Research Institute, Hamilton, ON, Canada.
- Division of Cardiology, Department of Medicine, McMaster University, Hamilton, ON, Canada.
- Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, ON, Canada.
| | - Walter Nelson
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada
- Department of Statistical Sciences, University of Toronto, Toronto, ON, Canada
| | - Mary Wu
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Marzyeh Ghassemi
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
- Institute for Medical and Evaluative Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- Vector Institute, Toronto, ON, Canada
| | - Alexander Benz
- Population Health Research Institute, Hamilton, ON, Canada
- Department of Cardiology, University Medical Center, Johannes Gutenberg University Mainz, Mainz, Germany
| | | | - Shuang Di
- Centre for Data Science and Digital Health, Hamilton Health Sciences, Hamilton, ON, Canada
- Dalla Lana School of Public Health, University of Toronto, Toronto, ON, Canada
| | - Anthony Carnicelli
- Division of Cardiology, Department of Medicine, Duke University Medical Center, Durham, NC, USA
- Duke Clinical Research Institute, Duke University, Durham, NC, USA
| | - Christopher Granger
- Division of Cardiology, Department of Medicine, Duke University Medical Center, Durham, NC, USA
- Duke Clinical Research Institute, Duke University, Durham, NC, USA
| | - Robert Giugliano
- Division of Cardiovascular Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Hwanhee Hong
- Duke Clinical Research Institute, Duke University, Durham, NC, USA
- Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, NC, USA
| | - Manesh Patel
- Division of Cardiology, Department of Medicine, Duke University Medical Center, Durham, NC, USA
| | - Lars Wallentin
- Department of Medical Sciences, Cardiology, Uppsala University, Uppsala, Sweden
- Uppsala Clinical Research Center, Uppsala University, Uppsala, Sweden
| | - John Eikelboom
- Population Health Research Institute, Hamilton, ON, Canada
- Division of Hematology and Thromboembolism, Department of Medicine, McMaster University, Hamilton, ON, Canada
| | - Stuart J Connolly
- Population Health Research Institute, Hamilton, ON, Canada
- Division of Cardiology, Department of Medicine, McMaster University, Hamilton, ON, Canada
| |
Collapse
|