1
|
Khajuria R, Sarwar A. Review of reinforcement learning applications in segmentation, chemotherapy, and radiotherapy of cancer. Micron 2024; 178:103583. [PMID: 38185018 DOI: 10.1016/j.micron.2023.103583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 10/16/2023] [Accepted: 12/20/2023] [Indexed: 01/09/2024]
Abstract
Owing to early diagnosis and treatment of cancer as a prerequisite in recent times, the role of machine learning has been increased substantially. The mathematically powerful and optimized solutions for the detection and cure of cancer are constantly being explored and novel models based upon standard algorithms are also being developed. Leveraging one such solution is Reinforcement Learning (RL), which is a semi-supervised type of learning. The paper presents a detailed discussion on the various RL techniques, algorithms, and open issues, in addition to the review of literature for diagnosis and treatment of cancer. A smaller number of publications for diagnosis and treatment of cancer have been reported before 2011 but now after the success of Deep Learning (DL) and the advent of Deep Reinforcement Learning (DRL), the publications have grown in number from 2017 onwards. The scope of RL for cancer diagnosis and treatment is also demystified and provides the research community with the insights of how to formulate RL problem as a Cancer diagnostic problem. RL has been found successful for landmark detection in medical images and optimal control of drugs and radiations.
Collapse
|
2
|
Wu D, Goldfeld KS, Petkova E, Park HG. Improving Individualized Treatment Decisions: A Bayesian Multivariate Hierarchical Model for Developing a Treatment Benefit Index using Mixed Types of Outcomes. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2023.11.17.23298711. [PMID: 38014277 PMCID: PMC10680905 DOI: 10.1101/2023.11.17.23298711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Background Precision medicine has led to the development of targeted treatment strategies tailored to individual patients based on their characteristics and disease manifestations. Although precision medicine often focuses on a single health outcome for individualized treatment decision rules (ITRs), relying only on a single outcome rather than all available outcomes information leads to suboptimal data usage when developing optimal ITRs. Methods To address this limitation, we propose a Bayesian multivariate hierarchical model that leverages the wealth of correlated health outcomes collected in clinical trials. The approach jointly models mixed types of correlated outcomes, facilitating the "borrowing of information" across the multivariate outcomes, and results in a more accurate estimation of heterogeneous treatment effects compared to using single regression models for each outcome. We develop a treatment benefit index, which quantifies the relative treatment benefit of the experimental treatment over the control treatment, based on the proposed multivariate outcome model. Results We demonstrate the strengths of the proposed approach through extensive simulations and an application to an international Coronavirus Disease 2019 (COVID-19) treatment trial. Simulation results indicate that the proposed method reduces the occurrence of erroneous treatment decisions compared to a single regression model for a single health outcome. Additionally, the sensitivity analysis demonstrates the robustness of the model across various study scenarios. Application of the method to the COVID-19 trial exhibits improvements in estimating the individual-level treatment efficacy (indicated by narrower credible intervals for odds ratios) and optimal ITRs. Conclusion The study jointly models mixed types of outcomes in the context of developing ITRs. By considering multiple health outcomes, the proposed approach can advance the development of more effective and reliable personalized treatment.
Collapse
Affiliation(s)
- Danni Wu
- Department of Population Health, New York University Grossman School of Medicine, 180 Madison Avenue, New York, 10016, New York, USA
| | - Keith S. Goldfeld
- Department of Population Health, New York University Grossman School of Medicine, 180 Madison Avenue, New York, 10016, New York, USA
| | - Eva Petkova
- Department of Population Health, New York University Grossman School of Medicine, 180 Madison Avenue, New York, 10016, New York, USA
| | - Hyung G. Park
- Department of Population Health, New York University Grossman School of Medicine, 180 Madison Avenue, New York, 10016, New York, USA
| |
Collapse
|
3
|
Tran TD, Abad AA, Verbeke G, Molenberghs G, Van Mechelen I. Reflections on the concept of optimality of single decision point treatment regimes. Biom J 2023; 65:e2200285. [PMID: 37736675 DOI: 10.1002/bimj.202200285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 06/28/2023] [Accepted: 07/30/2023] [Indexed: 09/23/2023]
Abstract
In many areas, applied researchers as well as practitioners have to choose between different solutions for a problem at hand; this calls for optimal decision rules to settle the choices involved. As a key example, one may think of the search for optimal treatment regimes (OTRs) in clinical research, that specify which treatment alternative should be administered to each patient under study. Motivated by the fact that the concept of optimality of decision rules in general and treatment regimes in particular has received so far relatively little attention and discussion, we will present a number of reflections on it, starting from the basics of any optimization problem. Specifically, we will analyze the search space and the to be optimized criterion function underlying the search of single decision point OTRs, along with the many choice aspects that show up in their specification. Special attention is paid to formal characteristics and properties as well as to substantive concerns and hypotheses that may guide these choices. We illustrate with a few empirical examples taken from the literature. Finally, we discuss how the presented reflections may help sharpen statistical thinking about optimality of decision rules for treatment assignment and to facilitate the dialogue between the statistical consultant and the applied researcher in search of an OTR.
Collapse
Affiliation(s)
- Trung Dung Tran
- Quantitative Psychology and Individual Differences, KU Leuven, Leuven, Belgium
| | | | - Geert Verbeke
- I-BioStat, KU Leuven, Leuven, Belgium
- I-BioStat, Universiteit Hasselt, Hasselt, Belgium
| | - Geert Molenberghs
- I-BioStat, KU Leuven, Leuven, Belgium
- I-BioStat, Universiteit Hasselt, Hasselt, Belgium
| | - Iven Van Mechelen
- Quantitative Psychology and Individual Differences, KU Leuven, Leuven, Belgium
| |
Collapse
|
4
|
Li Z, Chen J, Laber E, Liu F, Baumgartner R. Optimal Treatment Regimes: A Review and Empirical Comparison. Int Stat Rev 2023. [DOI: 10.1111/insr.12536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
Affiliation(s)
- Zhen Li
- Department of Statistics North Carolina State University Raleigh 27607 NC USA
| | - Jie Chen
- Department of Biometrics Overland Pharmaceuticals Dover 19901 DE USA
| | - Eric Laber
- Department of Statistical Science, Department of Biostatistics and Bioinformatics Duke University Durham 27708 NC USA
| | - Fang Liu
- Biostatistics and Research Decision Sciences Merck & Co., Inc. Kenilworth NJ 07033 USA
| | - Richard Baumgartner
- Biostatistics and Research Decision Sciences Merck & Co., Inc. Kenilworth NJ 07033 USA
| |
Collapse
|
5
|
Ma H, Zeng D, Liu Y. Learning Optimal Group-structured Individualized Treatment Rules with Many Treatments. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2023; 24:102. [PMID: 37588020 PMCID: PMC10426767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
Data driven individualized decision making problems have received a lot of attentions in recent years. In particular, decision makers aim to determine the optimal Individualized Treatment Rule (ITR) so that the expected specified outcome averaging over heterogeneous patient-specific characteristics is maximized. Many existing methods deal with binary or a moderate number of treatment arms and may not take potential treatment effect structure into account. However, the effectiveness of these methods may deteriorate when the number of treatment arms becomes large. In this article, we propose GRoup Outcome Weighted Learning (GROWL) to estimate the latent structure in the treatment space and the optimal group-structured ITRs through a single optimization. In particular, for estimating group-structured ITRs, we utilize the Reinforced Angle based Multicategory Support Vector Machines (RAMSVM) to learn group-based decision rules under the weighted angle based multi-class classification framework. Fisher consistency, the excess risk bound, and the convergence rate of the value function are established to provide a theoretical guarantee for GROWL. Extensive empirical results in simulation studies and real data analysis demonstrate that GROWL enjoys better performance than several other existing methods.
Collapse
Affiliation(s)
- Haixu Ma
- Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Donglin Zeng
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Yufeng Liu
- Department of Statistics and Operations Research, Department of Genetics, Department of Biostatistics, Carolina Center for Genome Science, Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
6
|
Kulasekera K, Siriwardhana C. Quantiles based personalized treatment selection for multivariate outcomes and multiple treatments. Stat Med 2022; 41:2695-2710. [PMID: 35699385 PMCID: PMC9232994 DOI: 10.1002/sim.9377] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2021] [Revised: 02/24/2022] [Accepted: 02/26/2022] [Indexed: 11/29/2023]
Abstract
In this work, we propose a method for individualized treatment selection when there are correlated multiple responses for the K treatment ( K ≥ 2 ) scenario. Here we use ranks of quantiles of outcome variables for each treatment conditional on patient-specific scores constructed from collected covariate measurements. Our method covers any number of treatments and outcome variables using any number of quantiles and it can be applied for a broad set of models. We propose a rank aggregation technique for combining several lists of ranks where both these lists and elements within each list can be correlated. The method has the flexibility to incorporate patient and clinician preferences into the optimal treatment decision on an individual case basis. A simulation study demonstrates the performance of the proposed method in finite samples. We also present illustrations using two different datasets from diabetes and HIV-1 clinical trials to show the applicability of the proposed procedure for real data.
Collapse
Affiliation(s)
- K.B. Kulasekera
- Department of Bioinformatics & Biostatistics, University of Louisville, Louisville, KY 40202, USA
| | - Chathura Siriwardhana
- Department of Quantitave Helath Sciences, University of Hawaii John A. Burns School of Medicine, Honolulu, HI 96813, USA
| |
Collapse
|
7
|
Wang Y, Zhao Y, Zheng Y. Targeted Search for Individualized Clinical Decision Rules to Optimize Clinical Outcomes. STATISTICS IN BIOSCIENCES 2022. [DOI: 10.1007/s12561-022-09343-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
8
|
Xu R, Chen G, Connor M, Murphy J. Novel Use of Patient-Specific Covariates From Oncology Studies in the Era of Biomedical Data Science: A Review of Latest Methodologies. J Clin Oncol 2022; 40:3546-3553. [PMID: 35258995 DOI: 10.1200/jco.21.01957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
In this article, we review different applications of how to incorporate individual patient variables into clinical research within oncology. These methodologies range from the more traditional use of baseline covariates from randomized clinical trials, as well as observational studies, to using covariates to generalize the results of randomized clinical trials to other populations. Individual patient variables also allow for the consideration of heterogeneity in treatment effects and individualized treatment rules. We primarily consider two treatment groups and mostly focus on time-to-event outcomes where such methodologies have been well established and widely applied. We also discuss more conceptually newer statistical research that has not been widely applied in clinical oncology, but is likely to make an impact in future oncology research. With the increasing amount of biomedical data available for analysis, it is inevitable that more methods are developed to make best use of information, to advance oncology research.
Collapse
Affiliation(s)
- Ronghui Xu
- Univerity of California, San Diego, San Diego, CA
| | | | | | - James Murphy
- Univerity of California, San Diego, San Diego, CA
| |
Collapse
|
9
|
Kulasekera K, Siriwardhana C. Multi-Response Based Personalized Treatment Selection with Data from Crossover Designs for Multiple Treatments. COMMUN STAT-SIMUL C 2022; 51:554-569. [PMID: 35299995 PMCID: PMC8923529 DOI: 10.1080/03610918.2019.1656739] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
In this work we propose a novel method for treatment selection based on individual covariate information when the treatment response is multivariate and data are available from a crossover design. Our method covers any number of treatments and it can be applied for a broad set of models. The proposed method uses a rank aggregation technique to estimate an ordering of treatments based on ranked lists of treatment performance measures such as smooth conditional means and conditional probability of a response for one treatment dominating others. An empirical study demonstrates the performance of the proposed method in finite samples.
Collapse
Affiliation(s)
- K.B. Kulasekera
- Department of Bioinformatics & Biostatistics, University of Louisville, Louisville, KY 40202, USA
| | - Chathura Siriwardhana
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, Honolulu, HI 96813, USA
| |
Collapse
|
10
|
Tang M, Wang L, Gorin MA, Taylor JMG. Step-adjusted tree-based reinforcement learning for evaluating nested dynamic treatment regimes using test-and-treat observational data. Stat Med 2021; 40:6164-6177. [PMID: 34490942 DOI: 10.1002/sim.9177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 07/31/2021] [Accepted: 08/09/2021] [Indexed: 11/08/2022]
Abstract
Dynamic treatment regimes (DTRs) include a sequence of treatment decision rules, in which treatment is adapted over time in response to the changes in an individual's disease progression and health care history. In medical practice, nested test-and-treat strategies are common to improve cost-effectiveness. For example, for patients at risk of prostate cancer, only patients who have high prostate-specific antigen (PSA) need a biopsy, which is costly and invasive, to confirm the diagnosis and help determine the treatment if needed. A decision about treatment happens after the biopsy, and is thus nested within the decision of whether to do the test. However, current existing statistical methods are not able to accommodate such a naturally embedded property of the treatment decision within the test decision. Therefore, we developed a new statistical learning method, step-adjusted tree-based reinforcement learning, to evaluate DTRs within such a nested multistage dynamic decision framework using observational data. At each step within each stage, we combined the robust semiparametric estimation via augmented inverse probability weighting with a tree-based reinforcement learning method to deal with the counterfactual optimization. The simulation studies demonstrated robust performance of the proposed methods under different scenarios. We further applied our method to evaluate the necessity of prostate biopsy and identify the optimal test-and-treat regimes for prostate cancer patients using data from the Johns Hopkins University prostate cancer active surveillance dataset.
Collapse
Affiliation(s)
- Ming Tang
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| | - Lu Wang
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| | - Michael A Gorin
- The James Buchanan Brady Urological Institute and Department of Urology, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
| | - Jeremy M G Taylor
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| |
Collapse
|
11
|
Doubleday K, Zhou J, Zhou H, Fu H. Risk controlled decision trees and random forests for precision Medicine. Stat Med 2021; 41:719-735. [PMID: 34786731 PMCID: PMC8863134 DOI: 10.1002/sim.9253] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Revised: 10/15/2021] [Accepted: 10/15/2021] [Indexed: 11/08/2022]
Abstract
Statistical methods generating individualized treatment rules (ITRs) often focus on maximizing expected benefit, but these rules may expose patients to excess risk. For instance, aggressive treatment of type 2 diabetes (T2D) with insulin therapies may result in an ITR which controls blood glucose levels but increases rates of hypoglycemia, diminishing the appeal of the ITR. This work proposes two methods to identify risk-controlled ITRs (rcITR), a class of ITR which maximizes a benefit while controlling risk at a prespecified threshold. A novel penalized recursive partitioning algorithm is developed which optimizes an unconstrained, penalized value function. The final rule is a risk-controlled decision tree (rcDT) that is easily interpretable. A natural extension of the rcDT model, risk controlled random forests (rcRF), is also proposed. Simulation studies demonstrate the robustness of rcRF modeling. Three variable importance measures are proposed to further guide clinical decision-making. Both rcDT and rcRF procedures can be applied to data from randomized controlled trials or observational studies. An extensive simulation study interrogates the performance of the proposed methods. A data analysis of the DURABLE diabetes trial in which two therapeutics were compared is additionally presented. An R package implements the proposed methods ( https://github.com/kdoub5ha/rcITR).
Collapse
Affiliation(s)
- Kevin Doubleday
- Department of Biostatistics, University of Arizona, Tucson, Arizona, USA
| | - Jin Zhou
- Department of Biostatistics, University of California, Los Angeles, California, USA
| | - Hua Zhou
- Department of Biostatistics, University of California, Los Angeles, California, USA
| | - Haoda Fu
- Eli Lilly and Company, Indianapolis, Indiana, USA
| |
Collapse
|
12
|
Siriwardhana C, Kulasekera K. Optimal Personalized Treatment Selection with Multivariate Outcome Measures in a Multiple Treatment Case. COMMUN STAT-SIMUL C 2021; 52:5773-5787. [PMID: 38371330 PMCID: PMC10871612 DOI: 10.1080/03610918.2021.1999473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 10/24/2021] [Indexed: 10/19/2022]
Abstract
In this work we propose a novel method for individualized treatment selection when there are correlated multiple treatment responses. For the K treatment (K ≥ 2) scenario, we compare quantities that are suitable indexes based on outcome variables for each treatment conditional on patient-specific scores constructed from collected covariate measurements. Our method covers any number of treatments and outcome variables, and it can be applied for a broad set of models. The proposed method uses a rank aggregation technique that takes into account possible correlations among ranked lists to estimate an ordering of treatments based on treatment performance measures such as the smooth conditional mean. The method has the flexibility to incorporate patient and clinician preferences into the optimal treatment decision on an individual case basis. A simulation study demonstrates the performance of the proposed method in finite samples. We also present data analyses using HIV clinical trial data to show the applicability of the proposed procedure for real data.
Collapse
Affiliation(s)
- Chathura Siriwardhana
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, HI, USA
| | - K.B. Kulasekera
- Department of Bioinformatics & Biostatistics, University of Louisville, Louisville, KY, USA
| |
Collapse
|
13
|
Kapelner A, Bleich J, Levine A, Cohen ZD, DeRubeis RJ, Berk R. Evaluating the Effectiveness of Personalized Medicine With Software. Front Big Data 2021; 4:572532. [PMID: 34085036 PMCID: PMC8167073 DOI: 10.3389/fdata.2021.572532] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2020] [Accepted: 02/03/2021] [Indexed: 11/13/2022] Open
Abstract
We present methodological advances in understanding the effectiveness of personalized medicine models and supply easy-to-use open-source software. Personalized medicine involves the systematic use of individual patient characteristics to determine which treatment option is most likely to result in a better average outcome for the patient. Why is personalized medicine not done more in practice? One of many reasons is because practitioners do not have any easy way to holistically evaluate whether their personalization procedure does better than the standard of care, termed improvement. Our software, "Personalized Treatment Evaluator" (the R package PTE), provides inference for improvement out-of-sample in many clinical scenarios. We also extend current methodology by allowing evaluation of improvement in the case where the endpoint is binary or survival. In the software, the practitioner inputs 1) data from a single-stage randomized trial with one continuous, incidence or survival endpoint and 2) an educated guess of a functional form of a model for the endpoint constructed from domain knowledge. The bootstrap is then employed on data unseen during model fitting to provide confidence intervals for the improvement for the average future patient (assuming future patients are similar to the patients in the trial). One may also test against a null scenario where the hypothesized personalization are not more useful than a standard of care. We demonstrate our method's promise on simulated data as well as on data from a randomized comparative trial investigating two treatments for depression.
Collapse
Affiliation(s)
- Adam Kapelner
- Department of Mathematics, Queens College, CUNY, Queens, NY, United States
| | - Justin Bleich
- Department of Statistics, The Wharton School of the University of Pennsylvania, Philadelphia, PA, United States
| | - Alina Levine
- Department of Mathematics, Queens College, CUNY, Queens, NY, United States
| | - Zachary D. Cohen
- Department of Psychology, University of Pennsylvania, Philadelphia, PA, United States
| | - Robert J. DeRubeis
- Department of Psychology, University of Pennsylvania, Philadelphia, PA, United States
| | - Richard Berk
- Department of Statistics, The Wharton School of the University of Pennsylvania, Philadelphia, PA, United States
| |
Collapse
|
14
|
Luckett DJ, Laber EB, Kim S, Kosorok MR. Estimation and Optimization of Composite Outcomes. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2021; 22:167. [PMID: 34733120 PMCID: PMC8562677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
There is tremendous interest in precision medicine as a means to improve patient outcomes by tailoring treatment to individual characteristics. An individualized treatment rule formalizes precision medicine as a map from patient information to a recommended treatment. A treatment rule is defined to be optimal if it maximizes the mean of a scalar outcome in a population of interest, e.g., symptom reduction. However, clinical and intervention scientists often seek to balance multiple and possibly competing outcomes, e.g., symptom reduction and the risk of an adverse event. One approach to precision medicine in this setting is to elicit a composite outcome which balances all competing outcomes; unfortunately, eliciting a composite outcome directly from patients is difficult without a high-quality instrument, and an expert-derived composite outcome may not account for heterogeneity in patient preferences. We propose a new paradigm for the study of precision medicine using observational data that relies solely on the assumption that clinicians are approximately (i.e., imperfectly) making decisions to maximize individual patient utility. Estimated composite outcomes are subsequently used to construct an estimator of an individualized treatment rule which maximizes the mean of patient-specific composite outcomes. The estimated composite outcomes and estimated optimal individualized treatment rule provide new insights into patient preference heterogeneity, clinician behavior, and the value of precision medicine in a given domain. We derive inference procedures for the proposed estimators under mild conditions and demonstrate their finite sample performance through a suite of simulation experiments and an illustrative application to data from a study of bipolar depression.
Collapse
Affiliation(s)
| | - Eric B Laber
- Department of Statistics, North Carolina State University, Raleigh, NC 27607, USA
| | - Siyeon Kim
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27607, USA
| | - Michael R Kosorok
- Departments of Biostatistics and Statistics & Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
15
|
Sperger J, Freeman NLB, Jiang X, Bang D, Marchi D, Kosorok MR. The future of precision health is data‐driven decision support. Stat Anal Data Min 2020. [DOI: 10.1002/sam.11475] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- John Sperger
- Department of Biostatistics University of North Carolina at Chapel Hill Chapel Hill North Carolina USA
| | - Nikki L. B. Freeman
- Department of Biostatistics University of North Carolina at Chapel Hill Chapel Hill North Carolina USA
| | - Xiaotong Jiang
- Department of Biostatistics University of North Carolina at Chapel Hill Chapel Hill North Carolina USA
| | - David Bang
- Department of Biostatistics University of North Carolina at Chapel Hill Chapel Hill North Carolina USA
| | - Daniel Marchi
- Department of Biostatistics University of North Carolina at Chapel Hill Chapel Hill North Carolina USA
| | - Michael R. Kosorok
- Department of Biostatistics University of North Carolina at Chapel Hill Chapel Hill North Carolina USA
| |
Collapse
|
16
|
Siriwardhana C, Kulasekera KB. Personalized treatment plans with multivariate outcomes. Biom J 2020; 62:1973-1985. [PMID: 32627863 DOI: 10.1002/bimj.201800072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2018] [Revised: 03/03/2020] [Accepted: 03/21/2020] [Indexed: 11/09/2022]
Abstract
In this work, we propose a novel method for individualized treatment selection when the treatment response is multivariate. Our method covers any number of treatments and it can be applied for a broad set of models. The proposed method uses a Mahalanobis-type distance measure to establish an ordering of treatments based on treatment performance measures. Our investigation in this work deals with means of responses conditional on lower dimensional composite scores based on covariates where these scores are built using single index models to approximate mean responses against patient covariates. Smoothed estimates of such conditional means are combined to construct an estimate of the aforementioned distance measure, which is then used to estimate the optimal treatment. An empirical study demonstrates the performance of the proposed method in finite samples. We also present a data analysis using an HIV clinical trial data to show the applicability of the proposed procedure for real data.
Collapse
Affiliation(s)
- Chathura Siriwardhana
- Department of Quantitative Health Sciences, John A. Burns School of Medicine, University of Hawaii, Honolulu, HI, USA
| | | |
Collapse
|
17
|
Dong L, Laber E, Goldberg Y, Song R, Yang S. Ascertaining properties of weighting in the estimation of optimal treatment regimes under monotone missingness. Stat Med 2020; 39:3503-3520. [PMID: 32729973 DOI: 10.1002/sim.8678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 04/28/2020] [Accepted: 04/30/2020] [Indexed: 11/10/2022]
Abstract
Dynamic treatment regimes operationalize precision medicine as a sequence of decision rules, one per stage of clinical intervention, that map up-to-date patient information to a recommended intervention. An optimal treatment regime maximizes the mean utility when applied to the population of interest. Methods for estimating an optimal treatment regime assume the data to be fully observed, which rarely occurs in practice. A common approach is to first use multiple imputation and then pool the estimators across imputed datasets. However, this approach requires estimating the joint distribution of patient trajectories, which can be high-dimensional, especially when there are multiple stages of intervention. We examine the application of inverse probability weighted estimating equations as an alternative to multiple imputation in the context of monotonic missingness. This approach applies to a broad class of estimators of an optimal treatment regime including both Q-learning and a generalization of outcome weighted learning. We establish consistency under mild regularity conditions and demonstrate its advantages in finite samples using a series of simulation experiments and an application to a schizophrenia study.
Collapse
Affiliation(s)
- Lin Dong
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | - Eric Laber
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | - Yair Goldberg
- Department of Statistics, Technion Israel Institute of Technology, Haifa, Israel
| | - Rui Song
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | - Shu Yang
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| |
Collapse
|
18
|
Wang Y, Zhao YQ, Zheng Y. Learning-based biomarker-assisted rules for optimized clinical benefit under a risk constraint. Biometrics 2020; 76:853-862. [PMID: 31833561 PMCID: PMC7292743 DOI: 10.1111/biom.13199] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Revised: 11/27/2019] [Accepted: 11/29/2019] [Indexed: 11/28/2022]
Abstract
Novel biomarkers, in combination with currently available clinical information, have been sought to improve clinical decision making in many branches of medicine, including screening, surveillance, and prognosis. Statistical methods are needed to integrate such diverse information to develop targeted interventions that balance benefit and harm. In the specific setting of disease detection, we propose novel approaches to construct a multiple-marker-based decision rule by directly optimizing a benefit function, while controlling harm at a maximally tolerable level. These new approaches include plug-in and direct-optimization-based algorithms, and they allow for the construction of both nonparametric and parametric rules. A study of asymptotic properties of the proposed estimators is provided. Simulation results demonstrate good clinical utilities for the resulting decision rules under various scenarios. The methods are applied to a biomarker study in prostate cancer surveillance.
Collapse
Affiliation(s)
- Yanqing Wang
- Institute for Insight, Georgia State University, Atlanta, GA 30302
| | - Ying-Qi Zhao
- Biostatistics Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109
| | - Yingye Zheng
- Biostatistics Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109
| |
Collapse
|
19
|
|
20
|
Huang X, Xu J. Estimating individualized treatment rules with risk constraint. Biometrics 2020; 76:1310-1318. [DOI: 10.1111/biom.13232] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2019] [Revised: 01/20/2020] [Accepted: 01/27/2020] [Indexed: 11/29/2022]
Affiliation(s)
- Xinyang Huang
- Key Laboratory of Advanced Theory and Application in Statistics and Data Science ‐ MOE School of Statistics East China Normal University Shanghai China
| | - Jin Xu
- Key Laboratory of Advanced Theory and Application in Statistics and Data Science ‐ MOE School of Statistics East China Normal University Shanghai China
| |
Collapse
|
21
|
Meng H, Zhao YQ, Fu H, Qiao X. Near-optimal Individualized Treatment Recommendations. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2020; 21:183. [PMID: 34335111 PMCID: PMC8324003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The individualized treatment recommendation (ITR) is an important analytic framework for precision medicine. The goal of ITR is to assign the best treatments to patients based on their individual characteristics. From the machine learning perspective, the solution to the ITR problem can be formulated as a weighted classification problem to maximize the mean benefit from the recommended treatments given patients' characteristics. Several ITR methods have been proposed in both the binary setting and the multicategory setting. In practice, one may prefer a more flexible recommendation that includes multiple treatment options. This motivates us to develop methods to obtain a set of near-optimal individualized treatment recommendations alternative to each other, called alternative individualized treatment recommendations (A-ITR). We propose two methods to estimate the optimal A-ITR within the outcome weighted learning (OWL) framework. Simulation studies and a real data analysis for Type 2 diabetic patients with injectable antidiabetic treatments are conducted to show the usefulness of the proposed A-ITR framework. We also show the consistency of these methods and obtain an upper bound for the risk between the theoretically optimal recommendation and the estimated one. An R package aitr has been developed, found at https://github.com/menghaomiao/aitr.
Collapse
Affiliation(s)
- Haomiao Meng
- Department of Mathematical Sciences, Binghamton University, State University of New York, Binghamton, NY 13902, USA
| | - Ying-Qi Zhao
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Haoda Fu
- Eli Lilly and Company, Indianapolis, IN 46285, USA
| | - Xingye Qiao
- Department of Mathematical Sciences, Binghamton University, State University of New York, Binghamton, NY 13902, USA
| |
Collapse
|
22
|
Guan Q, Reich BJ, Laber EB, Bandyopadhyay D. Bayesian Nonparametric Policy Search with Application to Periodontal Recall Intervals. J Am Stat Assoc 2019; 115:1066-1078. [PMID: 33012901 PMCID: PMC7531024 DOI: 10.1080/01621459.2019.1660169] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Revised: 07/18/2019] [Accepted: 08/05/2019] [Indexed: 10/26/2022]
Abstract
Tooth loss from periodontal disease is a major public health burden in the United States. Standard clinical practice is to recommend a dental visit every six months; however, this practice is not evidence-based, and poor dental outcomes and increasing dental insurance premiums indicate room for improvement. We consider a tailored approach that recommends recall time based on patient characteristics and medical history to minimize disease progression without increasing resource expenditures. We formalize this method as a dynamic treatment regime which comprises a sequence of decisions, one per stage of intervention, that follow a decision rule which maps current patient information to a recommendation for their next visit time. The dynamics of periodontal health, visit frequency, and patient compliance are complex, yet the estimated optimal regime must be interpretable to domain experts if it is to be integrated into clinical practice. We combine non-parametric Bayesian dynamics modeling with policy-search algorithms to estimate the optimal dynamic treatment regime within an interpretable class of regimes. Both simulation experiments and application to a rich database of electronic dental records from the HealthPartners HMO shows that our proposed method leads to better dental health without increasing the average recommended recall time relative to competing methods.
Collapse
Affiliation(s)
- Qian Guan
- Department of Statistics, North Carolina State University, Raleigh, North Carolina
| | - Brian J Reich
- Department of Statistics, North Carolina State University, Raleigh, North Carolina
| | - Eric B Laber
- Department of Statistics, North Carolina State University, Raleigh, North Carolina
| | | |
Collapse
|
23
|
Abstract
Precision medicine seeks to maximize the quality of healthcare by individualizing the healthcare process to the uniquely evolving health status of each patient. This endeavor spans a broad range of scientific areas including drug discovery, genetics/genomics, health communication, and causal inference all in support of evidence-based, i.e., data-driven, decision making. Precision medicine is formalized as a treatment regime which comprises a sequence of decision rules, one per decision point, which map up-to-date patient information to a recommended action. The potential actions could be the selection of which drug to use, the selection of dose, timing of administration, specific diet or exercise recommendation, or other aspects of treatment or care. Statistics research in precision medicine is broadly focused on methodological development for estimation of and inference for treatment regimes which maximize some cumulative clinical outcome. In this review, we provide an overview of this vibrant area of research and present important and emerging challenges.
Collapse
Affiliation(s)
- Michael R Kosorok
- Department of Biostatistics and Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, 27599, U.S.A.;
| | - Eric B Laber
- Department of Statistics, North Carolina State University, Raleight, North Carolina, 27695, U.S.A.;
| |
Collapse
|
24
|
Laber EB, Wu F, Munera C, Lipkovich I, Colucci S, Ripa S. Identifying optimal dosage regimes under safety constraints: An application to long term opioid treatment of chronic pain. Stat Med 2018; 37:1407-1418. [PMID: 29468702 PMCID: PMC6293986 DOI: 10.1002/sim.7566] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2016] [Revised: 08/26/2017] [Accepted: 10/30/2017] [Indexed: 11/08/2022]
Abstract
There is growing interest and investment in precision medicine as a means to provide the best possible health care. A treatment regime formalizes precision medicine as a sequence of decision rules, one per clinical intervention period, that specify if, when and how current treatment should be adjusted in response to a patient's evolving health status. It is standard to define a regime as optimal if, when applied to a population of interest, it maximizes the mean of some desirable clinical outcome, such as efficacy. However, in many clinical settings, a high-quality treatment regime must balance multiple competing outcomes; eg, when a high dose is associated with substantial symptom reduction but a greater risk of an adverse event. We consider the problem of estimating the most efficacious treatment regime subject to constraints on the risk of adverse events. We combine nonparametric Q-learning with policy-search to estimate a high-quality yet parsimonious treatment regime. This estimator applies to both observational and randomized data, as well as settings with variable, outcome-dependent follow-up, mixed treatment types, and multiple time points. This work is motivated by and framed in the context of dosing for chronic pain; however, the proposed framework can be applied generally to estimate a treatment regime which maximizes the mean of one primary outcome subject to constraints on one or more secondary outcomes. We illustrate the proposed method using data pooled from 5 open-label flexible dosing clinical trials for chronic pain.
Collapse
|
25
|
Butler EL, Laber EB, Davis SM, Kosorok MR. Incorporating Patient Preferences into Estimation of Optimal Individualized Treatment Rules. Biometrics 2018; 74:18-26. [PMID: 28742260 PMCID: PMC5785589 DOI: 10.1111/biom.12743] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Revised: 05/01/2017] [Accepted: 06/01/2017] [Indexed: 11/29/2022]
Abstract
Precision medicine seeks to provide treatment only if, when, to whom, and at the dose it is needed. Thus, precision medicine is a vehicle by which healthcare can be made both more effective and efficient. Individualized treatment rules operationalize precision medicine as a map from current patient information to a recommended treatment. An optimal individualized treatment rule is defined as maximizing the mean of a pre-specified scalar outcome. However, in settings with multiple outcomes, choosing a scalar composite outcome by which to define optimality is difficult. Furthermore, when there is heterogeneity across patient preferences for these outcomes, it may not be possible to construct a single composite outcome that leads to high-quality treatment recommendations for all patients. We simultaneously estimate the optimal individualized treatment rule for all composite outcomes representable as a convex combination of the (suitably transformed) outcomes. For each patient, we use a preference elicitation questionnaire and item response theory to derive the posterior distribution over preferences for these composite outcomes and subsequently derive an estimator of an optimal individualized treatment rule tailored to patient preferences. We prove that as the number of subjects and items on the questionnaire diverge, our estimator is consistent for an oracle optimal individualized treatment rule wherein each patient's preference is known a priori. We illustrate the proposed method using data from a clinical trial on antipsychotic medications for schizophrenia.
Collapse
Affiliation(s)
- Emily L Butler
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, U.S.A
| | - Eric B Laber
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, U.S.A
| | - Sonia M Davis
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, U.S.A
| | - Michael R Kosorok
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, U.S.A
- Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, U.S.A
| |
Collapse
|
26
|
Lizotte DJ, Tahmasebi A. Prediction and tolerance intervals for dynamic treatment regimes. Stat Methods Med Res 2017; 26:1611-1629. [PMID: 28695763 DOI: 10.1177/0962280217708662] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
We develop and evaluate tolerance interval methods for dynamic treatment regimes (DTRs) that can provide more detailed prognostic information to patients who will follow an estimated optimal regime. Although the problem of constructing confidence intervals for DTRs has been extensively studied, prediction and tolerance intervals have received little attention. We begin by reviewing in detail different interval estimation and prediction methods and then adapting them to the DTR setting. We illustrate some of the challenges associated with tolerance interval estimation stemming from the fact that we do not typically have data that were generated from the estimated optimal regime. We give an extensive empirical evaluation of the methods and discussed several practical aspects of method choice, and we present an example application using data from a clinical trial. Finally, we discuss future directions within this important emerging area of DTR research.
Collapse
Affiliation(s)
- Daniel J Lizotte
- Departments of Computer Science and Epidemiology & Biostatistics, The University of Western Ontario, London, Ontario, Canada
| | - Arezoo Tahmasebi
- Departments of Computer Science and Epidemiology & Biostatistics, The University of Western Ontario, London, Ontario, Canada
| |
Collapse
|
27
|
Laber EB, Staicu AM. Functional feature construction for individualized treatment regimes. J Am Stat Assoc 2017; 113:1219-1227. [PMID: 30416232 PMCID: PMC6223315 DOI: 10.1080/01621459.2017.1321545] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2015] [Revised: 01/01/2017] [Indexed: 10/19/2022]
Abstract
Evidence-based personalized medicine formalizes treatment selection as an individualized treatment regime that maps up-to-date patient information into the space of possible treatments. Available patient information may include static features such race, gender, family history, genetic and genomic information, as well as longitudinal information including the emergence of comorbidities, waxing and waning of symptoms, side-effect burden, and adherence. Dynamic information measured at multiple time points before treatment assignment should be included as input to the treatment regime. However, subject longitudinal measurements are typically sparse, irregularly spaced, noisy, and vary in number across subjects. Existing estimators for treatment regimes require equal information be measured on each subject and thus standard practice is to summarize longitudinal subject information into a scalar, ad hoc summary during data pre-processing. This reduction of the longitudinal information to a scalar feature precedes estimation of a treatment regime and is therefore not informed by subject outcomes, treatments, or covariates. Furthermore, we show that this reduction requires more stringent causal assumptions for consistent estimation than are necessary. We propose a data-driven method for constructing maximally prescriptive yet interpretable features that can be used with standard methods for estimating optimal treatment regimes. In our proposed framework, we treat the subject longitudinal information as a realization of a stochastic process observed with error at discrete time points. Functionals of this latent process are then combined with outcome models to estimate an optimal treatment regime. The proposed methodology requires weaker causal assumptions than Q-learning with an ad hoc scalar summary and is consistent for the optimal treatment regime.
Collapse
Affiliation(s)
- Eric B Laber
- Department of Statistics, North Carolina State University, Raleigh, NC, 27695, U.S.A
| | - Ana-Maria Staicu
- Department of Statistics, North Carolina State University, Raleigh, NC, 27695, U.S.A
| |
Collapse
|
28
|
Schnell P, Tang Q, Müller P, Carlin BP. Subgroup inference for multiple treatments and multiple endpoints in an Alzheimer’s disease treatment trial. Ann Appl Stat 2017. [DOI: 10.1214/17-aoas1024] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
29
|
Linn KA, Laber EB, Stefanski LA. Interactive Q-learning for Quantiles. J Am Stat Assoc 2017; 112:638-649. [PMID: 28890584 PMCID: PMC5586239 DOI: 10.1080/01621459.2016.1155993] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2014] [Revised: 01/01/2016] [Indexed: 12/18/2022]
Abstract
A dynamic treatment regime is a sequence of decision rules, each of which recommends treatment based on features of patient medical history such as past treatments and outcomes. Existing methods for estimating optimal dynamic treatment regimes from data optimize the mean of a response variable. However, the mean may not always be the most appropriate summary of performance. We derive estimators of decision rules for optimizing probabilities and quantiles computed with respect to the response distribution for two-stage, binary treatment settings. This enables estimation of dynamic treatment regimes that optimize the cumulative distribution function of the response at a prespecified point or a prespecified quantile of the response distribution such as the median. The proposed methods perform favorably in simulation experiments. We illustrate our approach with data from a sequentially randomized trial where the primary outcome is remission of depression symptoms.
Collapse
Affiliation(s)
- Kristin A Linn
- Department of Biostatistics and Epidemiology, University of Pennsylvania, Philadelphia, PA 19104
| | - Eric B Laber
- Department of Statistics, North Carolina State University, Raleigh, NC 27695
| | - Leonard A Stefanski
- Department of Statistics, North Carolina State University, Raleigh, NC 27695
| |
Collapse
|
30
|
Wang Y, Fu H, Zeng D. Learning Optimal Personalized Treatment Rules in Consideration of Benefit and Risk: with an Application to Treating Type 2 Diabetes Patients with Insulin Therapies. J Am Stat Assoc 2017; 113:1-13. [PMID: 30034060 PMCID: PMC6051551 DOI: 10.1080/01621459.2017.1303386] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Revised: 01/01/2017] [Indexed: 12/26/2022]
Abstract
Individualized medical decision making is often complex due to patient treatment response heterogeneity. Pharmacotherapy may exhibit distinct efficacy and safety profiles for different patient populations. An "optimal" treatment that maximizes clinical benefit for a patient may also lead to concern of safety due to a high risk of adverse events. Thus, to guide individualized clinical decision making and deliver optimal tailored treatments, maximizing clinical benefit should be considered in the context of controlling for potential risk. In this work, we propose two approaches to identify personalized optimal treatment strategy that maximizes clinical benefit under a constraint on the average risk. We derive the theoretical optimal treatment rule under the risk constraint and draw an analogy to the Neyman-Pearson lemma to prove the theorem. We present algorithms that can be easily implemented by any off-the-shelf quadratic programming package. We conduct extensive simulation studies to show satisfactory risk control when maximizing the clinical benefit. Lastly, we apply our method to a randomized trial of type 2 diabetes patients to guide optimal utilization of the first line insulin treatments based on individual patient characteristics while controlling for the rate of hypoglycemia events. We identify baseline glycated hemoglobin level, body mass index, and fasting blood glucose as three key factors among 18 biomarkers to differentiate treatment assignments, and demonstrate a successful control of the risk of hypoglycemia in both the training and testing data set.
Collapse
Affiliation(s)
- Yuanjia Wang
- Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, NY 10032
| | | | - Donglin Zeng
- Department of Biostatistics, University of North Carolina at Chapel Hill
| |
Collapse
|
31
|
Chen G, Zeng D, Kosorok MR. Personalized Dose Finding Using Outcome Weighted Learning. J Am Stat Assoc 2017; 111:1509-1521. [PMID: 28255189 PMCID: PMC5327863 DOI: 10.1080/01621459.2016.1148611] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2014] [Revised: 12/01/2015] [Indexed: 10/22/2022]
Abstract
In dose-finding clinical trials, it is becoming increasingly important to account for individual level heterogeneity while searching for optimal doses to ensure an optimal individualized dose rule (IDR) maximizes the expected beneficial clinical outcome for each individual. In this paper, we advocate a randomized trial design where candidate dose levels assigned to study subjects are randomly chosen from a continuous distribution within a safe range. To estimate the optimal IDR using such data, we propose an outcome weighted learning method based on a nonconvex loss function, which can be solved efficiently using a difference of convex functions algorithm. The consistency and convergence rate for the estimated IDR are derived, and its small-sample performance is evaluated via simulation studies. We demonstrate that the proposed method outperforms competing approaches. Finally, we illustrate this method using data from a cohort study for Warfarin (an anti-thrombotic drug) dosing.
Collapse
Affiliation(s)
- Guanhua Chen
- Assistant Professor, Department of Biostatistics, Vanderbilt University, Nashville, TN 37203
| | - Donglin Zeng
- Professor, Department of Biostatistics, University of North Carolina, Chapel Hill, NC 27599
| | - Michael R Kosorok
- W. R. Kenan, Jr. Distinguished Professor and Chair, Department of Biostatistics, and Professor, Department of Statistics and Operations Research, University of North Carolina, Chapel Hill, NC 27599
| |
Collapse
|
32
|
Lizotte DJ, Laber EB. Multi-Objective Markov Decision Processes for Data-Driven Decision Support. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2016; 17:211. [PMID: 28018133 PMCID: PMC5179144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
We present new methodology based on Multi-Objective Markov Decision Processes for developing sequential decision support systems from data. Our approach uses sequential decision-making data to provide support that is useful to many different decision-makers, each with different, potentially time-varying preference. To accomplish this, we develop an extension of fitted-Q iteration for multiple objectives that computes policies for all scalarization functions, i.e. preference functions, simultaneously from continuous-state, finite-horizon data. We identify and address several conceptual and computational challenges along the way, and we introduce a new solution concept that is appropriate when different actions have similar expected outcomes. Finally, we demonstrate an application of our method using data from the Clinical Antipsychotic Trials of Intervention Effectiveness and show that our approach offers decision-makers increased choice by a larger class of optimal policies.
Collapse
Affiliation(s)
- Daniel J Lizotte
- Department of Computer Science, Department of Epidemiology & Biostatistics, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 3K7, Canada
| | - Eric B Laber
- Department of Statistics, North Carolina State University, Raliegh, NC 27695, USA
| |
Collapse
|
33
|
Laber EB, Zhao YQ, Regh T, Davidian M, Tsiatis A, Stanford JB, Zeng D, Song R, Kosorok MR. Using pilot data to size a two-arm randomized trial to find a nearly optimal personalized treatment strategy. Stat Med 2015; 35:1245-56. [PMID: 26506890 DOI: 10.1002/sim.6783] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2015] [Revised: 10/07/2015] [Accepted: 10/08/2015] [Indexed: 12/18/2022]
Abstract
A personalized treatment strategy formalizes evidence-based treatment selection by mapping patient information to a recommended treatment. Personalized treatment strategies can produce better patient outcomes while reducing cost and treatment burden. Thus, among clinical and intervention scientists, there is a growing interest in conducting randomized clinical trials when one of the primary aims is estimation of a personalized treatment strategy. However, at present, there are no appropriate sample size formulae to assist in the design of such a trial. Furthermore, because the sampling distribution of the estimated outcome under an estimated optimal treatment strategy can be highly sensitive to small perturbations in the underlying generative model, sample size calculations based on standard (uncorrected) asymptotic approximations or computer simulations may not be reliable. We offer a simple and robust method for powering a single stage, two-armed randomized clinical trial when the primary aim is estimating the optimal single stage personalized treatment strategy. The proposed method is based on inverting a plugin projection confidence interval and is thereby regular and robust to small perturbations of the underlying generative model. The proposed method requires elicitation of two clinically meaningful parameters from clinical scientists and uses data from a small pilot study to estimate nuisance parameters, which are not easily elicited. The method performs well in simulated experiments and is illustrated using data from a pilot study of time to conception and fertility awareness.
Collapse
Affiliation(s)
- Eric B Laber
- John Wiley & Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, U.K
| | - Ying-Qi Zhao
- John Wiley & Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, U.K
| | - Todd Regh
- John Wiley & Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, U.K
| | - Marie Davidian
- John Wiley & Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, U.K
| | - Anastasios Tsiatis
- John Wiley & Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, U.K
| | - Joseph B Stanford
- John Wiley & Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, U.K
| | - Donglin Zeng
- John Wiley & Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, U.K
| | - Rui Song
- John Wiley & Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, U.K
| | - Michael R Kosorok
- John Wiley & Sons, Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, U.K
| |
Collapse
|
34
|
Wu F, Laber EB, Lipkovich IA, Severus E. Who will benefit from antidepressants in the acute treatment of bipolar depression? A reanalysis of the STEP-BD study by Sachs et al. 2007, using Q-learning. Int J Bipolar Disord 2015; 3:7. [PMID: 25844303 PMCID: PMC4383759 DOI: 10.1186/s40345-014-0018-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/30/2014] [Accepted: 12/30/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND There is substantial uncertainty regarding the efficacy of antidepressants in the treatment of bipolar disorders. METHODS Traditional randomized controlled trials and statistical methods are not designed to discover if, when, and to whom an intervention should be applied; thus, other methodological approaches are needed that allow for the practice of personalized, evidence-based medicine with patients with bipolar depression. RESULTS Dynamic treatment regimes operationalize clinical decision-making as a sequence of decision rules, one per stage of clinical intervention, that map patient information to a recommended treatment. Using data from the acute depression randomized care (RAD) pathway of the Systematic Treatment Enhancement Program for Bipolar Disorder (STEP-BD) study, we estimate an optimal dynamic treatment regime via Q-learning. CONCLUSIONS The estimated optimal treatment regime presents some evidence that patients in the RAD pathway of STEP-BD who experienced a (hypo)manic episode before the depressive episode may do better to forgo adding an antidepressant to a mandatory mood stabilizer.
Collapse
Affiliation(s)
- Fan Wu
- Department of Statistics, North Carolina State University, 2311 Stinson Drive, Raleigh, 27695 USA
| | - Eric B Laber
- Department of Statistics, North Carolina State University, 2311 Stinson Drive, Raleigh, 27695 USA
| | | | - Emanuel Severus
- Department of Psychiatry and Psychotherapy, University Hospital Carl Gustav Carus, Technische Universität Dresden,, Fetscherstraße 74,, 01307 Dresden Germany
| |
Collapse
|
35
|
Abstract
Evidence-based rules for optimal treatment allocation are key components in the quest for efficient, effective health care delivery. Q-learning, an approximate dynamic programming algorithm, is a popular method for estimating optimal sequential decision rules from data. Q-learning requires the modeling of nonsmooth, nonmonotone transformations of the data, complicating the search for adequately expressive, yet parsimonious, statistical models. The default Q-learning working model is multiple linear regression, which is not only provably misspecified under most data-generating models, but also results in nonregular regression estimators, complicating inference. We propose an alternative strategy for estimating optimal sequential decision rules for which the requisite statistical modeling does not depend on nonsmooth, nonmonotone transformed data, does not result in nonregular regression estimators, is consistent under a broader array of data-generation models than Q-learning, results in estimated sequential decision rules that have better sampling properties, and is amenable to established statistical approaches for exploratory data analysis, model building, and validation. We derive the new method, IQ-learning, via an interchange in the order of certain steps in Q-learning. In simulated experiments IQ-learning improves on Q-learning in terms of integrated mean squared error and power. The method is illustrated using data from a study of major depressive disorder.
Collapse
Affiliation(s)
- Eric B. Laber
- Department of Statistics, North Carolina State University, 2311 Stinson Drive, 5216 SAS Hall, Raleigh, North Carolina, 27695-8203, USA
| | - Kristin A. Linn
- Department of Statistics, North Carolina State University, 2311 Stinson Drive, 5216 SAS Hall, Raleigh, North Carolina, 27695-8203, USA
| | - Leonard A. Stefanski
- Department of Statistics, North Carolina State University, 2311 Stinson Drive, 5216 SAS Hall, Raleigh, North Carolina, 27695-8203, USA
| |
Collapse
|
36
|
|
37
|
Anderson K, Joffe M, Kosorok MR. University of Pennsylvania 6th annual conference on statistical issues in clinical trials: Dynamic treatment regimes (morning session). Clin Trials 2014; 11:418-425. [DOI: 10.1177/1740774514538553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
38
|
Abstract
BACKGROUND Recent advances in medical research suggest that the optimal treatment rules should be adaptive to patients over time. This has led to an increasing interest in studying dynamic treatment regime, a sequence of individualized treatment rules, one per stage of clinical intervention, which maps present patient information to a recommended treatment. There has been a recent surge of statistical work for estimating optimal dynamic treatment regimes from randomized and observational studies. The purpose of this article is to review recent methodological progress and applied issues associated with estimating optimal dynamic treatment regimes. METHODS We discuss sequential multiple assignment randomized trials, a clinical trial design used to study treatment sequences. We use a common estimator of an optimal dynamic treatment regime that applies to sequential multiple assignment randomized trials data as a platform to discuss several practical and methodological issues. RESULTS We provide a limited survey of practical issues associated with modeling sequential multiple assignment randomized trials data. We review some existing estimators of optimal dynamic treatment regimes and discuss practical issues associated with these methods including model building, missing data, statistical inference, and choosing an outcome when only non-responders are re-randomized. We mainly focus on the estimation and inference of dynamic treatment regimes using sequential multiple assignment randomized trials data. Dynamic treatment regimes can also be constructed from observational data, which may be easier to obtain in practice; however, care must be taken to account for potential confounding.
Collapse
Affiliation(s)
- Ying-Qi Zhao
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA
| | - Eric B Laber
- Department of Statistics, North Carolina State University, Raleigh, NC, USA
| |
Collapse
|
39
|
Laber EB, Lizotte DJ, Qian M, Pelham WE, Murphy SA. Rejoinder of “Dynamic treatment regimes: Technical challenges and applications”. Electron J Stat 2014. [DOI: 10.1214/14-ejs920rej] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
40
|
Lizotte DJ, Bowling M, Murphy SA. Linear Fitted-Q Iteration with Multiple Reward Functions. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2012; 13:3253-3295. [PMID: 23741197 PMCID: PMC3670261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
We present a general and detailed development of an algorithm for finite-horizon fitted-Q iteration with an arbitrary number of reward signals and linear value function approximation using an arbitrary number of state features. This includes a detailed treatment of the 3-reward function case using triangulation primitives from computational geometry and a method for identifying globally dominated actions. We also present an example of how our methods can be used to construct a real-world decision aid by considering symptom reduction, weight gain, and quality of life in sequential treatments for schizophrenia. Finally, we discuss future directions in which to take this work that will further enable our methods to make a positive impact on the field of evidence-based clinical decision support.
Collapse
Affiliation(s)
- Daniel J. Lizotte
- David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, ON N2L 3G1, Canada,
| | - Michael Bowling
- Department of Computing Science, University of Alberta, Edmonton, AB T6G 2E8, Canada,
| | - Susan A. Murphy
- Department of Statistics, University of Michigan, Ann Arbor, MI 48109-1107, USA,
| |
Collapse
|