1
|
Shah KS, Fu H, Kosorok MR. Stabilized direct learning for efficient estimation of individualized treatment rules. Biometrics 2023; 79:2843-2856. [PMID: 36585916 DOI: 10.1111/biom.13818] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 12/23/2022] [Indexed: 01/01/2023]
Abstract
In recent years, the field of precision medicine has seen many advancements. Significant focus has been placed on creating algorithms to estimate individualized treatment rules (ITRs), which map from patient covariates to the space of available treatments with the goal of maximizing patient outcome. Direct learning (D-Learning) is a recent one-step method which estimates the ITR by directly modeling the treatment-covariate interaction. However, when the variance of the outcome is heterogeneous with respect to treatment and covariates, D-Learning does not leverage this structure. Stabilized direct learning (SD-Learning), proposed in this paper, utilizes potential heteroscedasticity in the error term through a residual reweighting which models the residual variance via flexible machine learning algorithms such as XGBoost and random forests. We also develop an internal cross-validation scheme which determines the best residual model among competing models. SD-Learning improves the efficiency of D-Learning estimates in binary and multi-arm treatment scenarios. The method is simple to implement and an easy way to improve existing algorithms within the D-Learning family, including original D-Learning, Angle-based D-Learning (AD-Learning), and Robust D-learning (RD-Learning). We provide theoretical properties and justification of the optimality of SD-Learning. Head-to-head performance comparisons with D-Learning methods are provided through simulations, which demonstrate improvement in terms of average prediction error (APE), misclassification rate, and empirical value, along with a data analysis of an acquired immunodeficiency syndrome (AIDS) randomized clinical trial.
Collapse
Affiliation(s)
- Kushal S Shah
- Department of Biostatistics, University of North Carolina at Chapel Hill, North Carolina, USA
| | - Haoda Fu
- Eli Lilly and Company, Lilly Corporate Center, Indianapolis, USA
| | - Michael R Kosorok
- Department of Biostatistics, University of North Carolina at Chapel Hill, North Carolina, USA
| |
Collapse
|
2
|
Moodie EEM, Talbot D. On "Reflections on the concept of optimality of single decision point treatment regimes". Biom J 2023; 65:e2300027. [PMID: 37797173 DOI: 10.1002/bimj.202300027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 04/26/2023] [Accepted: 06/22/2023] [Indexed: 10/07/2023]
Abstract
This is a discussion of "Reflections on the concept of optimality of single decision point treatment regimes" by Trung Dung Tran, Ariel Alonso Abad, Geert Verbeke, Geert Molenberghs, and Iven Van Mechelen. The authors propose a thoughtful consideration of optimization targets and the implications of such targets for the resulting optimal treatment rule. However, we contest the assertation that targets of optimization have been overlooked and suggest additional considerations that researchers must contemplate as part of a complete framework for learning about optimal treatment regimes.
Collapse
Affiliation(s)
- Erica E M Moodie
- Department of Epidemiology & Biostatistics, McGill University, Montreal, Quebec, Canada
| | - Denis Talbot
- Department of Social and Preventive Medicine, Université Laval, Quebec, Canada
| |
Collapse
|
3
|
Liu Z, Zhan Z, Lin C, Zhang B. Estimation in optimal treatment regimes based on mean residual lifetimes with right-censored data. Biom J 2023; 65:e2200340. [PMID: 37789592 DOI: 10.1002/bimj.202200340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 07/06/2023] [Accepted: 07/09/2023] [Indexed: 10/05/2023]
Abstract
An optimal individualized treatment regime (ITR) is a decision rule in allocating the best treatment to each patient and, hence, maximizing overall benefits. In this paper, we propose a novel framework based on nonparametric inverse probability weighting (IPW) and augmented inverse probability weighting (AIPW) estimators of the value function when the data are subject to right censoring. In contrast to most existing approaches that are designed to maximize the expected survival time under a binary treatment framework, the proposed method targets maximizing the mean residual lifetime of patients. Specifically, the proposed IPW method searches the optimal ITR by maximizing an estimator for the overall population outcome directly, without specifying the regression model for the conditional mean residual lifetime, whereas the AIPW method integrates the model information of the mean residual lifetime to improve the robustness. Furthermore, to overcome the computational difficulty in a nonsmooth value estimator, smoothed IPW and AIPW estimators are constructed. In theory, we establish the asymptotic properties of the proposed method under suitable regularity conditions. The empirical performances of the proposed IPW and AIPW estimators are evaluated using simulation studies and are further illustrated with an application to the real-world data set from the Acquired Immunodeficiency Syndrome Clinical Trial Group Protocol 175 (ACTG175).
Collapse
Affiliation(s)
- Zhishuai Liu
- Department of Biostatistics & Bioinformatics, Duke University, Durham, USA
| | - Zishu Zhan
- Department of Biostatistics, School of Public Health, Southern Medical University, Guangzhou, China
| | - Cunjie Lin
- Center for Applied Statistics and School of Statistics, Renmin University of China, Beijing, China
| | - Baqun Zhang
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, China
| |
Collapse
|
4
|
Bakoyannis G. Estimating optimal individualized treatment rules with multistate processes. Biometrics 2023; 79:2830-2842. [PMID: 37015010 PMCID: PMC10553793 DOI: 10.1111/biom.13864] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 03/23/2023] [Indexed: 04/06/2023]
Abstract
Multistate process data are common in studies of chronic diseases such as cancer. These data are ideal for precision medicine purposes as they can be leveraged to improve more refined health outcomes, compared to standard survival outcomes, as well as incorporate patient preferences regarding quantity versus quality of life. However, there are currently no methods for the estimation of optimal individualized treatment rules with such data. In this paper, we propose a nonparametric outcome weighted learning approach for this problem in randomized clinical trial settings. The theoretical properties of the proposed methods, including Fisher consistency and asymptotic normality of the estimated expected outcome under the estimated optimal individualized treatment rule, are rigorously established. A consistent closed-form variance estimator is provided and methodology for the calculation of simultaneous confidence intervals is proposed. Simulation studies show that the proposed methodology and inference procedures work well even with small-sample sizes and high rates of right censoring. The methodology is illustrated using data from a randomized clinical trial on the treatment of metastatic squamous-cell carcinoma of the head and neck.
Collapse
Affiliation(s)
- Giorgos Bakoyannis
- Department of Biostatistics and Health Data Science, Indiana University, Indianapolis, Indiana, USA
| |
Collapse
|
5
|
Zhang J, Zhang P, Ma J, Shentu Y. Covariate-adjusted value-guided subgroup identification via boosting. J Biopharm Stat 2023:1-18. [PMID: 37955423 DOI: 10.1080/10543406.2023.2275757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Accepted: 10/22/2023] [Indexed: 11/14/2023]
Abstract
It is widely recognized that treatment effects could differ across subgroups of patients. Subgroup analysis, which assesses such heterogeneity, provides valuable information in developing personalized therapies. There has been extensive research developing novel statistical methods for subgroup identification. The recent contribution is a value-guided subgroup identification method that directly maximizes treatment benefit at the subgroup level for survival outcome, rather than relying on individual treatment effect estimation. In this paper, we first completed this framework by illustrating its application to continuous and binary outcomes. More importantly, we extended the original framework to account for the prognostic effects and named this new method Covariate-Adjusted Value-guided subgroup identification via boosting (CAVboost). The original method directly used the outcome to formulate the value function for subgroup identification. Since the outcome can further be decomposed as prognostic effects and treatment effects, specifying the prognostic effects as the covariates of a model for the outcome can single out the treatment effects and improve the power to detect them across subgroups. Our proposed CAVboost was based on this key idea. It used a covariate-adjusted treatment effect estimator, instead of the outcome itself, to formulate the value function for subgroup identification. CAVboost estimates the treatment effect by using covariates to account for the prognostic effects, which mimics the idea of using covariates in an ANCOVA estimator. We showed that CAVboost could effectively improve the subgroup identification capability for both continuous and binary outcomes.
Collapse
Affiliation(s)
| | - Pingye Zhang
- Gilead Sciences Inc, Foster City, California, USA
| | - Junshui Ma
- Merck & Co. MRL, BARDS, Rahway, New Jersey, USA
| | - Yue Shentu
- Merck & Co. MRL, BARDS, Rahway, New Jersey, USA
| |
Collapse
|
6
|
Jetsupphasuk M, Hudgens MG, Lu H, Cole SR, Edwards JK, Adimora AA, Althoff KN, Silverberg MJ, Rebeiro PF, Lima VD, Marconi VC, Sterling TR, Horberg MA, Gill MJ, Kitahata MM, Moore RD, Lang R, Gebo K, Rabkin C, Eron JJ. Optimizing Treatment for Human Immunodeficiency Virus to Improve Clinical Outcomes Using Precision Medicine. Am J Epidemiol 2023; 192:1341-1349. [PMID: 36922393 PMCID: PMC10666965 DOI: 10.1093/aje/kwad057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 01/03/2023] [Accepted: 03/09/2023] [Indexed: 03/18/2023] Open
Abstract
In first-line antiretroviral therapy (ART) for human immunodeficiency virus (HIV) treatment, some subgroups of patients may respond better to an efavirenz-based regimen than an integrase strand transfer inhibitor (InSTI)-based regimen, or vice versa, due to patient characteristics modifying treatment effects. Using data based on nearly 16,000 patients from the North American AIDS Cohort Collaboration on Research and Design from 2009-2016, statistical methods for precision medicine were employed to estimate an optimal treatment rule that minimizes the 5-year risk of the composite outcome of acquired immune deficiency syndrome (AIDS)-defining illnesses, serious non-AIDS events, and all-cause mortality. The treatment rules considered were functions that recommend either an efavirenz- or InSTI-based regimen conditional on baseline patient characteristics such as demographic information, laboratory results, and health history. The estimated 5-year risk under the estimated optimal treatment rule was 10.0% (95% confidence interval (CI): 8.6, 11.3), corresponding to an absolute risk reduction of 2.3% (95% CI: 0.9, 3.8) when compared with recommending an efavirenz-based regimen for all patients and 2.6% (95% CI: 1.0, 4.2) when compared with recommending an InSTI-based regimen for all. Tailoring ART to individual patient characteristics may reduce 5-year risk of the composite outcome compared with assigning all patients the same drug regimen.
Collapse
Affiliation(s)
- Michael Jetsupphasuk
- Correspondence to Michael Jetsupphasuk, Department of Biostatistics, UNC Gillings School of Global Public Health, Chapel Hill, NC 27599 (e-mail: )
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Zhou Y, Song PXK. Longitudinal self-learning of individualized treatment rules in a nutrient supplementation trial with missing data. Stat Med 2023. [PMID: 37158137 DOI: 10.1002/sim.9766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 04/24/2023] [Accepted: 04/29/2023] [Indexed: 05/10/2023]
Abstract
Longitudinal outcomes are prevalent in clinical studies, where the presence of missing data may make the statistical learning of individualized treatment rules (ITRs) a much more challenging task. We analyzed a longitudinal calcium supplementation trial in the ELEMENT Project and established a novel ITR to reduce the risk of adverse outcomes of lead exposure on child growth and development. Lead exposure, particularly in the form of in utero exposure, can seriously impair children's health, especially their cognitive and neurobehavioral development, which necessitates clinical interventions such as calcium supplementation intake during pregnancy. Using the longitudinal outcomes from a randomized clinical trial of calcium supplementation, we developed a new ITR for daily calcium intake during pregnancy to mitigate persistent lead exposure in children at age 3 years. To overcome the technical challenges posed by missing data, we illustrate a new learning approach, termed longitudinal self-learning (LS-learning), that utilizes longitudinal measurements of child's blood lead concentration in the derivation of ITR. Our LS-learning method relies on a temporally weighted self-learning paradigm to synergize serially correlated training data sources. The resulting ITR is the first of this kind in precision nutrition that will contribute to the reduction of expected blood lead concentration in children aged 0-3 years should this ITR be implemented to the entire study population of pregnant women.
Collapse
Affiliation(s)
- Yiwang Zhou
- Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, Tennessee, USA
| | - Peter X K Song
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| |
Collapse
|
8
|
Li Z, Chen J, Laber E, Liu F, Baumgartner R. Optimal Treatment Regimes: A Review and Empirical Comparison. Int Stat Rev 2023. [DOI: 10.1111/insr.12536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
Affiliation(s)
- Zhen Li
- Department of Statistics North Carolina State University Raleigh 27607 NC USA
| | - Jie Chen
- Department of Biometrics Overland Pharmaceuticals Dover 19901 DE USA
| | - Eric Laber
- Department of Statistical Science, Department of Biostatistics and Bioinformatics Duke University Durham 27708 NC USA
| | - Fang Liu
- Biostatistics and Research Decision Sciences Merck & Co., Inc. Kenilworth NJ 07033 USA
| | - Richard Baumgartner
- Biostatistics and Research Decision Sciences Merck & Co., Inc. Kenilworth NJ 07033 USA
| |
Collapse
|
9
|
Li C, Li W, Zhu W. Penalized robust learning for optimal treatment regimes with heterogeneous individualized treatment effects. J Appl Stat 2023; 51:1151-1170. [PMID: 38628447 PMCID: PMC11018073 DOI: 10.1080/02664763.2023.2180167] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Accepted: 02/05/2023] [Indexed: 02/22/2023]
Abstract
The growing popularity of personalized medicine motivates people to explore individualized treatment regimes according to heterogeneous characteristics of the patients. For the large-scale data analysis, however, the data are collected at different times and different locations, i.e. subjects are usually from a heterogeneous population, which causes that the optimal treatment regimes also vary for patients across different subgroups. In this paper, we mainly focus on the estimation of optimal treatment regimes for subjects come from a heterogeneous population with high-dimensional data. We first remove the main effects of the covariates for each subgroup to eliminate non-ignorable residual confounding. Based on the centralized outcome, we propose a penalized robust learning that estimates the coefficient matrix of the interactions between covariates and treatment by penalizing pairwise differences of the coefficients of any two subgroups for the same covariate, which can automatically identify the latent complex structure of the coefficient matrix with heterogeneous and homogeneous columns. At the same time, the penalized robust learning can also select the important variables that truly contribute to the individualized treatment decisions with commonly used sparsity structure penalty. Extensive simulation studies show that our proposed method outperforms current popular methods, and it is further illustrated in the real analysis of the Tamoxifen breast cancer data.
Collapse
Affiliation(s)
- Canhui Li
- Key Laboratory for Applied Statistics of MOE and School of Mathematics and Statistics, Northeast Normal University, Changchun, People's Republic of China
| | - Weirong Li
- Key Laboratory for Applied Statistics of MOE and School of Mathematics and Statistics, Northeast Normal University, Changchun, People's Republic of China
| | - Wensheng Zhu
- Key Laboratory for Applied Statistics of MOE and School of Mathematics and Statistics, Northeast Normal University, Changchun, People's Republic of China
| |
Collapse
|
10
|
Liu P, Li J, Kosorok MR. Change plane model averaging for subgroup identification. Stat Methods Med Res 2023; 32:773-788. [PMID: 36775991 DOI: 10.1177/09622802231154327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/14/2023]
Abstract
Central to personalized medicine and tailored therapies is discovering the subpopulations that account for treatment effect heterogeneity and are likely to benefit more from given interventions. In this article, we introduce a change plane model averaging method to identify subgroups characterized by linear combinations of predictive variables and multiple cut-offs. We first fit a sequence of statistical models, each incorporating the thresholding effect of one particular covariate. The estimation of submodels is accomplished through an iterative integration of a change point detection method and numerical optimization algorithms. A frequentist model averaging approach is then employed to linearly combine the submodels with optimal weights. Our approach can deal with high-dimensional settings involving enormous potential grouping variables by adopting the sparsity-inducing penalties. Simulation studies are conducted to investigate the prediction and subgrouping performance of the proposed method, with a comparison to various competing subgroup detection methods. Our method is applied to a dataset from a warfarin pharmacogenetics study, producing some new findings.
Collapse
Affiliation(s)
- Pan Liu
- Department of Statistics and Data Science, 37580National University of Singapore, Singapore, Singapore
| | - Jialiang Li
- Department of Statistics and Data Science, 37580National University of Singapore, Singapore, Singapore.,Duke University NUS Graduate Medical School, Singapore, Singapore
| | - Michael R Kosorok
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, USA
| |
Collapse
|
11
|
Wang X, Chakraborty B. The Sequential Multiple Assignment Randomized Trial for Controlling Infectious Diseases: A Review of Recent Developments. Am J Public Health 2023; 113:49-59. [PMID: 36516390 PMCID: PMC9755933 DOI: 10.2105/ajph.2022.307135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Infectious diseases have posed severe threats to public health across the world. Effective prevention and control of infectious diseases in the long term requires adapting interventions based on epidemiological evidence. The sequential multiple assignment randomized trial (SMART) is a multistage randomized trial that can provide valid evidence of when and how to adapt interventions for controlling infectious diseases based on evolving epidemiological evidence. We review recent developments in SMARTs to bring wider attention to the potential benefits of employing SMARTs in constructing effective adaptive interventions for controlling infectious diseases and other threats to public health. We discuss 2 example SMARTs for infectious diseases and summarize recent developments in SMARTs from the varied aspects of design, analysis, cost, and ethics. Public health investigators are encouraged to familiarize themselves with the related materials we discuss and collaborate with experts in SMARTs to translate the methodological developments into preeminent public health research. (Am J Public Health. 2023;113(1):49-59. https://doi.org/10.2105/AJPH.2022.307135).
Collapse
Affiliation(s)
- Xinru Wang
- Xinru Wang and Bibhas Chakraborty are with the Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore. Bibhas Chakraborty is also with the Department of Statistics and Data Science, National University of Singapore, Singapore
| | - Bibhas Chakraborty
- Xinru Wang and Bibhas Chakraborty are with the Centre for Quantitative Medicine, Duke-NUS Medical School, Singapore. Bibhas Chakraborty is also with the Department of Statistics and Data Science, National University of Singapore, Singapore
| |
Collapse
|
12
|
Ma H, Zeng D, Liu Y. Learning Optimal Group-structured Individualized Treatment Rules with Many Treatments. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2023; 24:102. [PMID: 37588020 PMCID: PMC10426767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 08/18/2023]
Abstract
Data driven individualized decision making problems have received a lot of attentions in recent years. In particular, decision makers aim to determine the optimal Individualized Treatment Rule (ITR) so that the expected specified outcome averaging over heterogeneous patient-specific characteristics is maximized. Many existing methods deal with binary or a moderate number of treatment arms and may not take potential treatment effect structure into account. However, the effectiveness of these methods may deteriorate when the number of treatment arms becomes large. In this article, we propose GRoup Outcome Weighted Learning (GROWL) to estimate the latent structure in the treatment space and the optimal group-structured ITRs through a single optimization. In particular, for estimating group-structured ITRs, we utilize the Reinforced Angle based Multicategory Support Vector Machines (RAMSVM) to learn group-based decision rules under the weighted angle based multi-class classification framework. Fisher consistency, the excess risk bound, and the convergence rate of the value function are established to provide a theoretical guarantee for GROWL. Extensive empirical results in simulation studies and real data analysis demonstrate that GROWL enjoys better performance than several other existing methods.
Collapse
Affiliation(s)
- Haixu Ma
- Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Donglin Zeng
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Yufeng Liu
- Department of Statistics and Operations Research, Department of Genetics, Department of Biostatistics, Carolina Center for Genome Science, Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
13
|
Liao P, Qi Z, Wan R, Klasnja P, Murphy SA. Batch policy learning in average reward Markov decision processes. Ann Stat 2022; 50:3364-3387. [PMID: 37022318 PMCID: PMC10072865 DOI: 10.1214/22-aos2231] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
We consider the batch (off-line) policy learning problem in the infinite horizon Markov Decision Process. Motivated by mobile health applications, we focus on learning a policy that maximizes the long-term average reward. We propose a doubly robust estimator for the average reward and show that it achieves semiparametric efficiency. Further we develop an optimization algorithm to compute the optimal policy in a parameterized stochastic policy class. The performance of the estimated policy is measured by the difference between the optimal average reward in the policy class and the average reward of the estimated policy and we establish a finite-sample regret guarantee. The performance of the method is illustrated by simulation studies and an analysis of a mobile health study promoting physical activity.
Collapse
Affiliation(s)
- Peng Liao
- Department of Statistics, Harvard University
| | - Zhengling Qi
- Department of Decision Sciences, George Washington University
| | | | | | | |
Collapse
|
14
|
Wu L, Yang S. Transfer learning of individualized treatment rules from experimental to real-world data. J Comput Graph Stat 2022; 32:1036-1045. [PMID: 37997592 PMCID: PMC10664843 DOI: 10.1080/10618600.2022.2141752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Accepted: 10/04/2022] [Indexed: 11/06/2022]
Abstract
Individualized treatment effect lies at the heart of precision medicine. Interpretable individualized treatment rules (ITRs) are desirable for clinicians or policymakers due to their intuitive appeal and transparency. The gold-standard approach to estimating the ITRs is randomized experiments, where subjects are randomized to different treatment groups and the confounding bias is minimized to the extent possible. However, experimental studies are limited in external validity because of their selection restrictions, and therefore the underlying study population is not representative of the target real-world population. Conventional learning methods of optimal interpretable ITRs for a target population based only on experimental data are biased. On the other hand, real-world data (RWD) are becoming popular and provide a representative sample of the target population. To learn the generalizable optimal interpretable ITRs, we propose an integrative transfer learning method based on weighting schemes to calibrate the covariate distribution of the experiment to that of the RWD. Theoretically, we establish the risk consistency for the proposed ITR estimator. Empirically, we evaluate the finite-sample performance of the transfer learner through simulations and apply it to a real data application of a job training program.
Collapse
Affiliation(s)
- Lili Wu
- Department of Statistics, North Carolina State University
| | - Shu Yang
- Department of Statistics, North Carolina State University
| |
Collapse
|
15
|
Guo X, Ni A. Contrast weighted learning for robust optimal treatment rule estimation. Stat Med 2022; 41:5379-5394. [PMID: 36104931 PMCID: PMC9826186 DOI: 10.1002/sim.9574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Revised: 08/17/2022] [Accepted: 08/31/2022] [Indexed: 01/11/2023]
Abstract
Personalized medicine aims to tailor medical decisions based on patient-specific characteristics. Advances in data capturing techniques such as electronic health records dramatically increase the availability of comprehensive patient profiles, promoting the rapid development of optimal treatment rule (OTR) estimation methods. An archetypal OTR estimation approach is the outcome weighted learning, where OTR is determined under a weighted classification framework with clinical outcomes as the weights. Although outcome weighted learning has been extensively studied and extended, existing methods are susceptible to irregularities of outcome distributions such as outliers and heavy tails. Methods that involve modeling of the outcome are also sensitive to model misspecification. We propose a contrast weighted learning (CWL) framework that exploits the flexibility and robustness of contrast functions to enable robust OTR estimation for a wide range of clinical outcomes. The novel value function in CWL only depends on the pairwise contrast of clinical outcomes between patients irrespective of their distributional features and supports. The Fisher consistency and convergence rate of the estimated decision rule via CWL are established. We illustrate the superiority of the proposed method under finite samples using comprehensive simulation studies with ill-distributed continuous outcomes and ordinal outcomes. We apply the CWL method to two datasets from clinical trials on idiopathic pulmonary fibrosis and COVID-19 to demonstrate its real-world application.
Collapse
Affiliation(s)
- Xiaohan Guo
- Division of Biostatistics, College of Public HealthThe Ohio State UniversityColumbusOhio
| | - Ai Ni
- Division of Biostatistics, College of Public HealthThe Ohio State UniversityColumbusOhio
| |
Collapse
|
16
|
Qi Z, Miao R, Zhang X. Proximal Learning for Individualized Treatment Regimes Under Unmeasured Confounding. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2022.2147841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Affiliation(s)
- Zhengling Qi
- Department of Decision Sciences, The George Washington University
| | - Rui Miao
- Department of Statistics, University of California, Irvine
| | - Xiaoke Zhang
- Department of Statistics, The George Washington University
| |
Collapse
|
17
|
Oh EJ, Qian M, Cheung YK. Generalization error bounds of dynamic treatment regimes in penalized regression-based learning. Ann Stat 2022. [DOI: 10.1214/22-aos2171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Eun Jeong Oh
- Department of Biostatistics, Columbia University
| | - Min Qian
- Department of Biostatistics, Columbia University
| | | |
Collapse
|
18
|
Liu M, Shen X, Pan W. Deep reinforcement learning for personalized treatment recommendation. Stat Med 2022; 41:4034-4056. [PMID: 35716038 PMCID: PMC9427729 DOI: 10.1002/sim.9491] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 05/22/2022] [Accepted: 05/25/2022] [Indexed: 12/12/2022]
Abstract
In precision medicine, the ultimate goal is to recommend the most effective treatment to an individual patient based on patient-specific molecular and clinical profiles, possibly high-dimensional. To advance cancer treatment, large-scale screenings of cancer cell lines against chemical compounds have been performed to help better understand the relationship between genomic features and drug response; existing machine learning approaches use exclusively supervised learning, including penalized regression and recommender systems. However, it would be more efficient to apply reinforcement learning to sequentially learn as data accrue, including selecting the most promising therapy for a patient given individual molecular and clinical features and then collecting and learning from the corresponding data. In this article, we propose a novel personalized ranking system called Proximal Policy Optimization Ranking (PPORank), which ranks the drugs based on their predicted effects per cell line (or patient) in the framework of deep reinforcement learning (DRL). Modeled as a Markov decision process, the proposed method learns to recommend the most suitable drugs sequentially and continuously over time. As a proof-of-concept, we conduct experiments on two large-scale cancer cell line data sets in addition to simulated data. The results demonstrate that the proposed DRL-based PPORank outperforms the state-of-the-art competitors based on supervised learning. Taken together, we conclude that novel methods in the framework of DRL have great potential for precision medicine and should be further studied.
Collapse
Affiliation(s)
- Mingyang Liu
- School of Statistics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Xiaotong Shen
- School of Statistics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Wei Pan
- Division of Biostatistics, University of Minnesota, Minneapolis, Minnesota, USA
| |
Collapse
|
19
|
Montoya LM, van der Laan MJ, Luedtke AR, Skeem JL, Coyle JR, Petersen ML. The optimal dynamic treatment rule superlearner: considerations, performance, and application to criminal justice interventions. Int J Biostat 2022:ijb-2020-0127. [PMID: 35708222 DOI: 10.1515/ijb-2020-0127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Accepted: 05/06/2022] [Indexed: 11/15/2022]
Abstract
The optimal dynamic treatment rule (ODTR) framework offers an approach for understanding which kinds of patients respond best to specific treatments - in other words, treatment effect heterogeneity. Recently, there has been a proliferation of methods for estimating the ODTR. One such method is an extension of the SuperLearner algorithm - an ensemble method to optimally combine candidate algorithms extensively used in prediction problems - to ODTRs. Following the ``causal roadmap," we causally and statistically define the ODTR and provide an introduction to estimating it using the ODTR SuperLearner. Additionally, we highlight practical choices when implementing the algorithm, including choice of candidate algorithms, metalearners to combine the candidates, and risk functions to select the best combination of algorithms. Using simulations, we illustrate how estimating the ODTR using this SuperLearner approach can uncover treatment effect heterogeneity more effectively than traditional approaches based on fitting a parametric regression of the outcome on the treatment, covariates and treatment-covariate interactions. We investigate the implications of choices in implementing an ODTR SuperLearner at various sample sizes. Our results show the advantages of: (1) including a combination of both flexible machine learning algorithms and simple parametric estimators in the library of candidate algorithms; (2) using an ensemble metalearner to combine candidates rather than selecting only the best-performing candidate; (3) using the mean outcome under the rule as a risk function. Finally, we apply the ODTR SuperLearner to the ``Interventions" study, an ongoing randomized controlled trial, to identify which justice-involved adults with mental illness benefit most from cognitive behavioral therapy to reduce criminal re-offending.
Collapse
Affiliation(s)
- Lina M Montoya
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | | | | | - Jennifer L Skeem
- School of Social Work and Goldman School of Public Policy, University of California Berkeley, Berkeley, USA
| | - Jeremy R Coyle
- Division of Biostatistics, University of California Berkeley, Berkeley, USA
| | - Maya L Petersen
- Divisions of Biostatistics and Epidemiology, University of California Berkeley, Berkeley, USA
| |
Collapse
|
20
|
Wang Y, Zhao Y, Zheng Y. Targeted Search for Individualized Clinical Decision Rules to Optimize Clinical Outcomes. STATISTICS IN BIOSCIENCES 2022. [DOI: 10.1007/s12561-022-09343-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
21
|
Qi Z, Pang JS, Liu Y. On Robustness of Individualized Decision Rules. J Am Stat Assoc 2022; 118:2143-2157. [PMID: 38143785 PMCID: PMC10746134 DOI: 10.1080/01621459.2022.2038180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 01/21/2022] [Indexed: 10/18/2022]
Abstract
With the emergence of precision medicine, estimating optimal individualized decision rules (IDRs) has attracted tremendous attention in many scientific areas. Most existing literature has focused on finding optimal IDRs that can maximize the expected outcome for each individual. Motivated by complex individualized decision making procedures and the popular conditional value at risk (CVaR) measure, we propose a new robust criterion to estimate optimal IDRs in order to control the average lower tail of the individuals' outcomes. In addition to improving the individualized expected outcome, our proposed criterion takes risks into consideration, and thus the resulting IDRs can prevent adverse events. The optimal IDR under our criterion can be interpreted as the decision rule that maximizes the "worst-case" scenario of the individualized outcome when the underlying distribution is perturbed within a constrained set. An efficient non-convex optimization algorithm is proposed with convergence guarantees. We investigate theoretical properties for our estimated optimal IDRs under the proposed criterion such as consistency and finite sample error bounds. Simulation studies and a real data application are used to further demonstrate the robust performance of our methods. Several extensions of the proposed method are also discussed.
Collapse
Affiliation(s)
- Zhengling Qi
- Department of Decision Sciences, George Washington University
| | - Jong-Shi Pang
- Department of Industrial and Systems Engineering, University of Southern California, LA
| | - Yufeng Liu
- Department of Statistics and Operations Research, Department of Genetics, Department of Biostatistics, Carolina Center for Genome Sciences, Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, NC 27599, USA
| |
Collapse
|
22
|
Xu Y, Greene TH, Bress AP, Sauer BC, Bellows BK, Zhang Y, Weintraub WS, Moran AE, Shen J. Estimating the optimal individualized treatment rule from a cost-effectiveness perspective. Biometrics 2022; 78:337-351. [PMID: 33215693 PMCID: PMC8134511 DOI: 10.1111/biom.13406] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Revised: 10/30/2020] [Accepted: 11/06/2020] [Indexed: 11/27/2022]
Abstract
Optimal individualized treatment rules (ITRs) provide customized treatment recommendations based on subject characteristics to maximize clinical benefit in accordance with the objectives in precision medicine. As a result, there is growing interest in developing statistical tools for estimating optimal ITRs in evidence-based research. In health economic perspectives, policy makers consider the tradeoff between health gains and incremental costs of interventions to set priorities and allocate resources. However, most work on ITRs has focused on maximizing the effectiveness of treatment without considering costs. In this paper, we jointly consider the impact of effectiveness and cost on treatment decisions and define ITRs under a composite-outcome setting, so that we identify the most cost-effective ITR that accounts for individual-level heterogeneity through direct optimization. In particular, we propose a decision-tree-based statistical learning algorithm that uses a net-monetary-benefit-based reward to provide nonparametric estimations of the optimal ITR. We provide several approaches to estimating the reward underlying the ITR as a function of subject characteristics. We present the strengths and weaknesses of each approach and provide practical guidelines by comparing their performance in simulation studies. We illustrate the top-performing approach from our simulations by evaluating the projected 15-year personalized cost-effectiveness of the intensive blood pressure control of the Systolic Blood Pressure Intervention Trial (SPRINT) study.
Collapse
Affiliation(s)
- Yizhe Xu
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah
| | - Tom H. Greene
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah,Department of Internal Medicine, University of Utah, Salt Lake City, Utah,Department of Family and Preventive Medicine, University of Utah, Salt Lake City, Utah
| | - Adam P. Bress
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah
| | - Brian C. Sauer
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah,Department of Family and Preventive Medicine, University of Utah, Salt Lake City, Utah,Salt Lake City Veterans Affairs Medical Center, Salt Lake City, Utah
| | - Brandon K. Bellows
- Department of Medicine, Columbia University Medical Center, New York, New York
| | - Yue Zhang
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah,Department of Internal Medicine, University of Utah, Salt Lake City, Utah,Department of Family and Preventive Medicine, University of Utah, Salt Lake City, Utah
| | | | - Andrew E. Moran
- Department of Medicine, Columbia University Medical Center, New York, New York
| | - Jincheng Shen
- Department of Population Health Sciences, University of Utah, Salt Lake City, Utah,Department of Internal Medicine, University of Utah, Salt Lake City, Utah,Department of Family and Preventive Medicine, University of Utah, Salt Lake City, Utah
| |
Collapse
|
23
|
Wu J, Galanter N, Shortreed SM, Moodie EEM. Ranking tailoring variables for constructing individualized treatment rules: an application to schizophrenia. J R Stat Soc Ser C Appl Stat 2022; 71:309-330. [PMID: 38288004 PMCID: PMC10823524 DOI: 10.1111/rssc.12533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
As with many chronic conditions, matching patients with schizophrenia to the best treatment options is difficult. Selecting antipsychotic medication is especially challenging because many of the medications can have burdensome side effects. Adjusting or tailoring medications based on patients' characteristics could improve symptoms. However, it is often not known which patient characteristics are most helpful for informing treatment selection. In this paper, we address the challenge of identifying and ranking important variables for tailoring treatment decisions. We consider a value-search approach implemented through dynamic marginal structural models to estimate an optimal individualized treatment rule. We apply our methodology to the Clinical Antipsychotics Trial of Intervention and Effectiveness (CATIE) study for schizophrenia, to evaluate if some tailoring variables have greater potential than others for selecting treatments for patients with schizophrenia (Stroup et al., 2003).
Collapse
Affiliation(s)
| | | | - Susan M Shortreed
- Kaiser Permanente Washington Health Research Institute, Seattle, USA, and University of Washington, Seattle, USA
| | | |
Collapse
|
24
|
Guo H, Li J, Liu H, He J. Learning dynamic treatment strategies for coronary heart diseases by artificial intelligence: real-world data-driven study. BMC Med Inform Decis Mak 2022; 22:39. [PMID: 35168623 PMCID: PMC8845235 DOI: 10.1186/s12911-022-01774-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Accepted: 02/01/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Coronary heart disease (CHD) has become the leading cause of death and one of the most serious epidemic diseases worldwide. CHD is characterized by urgency, danger and severity, and dynamic treatment strategies for CHD patients are needed. We aimed to build and validate an AI model for dynamic treatment recommendations for CHD patients with the goal of improving patient outcomes and learning best practices from clinicians to help clinical decision support for treating CHD patients. METHODS We formed the treatment strategy as a sequential decision problem, and applied an AI supervised reinforcement learning-long short-term memory (SRL-LSTM) framework that combined supervised learning (SL) and reinforcement learning (RL) with an LSTM network to track patients' states to learn a recommendation model that took a patient's diagnosis and evolving health status as input and provided a treatment recommendation in the form of whether to take specific drugs. The experiments were conducted by leveraging a real-world intensive care unit (ICU) database with 13,762 admitted patients diagnosed with CHD. We compared the performance of the applied SRL-LSTM model and several state-of-the-art SL and RL models in reducing the estimated in-hospital mortality and the Jaccard similarity with clinicians' decisions. We used a random forest algorithm to calculate the feature importance of both the clinician policy and the AI policy to illustrate the interpretability of the AI model. RESULTS Our experimental study demonstrated that the AI model could help reduce the estimated in-hospital mortality through its RL function and learn the best practice from clinicians through its SL function. The similarity between the clinician policy and the AI policy regarding the surviving patients was high, while for the expired patients, it was much lower. The dynamic treatment strategies made by the AI model were clinically interpretable and relied on sensible clinical features extracted according to monitoring indexes and risk factors for CHD patients. CONCLUSIONS We proposed a pipeline for constructing an AI model to learn dynamic treatment strategies for CHD patients that could improve patient outcomes and mimic the best practices of clinicians. And a lot of further studies and efforts are needed to make it practical.
Collapse
Affiliation(s)
- Haihong Guo
- School of Information, Renmin University of China, 59 Zhongguancun Street, Haidian District, Beijing, 100872, China
- Institute of Medical Information/Medical Library, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
- Key Laboratory of Data Engineering and Knowledge Engineering, Ministry of Education, Beijing, China
| | - Jiao Li
- Institute of Medical Information/Medical Library, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Hongyan Liu
- School of Economics and Management, Tsinghua University, Beijing, China
| | - Jun He
- School of Information, Renmin University of China, 59 Zhongguancun Street, Haidian District, Beijing, 100872, China.
- Key Laboratory of Data Engineering and Knowledge Engineering, Ministry of Education, Beijing, China.
| |
Collapse
|
25
|
Meng H, Qiao X. Augmented direct learning for conditional average treatment effect estimation with double robustness. Electron J Stat 2022. [DOI: 10.1214/22-ejs2025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Haomiao Meng
- Department of Mathematical Sciences, Binghamton University, State University of New York, Binghamton, NY 13902-6000, USA
| | - Xingye Qiao
- Department of Mathematical Sciences, Binghamton University, State University of New York, Binghamton, NY 13902-6000, USA
| |
Collapse
|
26
|
Qiu H, Carone M, Luedtke A. Individualized treatment rules under stochastic treatment cost constraints. JOURNAL OF CAUSAL INFERENCE 2022; 10:480-493. [PMID: 38323299 PMCID: PMC10846854 DOI: 10.1515/jci-2022-0005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/08/2024]
Abstract
Estimation and evaluation of individualized treatment rules have been studied extensively, but real-world treatment resource constraints have received limited attention in existing methods. We investigate a setting in which treatment is intervened upon based on covariates to optimize the mean counterfactual outcome under treatment cost constraints when the treatment cost is random. In a particularly interesting special case, an instrumental variable corresponding to encouragement to treatment is intervened upon with constraints on the proportion receiving treatment. For such settings, we first develop a method to estimate optimal individualized treatment rules. We further construct an asymptotically efficient plug-in estimator of the corresponding average treatment effect relative to a given reference rule.
Collapse
Affiliation(s)
- Hongxiang Qiu
- Department of Statistics, the Wharton School, University of Pennsylvania
| | - Marco Carone
- Department of Biostatistics, University of Washington
| | - Alex Luedtke
- Department of Statistics, University of Washington
| |
Collapse
|
27
|
Fan Y, Lu X, Zhao J, Fu H, Liu Y. Estimating individualized treatment rules for treatments with hierarchical structure. Electron J Stat 2022. [DOI: 10.1214/21-ejs1948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Yiwei Fan
- Center for Applied Statistics, School of Statistics, Renmin University of China, China
| | - Xiaoling Lu
- Center for Applied Statistics, School of Statistics, Renmin University of China, China
| | - Junlong Zhao
- School of Statistics, Beijing Normal University, China
| | - Haoda Fu
- Advanced Analytics and Data Sciences, Eli Lilly and Company, U.S.A
| | - Yufeng Liu
- Department of Statistics and Operations Research, Department of Genetics, Department of Biostatistics, Carolina Center for Genome Sciences, Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, U.S.A
| |
Collapse
|
28
|
Gao D, Liu Y, Zeng D. Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2022; 23:https://www.jmlr.org/papers/v23/21-0354.html. [PMID: 37576335 PMCID: PMC10419117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
Learning optimal individualized treatment rules (ITRs) has become increasingly important in the modern era of precision medicine. Many statistical and machine learning methods for learning optimal ITRs have been developed in the literature. However, most existing methods are based on data collected from traditional randomized controlled trials and thus cannot take advantage of the accumulative evidence when patients enter the trials sequentially. It is also ethically important that future patients should have a high probability to be treated optimally based on the updated knowledge so far. In this work, we propose a new design called sequentially rule-adaptive trials to learn optimal ITRs based on the contextual bandit framework, in contrast to the response-adaptive design in traditional adaptive trials. In our design, each entering patient will be allocated with a high probability to the current best treatment for this patient, which is estimated using the past data based on some machine learning algorithm (for example, outcome weighted learning in our implementation). We explore the tradeoff between training and test values of the estimated ITR in single-stage problems by proving theoretically that for a higher probability of following the estimated ITR, the training value converges to the optimal value at a faster rate, while the test value converges at a slower rate. This problem is different from traditional decision problems in the sense that the training data are generated sequentially and are dependent. We also develop a tool that combines martingale with empirical process to tackle the problem that cannot be solved by previous techniques for i.i.d. data. We show by numerical examples that without much loss of the test value, our proposed algorithm can improve the training value significantly as compared to existing methods. Finally, we use a real data study to illustrate the performance of the proposed method.
Collapse
Affiliation(s)
- Daiqi Gao
- Department of Statistics and Operations Research, The University of North Carolina at Chapel Hill Chapel Hill, NC 27599, USA
| | - Yufeng Liu
- Department of Statistics and Operations Research, Department of Genetics, Department of Biostatistics, The University of North Carolina at Chapel Hill Chapel Hill, NC 27599, USA
| | - Donglin Zeng
- Department of Biostatistics, The University of North Carolina at Chapel Hill Chapel Hill, NC 27599, USA
| |
Collapse
|
29
|
Mo W, Liu Y. Efficient learning of optimal individualized treatment rules for heteroscedastic or misspecified treatment‐free effect models. J R Stat Soc Series B Stat Methodol 2021. [DOI: 10.1111/rssb.12474] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Weibin Mo
- Department of Statistics and Operations Research University of North Carolina at Chapel Hill Chapel Hill North Carolina USA
| | - Yufeng Liu
- Department of Statistics and Operations Research Department of Genetics Department of Biostatistics Carolina Center for Genome Sciences Lineberger Comprehensive Cancer Center University of North Carolina at Chapel Hill Chapel Hill North Carolina USA
| |
Collapse
|
30
|
Doubleday K, Zhou J, Zhou H, Fu H. Risk controlled decision trees and random forests for precision Medicine. Stat Med 2021; 41:719-735. [PMID: 34786731 PMCID: PMC8863134 DOI: 10.1002/sim.9253] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Revised: 10/15/2021] [Accepted: 10/15/2021] [Indexed: 11/08/2022]
Abstract
Statistical methods generating individualized treatment rules (ITRs) often focus on maximizing expected benefit, but these rules may expose patients to excess risk. For instance, aggressive treatment of type 2 diabetes (T2D) with insulin therapies may result in an ITR which controls blood glucose levels but increases rates of hypoglycemia, diminishing the appeal of the ITR. This work proposes two methods to identify risk-controlled ITRs (rcITR), a class of ITR which maximizes a benefit while controlling risk at a prespecified threshold. A novel penalized recursive partitioning algorithm is developed which optimizes an unconstrained, penalized value function. The final rule is a risk-controlled decision tree (rcDT) that is easily interpretable. A natural extension of the rcDT model, risk controlled random forests (rcRF), is also proposed. Simulation studies demonstrate the robustness of rcRF modeling. Three variable importance measures are proposed to further guide clinical decision-making. Both rcDT and rcRF procedures can be applied to data from randomized controlled trials or observational studies. An extensive simulation study interrogates the performance of the proposed methods. A data analysis of the DURABLE diabetes trial in which two therapeutics were compared is additionally presented. An R package implements the proposed methods ( https://github.com/kdoub5ha/rcITR).
Collapse
Affiliation(s)
- Kevin Doubleday
- Department of Biostatistics, University of Arizona, Tucson, Arizona, USA
| | - Jin Zhou
- Department of Biostatistics, University of California, Los Angeles, California, USA
| | - Hua Zhou
- Department of Biostatistics, University of California, Los Angeles, California, USA
| | - Haoda Fu
- Eli Lilly and Company, Indianapolis, Indiana, USA
| |
Collapse
|
31
|
Mulrenin IR, Garcia JE, Fashe MM, Loop MS, Daubert MA, Urrutia RP, Lee CR. The impact of pregnancy on antihypertensive drug metabolism and pharmacokinetics: current status and future directions. Expert Opin Drug Metab Toxicol 2021; 17:1261-1279. [PMID: 34739303 DOI: 10.1080/17425255.2021.2002845] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
INTRODUCTION Hypertensive disorders of pregnancy (HDP) are rising in prevalence, and increase risk of adverse maternal and fetal outcomes. Physiologic changes occur during pregnancy that alter drug pharmacokinetics. However, antihypertensive drugs lack pregnancy-specific dosing recommendations due to critical knowledge gaps surrounding the extent of gestational changes in antihypertensive drug pharmacokinetics and underlying mechanisms. AREAS COVERED This review (1) summarizes currently recommended medications and dosing strategies for non-emergent HDP treatment, (2) reviews and synthesizes existing literature identified via a comprehensive Pubmed search evaluating gestational changes in the maternal pharmacokinetics of commonly prescribed HDP drugs (notably labetalol and nifedipine), and (3) offers insight into the metabolism and clearance mechanisms underlying altered HDP drug pharmacokinetics during pregnancy. Remaining knowledge gaps and future research directions are summarized. EXPERT OPINION A series of small pharmacokinetic studies illustrate higher oral clearance of labetalol and nifedipine during pregnancy. Pharmacokinetic modeling and preclinical studies suggest these effects are likely due to pregnancy-associated increases in hepatic UGT1A1- and CYP3A4-mediated first-pass metabolism and lower bioavailability. Accordingly, higher and/or more frequent doses may be needed to lower blood pressure during pregnancy. Future research is needed to address various evidence gaps and inform the development of more precise antihypertensive drug dosing strategies.
Collapse
Affiliation(s)
- Ian R Mulrenin
- Division of Pharmacotherapy and Experimental Therapeutics, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Julian E Garcia
- Division of Pharmacotherapy and Experimental Therapeutics, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Muluneh M Fashe
- Division of Pharmacotherapy and Experimental Therapeutics, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Matthew Shane Loop
- Division of Pharmacotherapy and Experimental Therapeutics, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Melissa A Daubert
- Division of Cardiology, Department of Medicine, Duke University Medical Center, Durham, NC
| | - Rachel Peragallo Urrutia
- Division of General Obstetrics and Gynecology, Department of Obstetrics and Gynecology, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Craig R Lee
- Division of Pharmacotherapy and Experimental Therapeutics, UNC Eshelman School of Pharmacy, University of North Carolina at Chapel Hill, Chapel Hill, NC
| |
Collapse
|
32
|
Zhou W, Zhu R, Zeng D. A parsimonious personalized dose-finding model via dimension reduction. Biometrika 2021; 108:643-659. [PMID: 34658383 PMCID: PMC8514170 DOI: 10.1093/biomet/asaa087] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Learning an individualized dose rule in personalized medicine is a challenging statistical problem. Existing methods often suffer from the curse of dimensionality, especially when the decision function is estimated nonparametrically. To tackle this problem, we propose a dimension reduction framework that effectively reduces the estimation to a lower-dimensional subspace of the covariates. We exploit that the individualized dose rule can be defined in a subspace spanned by a few linear combinations of the covariates, leading to a more parsimonious model. Also, our framework does not require the inverse probability of the propensity score under observational studies due to a direct maximization of the value function. This distinguishes us from the outcome weighted learning framework, which also solves decision rules directly. Under the same framework, we further propose a pseudo-direct learning approach focuses more on estimating the dimensionality-reduced subspace of the treatment outcome. Parameters in both approaches can be estimated efficiently using an orthogonality constrained optimization algorithm on the Stiefel manifold. Under mild regularity assumptions, the asymptotic normality results of the proposed estimators can are established, respectively. We also derive the consistency and convergence rate for the value function under the estimated optimal dose rule. We evaluate the performance of the proposed approaches through extensive simulation studies and a warfarin pharmacogenetic dataset.
Collapse
Affiliation(s)
- Wenzhuo Zhou
- Department of Statistics, University of Illinois at Urbana-Champaign, Champaign, Illinois 61820, U.S.A
| | - Ruoqing Zhu
- Department of Statistics, University of Illinois at Urbana-Champaign, Champaign, Illinois 61820, U.S.A
| | - Donglin Zeng
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, U.S.A
| |
Collapse
|
33
|
Cai T, Tony Cai T, Guo Z. Optimal statistical inference for individualized treatment effects in high‐dimensional models. J R Stat Soc Series B Stat Methodol 2021. [DOI: 10.1111/rssb.12426] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
34
|
Cai H, Song R, Lu W. GEAR: On optimal decision making with auxiliary data. Stat (Int Stat Inst) 2021. [DOI: 10.1002/sta4.399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Hengrui Cai
- Department of Statistics North Carolina State University Raleigh North Carolina 27695 USA
| | - Rui Song
- Department of Statistics North Carolina State University Raleigh North Carolina 27695 USA
| | - Wenbin Lu
- Department of Statistics North Carolina State University Raleigh North Carolina 27695 USA
| |
Collapse
|
35
|
Imai K, Li ML. Experimental Evaluation of Individualized Treatment Rules. J Am Stat Assoc 2021. [DOI: 10.1080/01621459.2021.1923511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Kosuke Imai
- Department of Government and Department of Statistics, Harvard University, Cambridge, MA
| | - Michael Lingzhi Li
- Operation Research Center, Massachusetts Institute of Technology, Cambridge, MA
| |
Collapse
|
36
|
Luedtke A, Kessler RC. New Directions in Research on Heterogeneity of Treatment Effects for Major Depression. JAMA Psychiatry 2021; 78:478-480. [PMID: 33595616 DOI: 10.1001/jamapsychiatry.2020.4489] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Affiliation(s)
- Alex Luedtke
- Department of Statistics, University of Washington, Seattle.,Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington
| | - Ronald C Kessler
- Department of Health Care Policy, Harvard Medical School, Boston, Massachusetts
| |
Collapse
|
37
|
Jiang X, Nelson AE, Cleveland RJ, Beavers DP, Schwartz TA, Arbeeva L, Alvarez C, Callahan LF, Messier S, Loeser R, Kosorok MR. Precision Medicine Approach to Develop and Internally Validate Optimal Exercise and Weight-Loss Treatments for Overweight and Obese Adults With Knee Osteoarthritis: Data From a Single-Center Randomized Trial. Arthritis Care Res (Hoboken) 2021; 73:693-701. [PMID: 32144896 PMCID: PMC7483572 DOI: 10.1002/acr.24179] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Accepted: 02/25/2020] [Indexed: 11/09/2022]
Abstract
OBJECTIVE To apply a precision medicine approach to determine the optimal treatment regime for participants in an exercise (E), dietary weight loss (D), and D + E trial for knee osteoarthritis that would maximize their expected outcomes. METHODS Using data from 343 participants of the Intensive Diet and Exercise for Arthritis (IDEA) trial, we applied 24 machine-learning models to develop individualized treatment rules on 7 outcomes: Short Form 36 physical component score, weight loss, Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) pain/function/stiffness scores, compressive force, and interleukin-6 level. The optimal model was selected based on jackknife value function estimates that indicate improvement in the outcomes if future participants follow the estimated decision rule compared to the optimal single, fixed treatment model. RESULTS Multiple outcome random forest was the optimal model for the WOMAC outcomes. For the other outcomes, list-based models were optimal. For example, the estimated optimal decision rule for weight loss indicated assigning the D + E intervention to participants with baseline weight not exceeding 109.35 kg and waist circumference above 90.25 cm, and assigning D to all other participants except those with a history of a heart attack. If applied to future participants, the optimal rule for weight loss is estimated to increase average weight loss to 11.2 kg at 18 months, contrasted with 9.8 kg if all participants received D + E (P = 0.01). CONCLUSION The precision medicine models supported the overall findings from IDEA that the D + E intervention was optimal for most participants, but there was evidence that a subgroup of participants would likely benefit more from diet alone for 2 outcomes.
Collapse
Affiliation(s)
- Xiaotong Jiang
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC
| | - Amanda E. Nelson
- Division of Rheumatology, Allergy and Immunology and the Thurston Arthritis Research Center, University of North Carolina, Chapel Hill, NC
| | - Rebecca J. Cleveland
- Division of Rheumatology, Allergy and Immunology and the Thurston Arthritis Research Center, University of North Carolina, Chapel Hill, NC
| | - Daniel P. Beavers
- Department of Biostatistics and Data Science, Wake Forest School of Medicine, Winston-Salem, NC
| | - Todd A. Schwartz
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC
| | - Liubov Arbeeva
- Division of Rheumatology, Allergy and Immunology and the Thurston Arthritis Research Center, University of North Carolina, Chapel Hill, NC
| | - Carolina Alvarez
- Division of Rheumatology, Allergy and Immunology and the Thurston Arthritis Research Center, University of North Carolina, Chapel Hill, NC
| | - Leigh F. Callahan
- Division of Rheumatology, Allergy and Immunology and the Thurston Arthritis Research Center, University of North Carolina, Chapel Hill, NC
| | - Stephen Messier
- Department of Health and Exercise Science, Wake Forest University, Winston-Salem, NC
| | - Richard Loeser
- Division of Rheumatology, Allergy and Immunology and the Thurston Arthritis Research Center, University of North Carolina, Chapel Hill, NC
| | - Michael R. Kosorok
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC
| |
Collapse
|
38
|
Zhang J, Troxel AB, Petkova E. Robust index of confidence weighted learning for optimal individualized treatment rule estimation. Stat (Int Stat Inst) 2021. [DOI: 10.1002/sta4.374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Jinchun Zhang
- Biostatistics and Research Decision Sciences, MRL Merck & Co., Inc. Kenilworth New Jersey USA
| | - Andrea B. Troxel
- Department of Population Health New York University Grossman School of Medicine New York New York USA
| | - Eva Petkova
- Department of Population Health New York University Grossman School of Medicine New York New York USA
- Nathan Kline Institute for Psychiatric Research Orangeburg New York USA
| |
Collapse
|
39
|
Li J, Li Y, Jin B, Kosorok MR. Multithreshold change plane model: Estimation theory and applications in subgroup identification. Stat Med 2021; 40:3440-3459. [PMID: 33843100 DOI: 10.1002/sim.8976] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Revised: 01/06/2021] [Accepted: 03/21/2021] [Indexed: 11/05/2022]
Abstract
We propose a multithreshold change plane regression model which naturally partitions the observed subjects into subgroups with different covariate effects. The underlying grouping variable is a linear function of observed covariates and thus multiple thresholds produce change planes in the covariate space. We contribute a novel two-stage estimation approach to determine the number of subgroups, the location of thresholds, and all other regression parameters. In the first stage we adopt a group selection principle to consistently identify the number of subgroups, while in the second stage change point locations and model parameter estimates are refined by a penalized induced smoothing technique. Our procedure allows sparse solutions for relatively moderate- or high-dimensional covariates. We further establish the asymptotic properties of our proposed estimators under appropriate technical conditions. We evaluate the performance of the proposed methods by simulation studies and provide illustrations using two medical data examples. Our proposal for subgroup identification may lead to an immediate application in personalized medicine.
Collapse
Affiliation(s)
- Jialiang Li
- Department of Statistics and Applied Probability, National University of Singapore, Singapore, Singapore.,Duke-NUS Graduate Medical School, National University of Singapore, Singapore, Singapore.,Singapore Eye Research Institute, Singapore, Singapore
| | - Yaguang Li
- International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, Anhui, China
| | - Baisuo Jin
- International Institute of Finance, School of Management, University of Science and Technology of China, Hefei, Anhui, China
| | - Michael R Kosorok
- Department of Biotatistics, University of North Carolina, Chapel Hill, North Carolina, USA
| |
Collapse
|
40
|
Liang M, Zhao YQ. Discussion of Kallus (2020) and Mo et al (2020). J Am Stat Assoc 2021; 116:690-693. [PMID: 34483404 PMCID: PMC8409173 DOI: 10.1080/01621459.2020.1833887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Accepted: 10/03/2020] [Indexed: 10/21/2022]
Abstract
We discuss the results on improving the generalizability of individualized treatment rule following the work in Kallus [1] and Mo et al. [5]. We note that the advocated weights in Kallus [1] are connected to the efficient score of the contrast function. We further propose a likelihood-ratio-based method (LR-ITR) to accommodate covariate shifts, and compare it to the CTE-DR-ITR method proposed in Mo et al. [5]. We provide the upper-bound on the risk function of the target population when both the covariate shift and the contrast function shift are present. Numerical studies show that LR-ITR can outperform CTE-DR-ITR when there is only covariate shift.
Collapse
Affiliation(s)
- Muxuan Liang
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center
| | - Ying-Qi Zhao
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center
| |
Collapse
|
41
|
Guo W, Zhou XH, Ma S. Estimation of Optimal Individualized Treatment Rules Using a Covariate-Specific Treatment Effect Curve With High-Dimensional Covariates. J Am Stat Assoc 2021. [DOI: 10.1080/01621459.2020.1865167] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Affiliation(s)
- Wenchuan Guo
- Department of Statistics, University of California Riverside, Riverside, CA
- Global Biometric Sciences, Bristol-Myers Squibb, Pennington, NJ
| | - Xiao-Hua Zhou
- Beijing International Center for Mathematical Research, and Department of Biostatistics, Peking University, Beijing, China
| | - Shujie Ma
- Department of Statistics, University of California Riverside, Riverside, CA
| |
Collapse
|
42
|
Luckett DJ, Laber EB, Kim S, Kosorok MR. Estimation and Optimization of Composite Outcomes. JOURNAL OF MACHINE LEARNING RESEARCH : JMLR 2021; 22:167. [PMID: 34733120 PMCID: PMC8562677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
There is tremendous interest in precision medicine as a means to improve patient outcomes by tailoring treatment to individual characteristics. An individualized treatment rule formalizes precision medicine as a map from patient information to a recommended treatment. A treatment rule is defined to be optimal if it maximizes the mean of a scalar outcome in a population of interest, e.g., symptom reduction. However, clinical and intervention scientists often seek to balance multiple and possibly competing outcomes, e.g., symptom reduction and the risk of an adverse event. One approach to precision medicine in this setting is to elicit a composite outcome which balances all competing outcomes; unfortunately, eliciting a composite outcome directly from patients is difficult without a high-quality instrument, and an expert-derived composite outcome may not account for heterogeneity in patient preferences. We propose a new paradigm for the study of precision medicine using observational data that relies solely on the assumption that clinicians are approximately (i.e., imperfectly) making decisions to maximize individual patient utility. Estimated composite outcomes are subsequently used to construct an estimator of an individualized treatment rule which maximizes the mean of patient-specific composite outcomes. The estimated composite outcomes and estimated optimal individualized treatment rule provide new insights into patient preference heterogeneity, clinician behavior, and the value of precision medicine in a given domain. We derive inference procedures for the proposed estimators under mild conditions and demonstrate their finite sample performance through a suite of simulation experiments and an illustrative application to data from a study of bipolar depression.
Collapse
Affiliation(s)
| | - Eric B Laber
- Department of Statistics, North Carolina State University, Raleigh, NC 27607, USA
| | - Siyeon Kim
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27607, USA
| | - Michael R Kosorok
- Departments of Biostatistics and Statistics & Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
43
|
Zhang P, Ma J, Chen X, Shentu Y. A nonparametric method for value function guided subgroup identification via gradient tree boosting for censored survival data. Stat Med 2020; 39:4133-4146. [PMID: 32786155 DOI: 10.1002/sim.8714] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2020] [Revised: 06/08/2020] [Accepted: 07/09/2020] [Indexed: 11/07/2022]
Abstract
In randomized clinical trials with survival outcome, there has been an increasing interest in subgroup identification based on baseline genomic, proteomic markers, or clinical characteristics. Some of the existing methods identify subgroups that benefit substantially from the experimental treatment by directly modeling outcomes or treatment effect. When the goal is to find an optimal treatment for a given patient rather than finding the right patient for a given treatment, methods under the individualized treatment regime framework estimate an individualized treatment rule that would lead to the best expected clinical outcome as measured by a value function. Connecting the concept of value function to subgroup identification, we propose a nonparametric method that searches for subgroup membership scores by maximizing a value function that directly reflects the subgroup-treatment interaction effect based on restricted mean survival time. A gradient tree boosting algorithm is proposed to search for the individual subgroup membership scores. We conduct simulation studies to evaluate the performance of the proposed method and an application to an AIDS clinical trial is performed for illustration.
Collapse
Affiliation(s)
- Pingye Zhang
- Biostatistics and Research Decision Sciences, MRL, Merck & Co., Inc., Rahway, New Jersey, USA
| | - Junshui Ma
- Biostatistics and Research Decision Sciences, MRL, Merck & Co., Inc., Rahway, New Jersey, USA
| | - Xinqun Chen
- Biostatistics and Research Decision Sciences, MRL, Merck & Co., Inc., Rahway, New Jersey, USA
| | - Yue Shentu
- Biostatistics and Research Decision Sciences, MRL, Merck & Co., Inc., Rahway, New Jersey, USA
| |
Collapse
|
44
|
Liu M, Shen X, Pan W. Outcome weighted ψ-learning for individualized treatment rules. Stat (Int Stat Inst) 2020; 10:e343. [PMID: 34937955 PMCID: PMC8691757 DOI: 10.1002/sta4.343] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Accepted: 12/02/2020] [Indexed: 11/12/2022]
Abstract
An individualized treatment rule is often employed to maximize a certain patient-specific clinical outcome based on his/her clinical or genomic characteristics as well as heterogeneous response to treatments. Although developing such a rule is conceptually important to personalized medicine, existing methods such as the partial least squares Qian and Murphy (2011) suffers from the difficulty of indirect maximization of a patient's clinical outcome, while the outcome weighted learning Y. Zhao, Zeng, Rush, and Kosorok (2012) is not robust against any perturbation of the outcome. In this article, we propose a weighted ψ-learning method to optimize an individualized treatment rule, which is robust against any data perturbation near the decision boundary by seeking the maximum separation. To solve nonconvex minimization, we employ a difference convex algorithm to relax the non-convex minimization iteratively based on a decomposition of the cost function into a difference of two convex functions. On this ground, we also introduce a variable selection method for further removing redundant variables for a higher performance. Finally, we illustrate the proposed method by simulations and a lung health study and demonstrate that it yields higher performances in terms of accuracy of prediction of individualized treatment.
Collapse
Affiliation(s)
- Mingyang Liu
- School of Statistics, University of Minnesota, MN, Minneapolis
| | - Xiaotong Shen
- School of Statistics, University of Minnesota, MN, Minneapolis
| | - Wei Pan
- Division of Biostatistics, University of Minnesota, MN, Minneapolis
| |
Collapse
|
45
|
Affiliation(s)
- Xinkun Nie
- Department of Computer Science, Stanford University, Stanford, CA
| | - Emma Brunskill
- Department of Computer Science, Stanford University, Stanford, CA
| | - Stefan Wager
- Graduate School of Business, Stanford University, Stanford, CA
| |
Collapse
|
46
|
Rashid NU, Luckett DJ, Chen J, Lawson MT, Wang L, Zhang Y, Laber EB, Liu Y, Yeh JJ, Zeng D, Kosorok MR. High-Dimensional Precision Medicine From Patient-Derived Xenografts. J Am Stat Assoc 2020; 116:1140-1154. [PMID: 34548714 PMCID: PMC8451968 DOI: 10.1080/01621459.2020.1828091] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2017] [Revised: 08/28/2020] [Accepted: 09/18/2020] [Indexed: 12/26/2022]
Abstract
The complexity of human cancer often results in significant heterogeneity in response to treatment. Precision medicine offers the potential to improve patient outcomes by leveraging this heterogeneity. Individualized treatment rules (ITRs) formalize precision medicine as maps from the patient covariate space into the space of allowable treatments. The optimal ITR is that which maximizes the mean of a clinical outcome in a population of interest. Patient-derived xenograft (PDX) studies permit the evaluation of multiple treatments within a single tumor, and thus are ideally suited for estimating optimal ITRs. PDX data are characterized by correlated outcomes, a high-dimensional feature space, and a large number of treatments. Here we explore machine learning methods for estimating optimal ITRs from PDX data. We analyze data from a large PDX study to identify biomarkers that are informative for developing personalized treatment recommendations in multiple cancers. We estimate optimal ITRs using regression-based (Q-learning) and direct-search methods (outcome weighted learning). Finally, we implement a superlearner approach to combine multiple estimated ITRs and show that the resulting ITR performs better than any of the input ITRs, mitigating uncertainty regarding user choice. Our results indicate that PDX data are a valuable resource for developing individualized treatment strategies in oncology. Supplementary materials for this article are available online.
Collapse
Affiliation(s)
- Naim U. Rashid
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Daniel J. Luckett
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Jingxiang Chen
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Michael T. Lawson
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Longshaokan Wang
- Department of Statistics, North Carolina State University, Raleigh, NC, USA
| | - Yunshu Zhang
- Department of Statistics, North Carolina State University, Raleigh, NC, USA
| | - Eric B. Laber
- Department of Statistics, North Carolina State University, Raleigh, NC, USA
| | - Yufeng Liu
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC
- Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, NC
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Jen Jen Yeh
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC
- Department of Surgery, University of North Carolina at Chapel Hill, Chapel Hill, NC
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Donglin Zeng
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC
| | - Michael R. Kosorok
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC
- Department of Statistics and Operations Research, University of North Carolina at Chapel Hill, Chapel Hill, NC
| |
Collapse
|
47
|
Dong L, Laber E, Goldberg Y, Song R, Yang S. Ascertaining properties of weighting in the estimation of optimal treatment regimes under monotone missingness. Stat Med 2020; 39:3503-3520. [PMID: 32729973 DOI: 10.1002/sim.8678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 04/28/2020] [Accepted: 04/30/2020] [Indexed: 11/10/2022]
Abstract
Dynamic treatment regimes operationalize precision medicine as a sequence of decision rules, one per stage of clinical intervention, that map up-to-date patient information to a recommended intervention. An optimal treatment regime maximizes the mean utility when applied to the population of interest. Methods for estimating an optimal treatment regime assume the data to be fully observed, which rarely occurs in practice. A common approach is to first use multiple imputation and then pool the estimators across imputed datasets. However, this approach requires estimating the joint distribution of patient trajectories, which can be high-dimensional, especially when there are multiple stages of intervention. We examine the application of inverse probability weighted estimating equations as an alternative to multiple imputation in the context of monotonic missingness. This approach applies to a broad class of estimators of an optimal treatment regime including both Q-learning and a generalization of outcome weighted learning. We establish consistency under mild regularity conditions and demonstrate its advantages in finite samples using a series of simulation experiments and an application to a schizophrenia study.
Collapse
Affiliation(s)
- Lin Dong
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | - Eric Laber
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | - Yair Goldberg
- Department of Statistics, Technion Israel Institute of Technology, Haifa, Israel
| | - Rui Song
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| | - Shu Yang
- Department of Statistics, North Carolina State University, Raleigh, North Carolina, USA
| |
Collapse
|
48
|
Zhang H, Huang J, Sun L. A rank-based approach to estimating monotone individualized two treatment regimes. Comput Stat Data Anal 2020. [DOI: 10.1016/j.csda.2020.107015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
49
|
Chen Y, Zeng D, Wang Y. Learning Individualized Treatment Rules for Multiple-Domain Latent Outcomes. J Am Stat Assoc 2020; 116:269-282. [PMID: 34776561 PMCID: PMC8589272 DOI: 10.1080/01621459.2020.1817751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2019] [Revised: 03/15/2020] [Accepted: 04/04/2020] [Indexed: 10/23/2022]
Abstract
For many mental disorders, latent mental status from multiple-domain psychological or clinical symptoms may perform as a better characterization of the underlying disorder status than a simple summary score of the symptoms, and they may also serve as more reliable and representative features to differentiate treatment responses. Therefore, in order to address the complexity and heterogeneity of treatment responses for mental disorders, we provide a new paradigm for learning optimal individualized treatment rules (ITRs) by modeling patients' latent mental status. We first learn the multi-domain latent states at baseline from the observed symptoms under a restricted Boltzmann machine (RBM) model, through which patients' heterogeneous symptoms are represented using an economical number of latent variables and yet remains flexible. We then optimize a value function defined by the latent states after treatment by exploiting a transformation of the observed symptoms based on the RBM without modeling the relationship between the latent mental states before and after treatment. The optimal treatment rules are derived using a weighted large margin classifier. We derive the convergence rate of the proposed estimator under the latent models. Simulation studies are conducted to test the performance of the proposed method. Finally, we apply the developed method to real world studies and we demonstrate the utility and advantage of our method in tailoring treatments for patients with major depression, and identify patient subgroups informative for treatment recommendations.
Collapse
Affiliation(s)
- Yuan Chen
- Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, NY 10032
| | - Donglin Zeng
- Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27516
| | - Yuanjia Wang
- Department of Biostatistics, Mailman School of Public Health, Columbia University, New York, NY 10032
| |
Collapse
|
50
|
Affiliation(s)
- Muxuan Liang
- Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA
| | - Menggang Yu
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI
| |
Collapse
|