Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Laber EB, Lizotte DJ, Ferguson B. Set-valued dynamic treatment regimes for competing outcomes. Biometrics 2014;70:53-61. [PMID: 24400912 DOI: 10.1111/biom.12132] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Received: 03/01/2013] [Revised: 09/01/2013] [Accepted: 10/01/2013] [Indexed: 11/30/2022]

Cited by Other Article(s)

Khajuria R, Sarwar A. Review of reinforcement learning applications in segmentation, chemotherapy, and radiotherapy of cancer. Micron 2024;178:103583. [PMID: 38185018 DOI: 10.1016/j.micron.2023.103583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/03/2023] [Revised: 10/16/2023] [Accepted: 12/20/2023] [Indexed: 01/09/2024]

Wu D, Goldfeld KS, Petkova E, Park HG. Improving Individualized Treatment Decisions: A Bayesian Multivariate Hierarchical Model for Developing a Treatment Benefit Index using Mixed Types of Outcomes. medRxiv 2024:2023.11.17.23298711. [PMID: 38014277 PMCID: PMC10680905 DOI: 10.1101/2023.11.17.23298711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/29/2023]

Abstract

Background

Precision medicine has led to the development of targeted treatment strategies tailored to individual patients based on their characteristics and disease manifestations. Although precision medicine often focuses on a single health outcome for individualized treatment decision rules (ITRs), relying only on a single outcome rather than all available outcomes information leads to suboptimal data usage when developing optimal ITRs.

Methods

To address this limitation, we propose a Bayesian multivariate hierarchical model that leverages the wealth of correlated health outcomes collected in clinical trials. The approach jointly models mixed types of correlated outcomes, facilitating the "borrowing of information" across the multivariate outcomes, and results in a more accurate estimation of heterogeneous treatment effects compared to using single regression models for each outcome. We develop a treatment benefit index, which quantifies the relative treatment benefit of the experimental treatment over the control treatment, based on the proposed multivariate outcome model.

Results

We demonstrate the strengths of the proposed approach through extensive simulations and an application to an international Coronavirus Disease 2019 (COVID-19) treatment trial. Simulation results indicate that the proposed method reduces the occurrence of erroneous treatment decisions compared to a single regression model for a single health outcome. Additionally, the sensitivity analysis demonstrates the robustness of the model across various study scenarios. Application of the method to the COVID-19 trial exhibits improvements in estimating the individual-level treatment efficacy (indicated by narrower credible intervals for odds ratios) and optimal ITRs.

Conclusion

The study jointly models mixed types of outcomes in the context of developing ITRs. By considering multiple health outcomes, the proposed approach can advance the development of more effective and reliable personalized treatment.

Tran TD, Abad AA, Verbeke G, Molenberghs G, Van Mechelen I. Reflections on the concept of optimality of single decision point treatment regimes. Biom J 2023;65:e2200285. [PMID: 37736675 DOI: 10.1002/bimj.202200285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/13/2022] [Revised: 06/28/2023] [Accepted: 07/30/2023] [Indexed: 09/23/2023]

Li Z, Chen J, Laber E, Liu F, Baumgartner R. Optimal Treatment Regimes: A Review and Empirical Comparison. Int Stat Rev 2023. [DOI: 10.1111/insr.12536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/24/2023]

Ma H, Zeng D, Liu Y. Learning Optimal Group-structured Individualized Treatment Rules with Many Treatments. J Mach Learn Res 2023;24:102. [PMID: 37588020 PMCID: PMC10426767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 08/18/2023]

Kulasekera K, Siriwardhana C. Quantiles based personalized treatment selection for multivariate outcomes and multiple treatments. Stat Med 2022;41:2695-2710. [PMID: 35699385 PMCID: PMC9232994 DOI: 10.1002/sim.9377] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 05/09/2021] [Revised: 02/24/2022] [Accepted: 02/26/2022] [Indexed: 11/29/2023]

Wang Y, Zhao Y, Zheng Y. Targeted Search for Individualized Clinical Decision Rules to Optimize Clinical Outcomes. Stat Biosci 2022. [DOI: 10.1007/s12561-022-09343-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/01/2022]

Xu R, Chen G, Connor M, Murphy J. Novel Use of Patient-Specific Covariates From Oncology Studies in the Era of Biomedical Data Science: A Review of Latest Methodologies. J Clin Oncol 2022;40:3546-3553. [PMID: 35258995 DOI: 10.1200/jco.21.01957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/20/2022]

Kulasekera K, Siriwardhana C. Multi-Response Based Personalized Treatment Selection with Data from Crossover Designs for Multiple Treatments. Commun Stat Simul Comput 2022;51:554-569. [PMID: 35299995 PMCID: PMC8923529 DOI: 10.1080/03610918.2019.1656739] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 01/03/2023]

Tang M, Wang L, Gorin MA, Taylor JMG. Step-adjusted tree-based reinforcement learning for evaluating nested dynamic treatment regimes using test-and-treat observational data. Stat Med 2021;40:6164-6177. [PMID: 34490942 DOI: 10.1002/sim.9177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/27/2020] [Revised: 07/31/2021] [Accepted: 08/09/2021] [Indexed: 11/08/2022]

Doubleday K, Zhou J, Zhou H, Fu H. Risk controlled decision trees and random forests for precision Medicine. Stat Med 2021;41:719-735. [PMID: 34786731 PMCID: PMC8863134 DOI: 10.1002/sim.9253] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 03/06/2021] [Revised: 10/15/2021] [Accepted: 10/15/2021] [Indexed: 11/08/2022]

Siriwardhana C, Kulasekera K. Optimal Personalized Treatment Selection with Multivariate Outcome Measures in a Multiple Treatment Case. Commun Stat Simul Comput 2021;52:5773-5787. [PMID: 38371330 PMCID: PMC10871612 DOI: 10.1080/03610918.2021.1999473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/13/2021] [Accepted: 10/24/2021] [Indexed: 10/19/2022]

Kapelner A, Bleich J, Levine A, Cohen ZD, DeRubeis RJ, Berk R. Evaluating the Effectiveness of Personalized Medicine With Software. Front Big Data 2021;4:572532. [PMID: 34085036 PMCID: PMC8167073 DOI: 10.3389/fdata.2021.572532] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 06/14/2020] [Accepted: 02/03/2021] [Indexed: 11/13/2022]

Luckett DJ, Laber EB, Kim S, Kosorok MR. Estimation and Optimization of Composite Outcomes. J Mach Learn Res 2021;22:167. [PMID: 34733120 PMCID: PMC8562677] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/13/2023]

Sperger J, Freeman NLB, Jiang X, Bang D, Marchi D, Kosorok MR. The future of precision health is data-driven decision support. Stat Anal Data Min 2020. [DOI: 10.1002/sam.11475] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/09/2022]

Siriwardhana C, Kulasekera KB. Personalized treatment plans with multivariate outcomes. Biom J 2020;62:1973-1985. [PMID: 32627863 DOI: 10.1002/bimj.201800072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 03/26/2018] [Revised: 03/03/2020] [Accepted: 03/21/2020] [Indexed: 11/09/2022]

Dong L, Laber E, Goldberg Y, Song R, Yang S. Ascertaining properties of weighting in the estimation of optimal treatment regimes under monotone missingness. Stat Med 2020;39:3503-3520. [PMID: 32729973 DOI: 10.1002/sim.8678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/21/2019] [Revised: 04/28/2020] [Accepted: 04/30/2020] [Indexed: 11/10/2022]

Wang Y, Zhao YQ, Zheng Y. Learning-based biomarker-assisted rules for optimized clinical benefit under a risk constraint. Biometrics 2020;76:853-862. [PMID: 31833561 PMCID: PMC7292743 DOI: 10.1111/biom.13199] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 11/16/2018] [Revised: 11/27/2019] [Accepted: 11/29/2019] [Indexed: 11/28/2022]

Ertefaie A, Johnson BA. Comment: Outcome-Wide Individualized Treatment Strategies. Stat Sci 2020. [DOI: 10.1214/20-sts771] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/19/2022]

Huang X, Xu J. Estimating individualized treatment rules with risk constraint. Biometrics 2020;76:1310-1318. [DOI: 10.1111/biom.13232] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 03/30/2019] [Revised: 01/20/2020] [Accepted: 01/27/2020] [Indexed: 11/29/2022]

Meng H, Zhao YQ, Fu H, Qiao X. Near-optimal Individualized Treatment Recommendations. J Mach Learn Res 2020;21:183. [PMID: 34335111 PMCID: PMC8324003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/13/2023]

Guan Q, Reich BJ, Laber EB, Bandyopadhyay D. Bayesian Nonparametric Policy Search with Application to Periodontal Recall Intervals. J Am Stat Assoc 2019;115:1066-1078. [PMID: 33012901 PMCID: PMC7531024 DOI: 10.1080/01621459.2019.1660169] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Received: 06/06/2017] [Revised: 07/18/2019] [Accepted: 08/05/2019] [Indexed: 10/26/2022]

Kosorok MR, Laber EB. Precision Medicine. Annu Rev Stat Appl 2019;6:263-286. [PMID: 31073534 PMCID: PMC6502478 DOI: 10.1146/annurev-statistics-030718-105251] [Citation(s) in RCA: 127] [Impact Index Per Article: 25.4] [Indexed: 05/28/2023]

Laber EB, Wu F, Munera C, Lipkovich I, Colucci S, Ripa S. Identifying optimal dosage regimes under safety constraints: An application to long term opioid treatment of chronic pain. Stat Med 2018;37:1407-1418. [PMID: 29468702 PMCID: PMC6293986 DOI: 10.1002/sim.7566] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Received: 10/14/2016] [Revised: 08/26/2017] [Accepted: 10/30/2017] [Indexed: 11/08/2022]

Butler EL, Laber EB, Davis SM, Kosorok MR. Incorporating Patient Preferences into Estimation of Optimal Individualized Treatment Rules. Biometrics 2018;74:18-26. [PMID: 28742260 PMCID: PMC5785589 DOI: 10.1111/biom.12743] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Received: 04/01/2016] [Revised: 05/01/2017] [Accepted: 06/01/2017] [Indexed: 11/29/2022]

Lizotte DJ, Tahmasebi A. Prediction and tolerance intervals for dynamic treatment regimes. Stat Methods Med Res 2017;26:1611-1629. [PMID: 28695763 DOI: 10.1177/0962280217708662] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Indexed: 11/16/2022]

Laber EB, Staicu AM. Functional feature construction for individualized treatment regimes. J Am Stat Assoc 2017;113:1219-1227. [PMID: 30416232 PMCID: PMC6223315 DOI: 10.1080/01621459.2017.1321545] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Received: 03/01/2015] [Revised: 01/01/2017] [Indexed: 10/19/2022]

Abstract

Evidence-based personalized medicine formalizes treatment selection as an individualized treatment regime that maps up-to-date patient information into the space of possible treatments. Available patient information may include static features such as race, gender, family history, genetic and genomic information, as well as longitudinal information including the emergence of comorbidities, waxing and waning of symptoms, side-effect burden, and adherence. Dynamic information measured at multiple time points before treatment assignment should be included as input to the treatment regime. However, subject longitudinal measurements are typically sparse, irregularly spaced, noisy, and vary in number across subjects. Existing estimators for treatment regimes require equal information be measured on each subject and thus standard practice is to summarize longitudinal subject information into a scalar, ad hoc summary during data pre-processing. This reduction of the longitudinal information to a scalar feature precedes estimation of a treatment regime and is therefore not informed by subject outcomes, treatments, or covariates. Furthermore, we show that this reduction requires more stringent causal assumptions for consistent estimation than are necessary. We propose a data-driven method for constructing maximally prescriptive yet interpretable features that can be used with standard methods for estimating optimal treatment regimes. In our proposed framework, we treat the subject longitudinal information as a realization of a stochastic process observed with error at discrete time points. Functionals of this latent process are then combined with outcome models to estimate an optimal treatment regime. The proposed methodology requires weaker causal assumptions than Q-learning with an ad hoc scalar summary and is consistent for the optimal treatment regime.

Schnell P, Tang Q, Müller P, Carlin BP. Subgroup inference for multiple treatments and multiple endpoints in an Alzheimer’s disease treatment trial. Ann Appl Stat 2017. [DOI: 10.1214/17-aoas1024] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Indexed: 11/19/2022]

Linn KA, Laber EB, Stefanski LA. Interactive Q-learning for Quantiles. J Am Stat Assoc 2017;112:638-649. [PMID: 28890584 PMCID: PMC5586239 DOI: 10.1080/01621459.2016.1155993] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Received: 07/01/2014] [Revised: 01/01/2016] [Indexed: 12/18/2022]

Wang Y, Fu H, Zeng D. Learning Optimal Personalized Treatment Rules in Consideration of Benefit and Risk: with an Application to Treating Type 2 Diabetes Patients with Insulin Therapies. J Am Stat Assoc 2017;113:1-13. [PMID: 30034060 PMCID: PMC6051551 DOI: 10.1080/01621459.2017.1303386] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Received: 12/01/2015] [Revised: 01/01/2017] [Indexed: 12/26/2022]

Chen G, Zeng D, Kosorok MR. Personalized Dose Finding Using Outcome Weighted Learning. J Am Stat Assoc 2017;111:1509-1521. [PMID: 28255189 PMCID: PMC5327863 DOI: 10.1080/01621459.2016.1148611] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Received: 12/01/2014] [Revised: 12/01/2015] [Indexed: 10/22/2022]

Lizotte DJ, Laber EB. Multi-Objective Markov Decision Processes for Data-Driven Decision Support. J Mach Learn Res 2016;17:211. [PMID: 28018133 PMCID: PMC5179144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/06/2023]

Laber EB, Zhao YQ, Regh T, Davidian M, Tsiatis A, Stanford JB, Zeng D, Song R, Kosorok MR. Using pilot data to size a two-arm randomized trial to find a nearly optimal personalized treatment strategy. Stat Med 2015;35:1245-1256. [PMID: 26506890 DOI: 10.1002/sim.6783] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Received: 01/06/2015] [Revised: 10/07/2015] [Accepted: 10/08/2015] [Indexed: 12/18/2022]

Wu F, Laber EB, Lipkovich IA, Severus E. Who will benefit from antidepressants in the acute treatment of bipolar depression? A reanalysis of the STEP-BD study by Sachs et al. 2007, using Q-learning. Int J Bipolar Disord 2015;3:7. [PMID: 25844303 PMCID: PMC4383759 DOI: 10.1186/s40345-014-0018-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Received: 06/30/2014] [Accepted: 12/30/2014] [Indexed: 11/10/2022]

Laber EB, Linn KA, Stefanski LA. Interactive model building for Q-learning. Biometrika 2014;101:831-847. [PMID: 25541562 PMCID: PMC4274394 DOI: 10.1093/biomet/asu043] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Indexed: 12/24/2022]

Lei J. Classification with confidence. Biometrika 2014. [DOI: 10.1093/biomet/asu038] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Indexed: 11/13/2022]

Anderson K, Joffe M, Kosorok MR. University of Pennsylvania 6th annual conference on statistical issues in clinical trials: Dynamic treatment regimes (morning session). Clin Trials 2014;11:418-425. [DOI: 10.1177/1740774514538553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/16/2022]

Zhao YQ, Laber EB. Estimation of optimal dynamic treatment regimes. Clin Trials 2014;11:400-407. [PMID: 24872361 PMCID: PMC4247353 DOI: 10.1177/1740774514532570] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Indexed: 11/17/2022]

Laber EB, Lizotte DJ, Qian M, Pelham WE, Murphy SA. Rejoinder of “Dynamic treatment regimes: Technical challenges and applications”. Electron J Stat 2014. [DOI: 10.1214/14-ejs920rej] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Indexed: 11/19/2022]

Lizotte DJ, Bowling M, Murphy SA. Linear Fitted-Q Iteration with Multiple Reward Functions. J Mach Learn Res 2012;13:3253-3295. [PMID: 23741197 PMCID: PMC3670261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/02/2023]