1. Wang Z, Wang X, Pang N. Dynamic event-triggered controller design for nonlinear systems: Reinforcement learning strategy. Neural Netw 2023; 163:341-353. [PMID: 37099897] [DOI: 10.1016/j.neunet.2023.04.008]
Abstract
This investigation addresses the optimal control problem for discrete-time nonstrict-feedback nonlinear systems by invoking the reinforcement learning-based backstepping technique and neural networks. The dynamic-event-triggered control strategy introduced in this paper can reduce the communication frequency between the actuator and controller. Based on the reinforcement learning strategy, actor-critic neural networks are employed to implement the n-order backstepping framework. Then, a neural-network weight-updating algorithm is developed to minimize the computational burden and avoid the local-optimum problem. Furthermore, a novel dynamic-event-triggered strategy is introduced, which can remarkably outperform the previously studied static-event-triggered strategy. Moreover, combined with Lyapunov stability theory, all signals in the closed-loop system are strictly proven to be semiglobally uniformly ultimately bounded. Finally, the practicality of the proposed control algorithms is further illustrated by numerical simulation examples.
Affiliation(s)
- Zichen Wang
- College of Westa, Southwest University, Chongqing, 400715, China
- Xin Wang
- College of Electronic and Information Engineering, Southwest University, Chongqing, 400715, China
- Ning Pang
- College of Westa, Southwest University, Chongqing, 400715, China

2. Lauffenburger JC, Yom-Tov E, Keller PA, McDonnell ME, Bessette LG, Fontanet CP, Sears ES, Kim E, Hanken K, Buckley JJ, Barlev RA, Haff N, Choudhry NK. REinforcement learning to improve non-adherence for diabetes treatments by Optimising Response and Customising Engagement (REINFORCE): study protocol of a pragmatic randomised trial. BMJ Open 2021; 11:e052091. [PMID: 34862289] [PMCID: PMC8647547] [DOI: 10.1136/bmjopen-2021-052091]
Abstract
INTRODUCTION: Achieving optimal diabetes control requires several daily self-management behaviours, especially adherence to medication. Evidence supports the use of text messages to support adherence, but there remains much opportunity to improve their effectiveness. One key limitation is that message content has been generic. By contrast, reinforcement learning is a machine learning method that can be used to identify individuals' patterns of responsiveness by observing their response to cues and then optimising them accordingly. Despite its demonstrated benefits outside of healthcare, its application to tailoring communication for patients has received limited attention. The objective of this trial is to test the impact of a reinforcement learning-based text messaging programme on adherence to medication for patients with type 2 diabetes.

METHODS AND ANALYSIS: In the REinforcement learning to Improve Non-adherence For diabetes treatments by Optimising Response and Customising Engagement (REINFORCE) trial, we are randomising 60 patients with suboptimal diabetes control treated with oral diabetes medications to receive a reinforcement learning intervention or control. Subjects in both arms will receive electronic pill bottles to use, and those in the intervention arm will receive up to daily text messages. The messages will be individually adapted using a reinforcement learning prediction algorithm based on daily adherence measurements from the pill bottles. The trial's primary outcome is average adherence to medication over the 6-month follow-up period. Secondary outcomes include diabetes control, measured by glycated haemoglobin A1c, and self-reported adherence. In sum, the REINFORCE trial will evaluate the effect of personalising the framing of text messages for patients to support medication adherence and provide insight into how this could be adapted at scale to improve other self-management interventions.

ETHICS AND DISSEMINATION: This study was approved by the Mass General Brigham Institutional Review Board (IRB) (USA). Findings will be disseminated through peer-reviewed journals, ClinicalTrials.gov reporting and conferences.

TRIAL REGISTRATION NUMBER: NCT04473326 (ClinicalTrials.gov).
Affiliation(s)
- Julie C Lauffenburger
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Elad Yom-Tov
- Microsoft Research, Microsoft, Herzeliya, Israel
- Punam A Keller
- Tuck School of Business, Dartmouth College, Hanover, NH, USA
- Marie E McDonnell
- Endocrinology, Diabetes and Hypertension, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Lily G Bessette
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Constance P Fontanet
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Ellen S Sears
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Erin Kim
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Kaitlin Hanken
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- J Joseph Buckley
- Division of Sleep Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Renee A Barlev
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Nancy Haff
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Niteesh K Choudhry
- Center for Healthcare Delivery Sciences, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Division of Pharmacoepidemiology and Pharmacoeconomics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA

3. Elkenawy A, El-Nagar AM, El-Bardini M, El-Rabaie NM. Full-state neural network observer-based hybrid quantum diagonal recurrent neural network adaptive tracking control. Neural Comput Appl 2021. [DOI: 10.1007/s00521-020-05685-x]

4. Ni X, Wen S, Wang H, Guo Z, Zhu S, Huang T. Observer-Based Quasi-Synchronization of Delayed Dynamical Networks With Parameter Mismatch Under Impulsive Effect. IEEE Trans Neural Netw Learn Syst 2021; 32:3046-3055. [PMID: 32745009] [DOI: 10.1109/tnnls.2020.3009271]
Abstract
This article focuses on the observer-based quasi-synchronization problem of delayed dynamical networks with parameter mismatch under impulsive effect. First, since the state of each node is unknown in practice, a state estimation strategy is proposed to estimate the state of each node, so as to design an appropriate synchronization controller. Then, the corresponding controller is constructed to synchronize the slave nodes with their leader node. In this article, we take the impulsive effect into consideration, which means that an impulsive signal will be applied to the system intermittently. Due to the existence of parameter mismatch and time-varying delay, by constructing an appropriate Lyapunov function, we eventually obtain a differential equation with constant and time-varying delay terms. Then, we analyze its trajectory by introducing the Cauchy matrix and prove its boundedness by contradiction. Finally, a numerical simulation is presented to illustrate the validity of the obtained results.

5. Calafiore GC, Possieri C. Output Feedback Q-Learning for Linear-Quadratic Discrete-Time Finite-Horizon Control Problems. IEEE Trans Neural Netw Learn Syst 2021; 32:3274-3281. [PMID: 32745011] [DOI: 10.1109/tnnls.2020.3010304]
Abstract
An algorithm is proposed to determine output feedback policies that solve finite-horizon linear-quadratic (LQ) optimal control problems without requiring knowledge of the system dynamical matrices. To reach this goal, the Q-factors arising from finite-horizon LQ problems are first characterized in the state feedback case. It is then shown how they can be parameterized as functions of the input-output vectors. A procedure is then proposed for estimating these functions from input/output data and using the estimates to compute the optimal control via the measured inputs and outputs.

6. Bai W, Li T, Tong S. NN Reinforcement Learning Adaptive Control for a Class of Nonstrict-Feedback Discrete-Time Systems. IEEE Trans Cybern 2020; 50:4573-4584. [PMID: 31995515] [DOI: 10.1109/tcyb.2020.2963849]
Abstract
This article investigates an adaptive reinforcement learning (RL) optimal control design problem for a class of nonstrict-feedback discrete-time systems. Based on the neural network (NN) approximating ability and the RL control design technique, an adaptive backstepping RL optimal controller and a minimal learning parameter (MLP) adaptive RL optimal controller are developed by establishing a novel strategic utility function and introducing external function terms. It is proved that the proposed adaptive RL optimal controllers can guarantee that all signals in the closed-loop systems are semiglobally uniformly ultimately bounded (SGUUB). The main feature is that the proposed schemes can solve optimal control problems that the previous literature could not handle. Furthermore, the proposed MLP adaptive optimal control scheme can reduce the number of adaptive laws, and thus the computational complexity is decreased. Finally, the simulation results illustrate the validity of the proposed optimal control schemes.

7. Xu W, Liu X, Wang H, Zhou Y. Event-based optimal output-feedback control of nonlinear discrete-time systems. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.05.098]

8. Zhao B, Liu D, Luo C. Reinforcement Learning-Based Optimal Stabilization for Unknown Nonlinear Systems Subject to Inputs With Uncertain Constraints. IEEE Trans Neural Netw Learn Syst 2020; 31:4330-4340. [PMID: 31899437] [DOI: 10.1109/tnnls.2019.2954983]
Abstract
This article presents a novel reinforcement learning strategy that addresses an optimal stabilization problem for unknown nonlinear systems subject to uncertain input constraints. The control algorithm is composed of two parts, i.e., online learning optimal control for the nominal system and feedforward neural network (NN) compensation for handling the uncertain input constraints, which are treated as saturation nonlinearities. Integrating the input-output data and a recurrent NN, a Luenberger observer is established to approximate the unknown system dynamics. For nominal systems without input constraints, the online learning optimal control policy is derived by solving the Hamilton-Jacobi-Bellman equation via a critic NN alone. By transforming the uncertain input constraints into saturation nonlinearities, they can be compensated by employing a feedforward NN compensator. The convergence of the closed-loop system is guaranteed to be uniformly ultimately bounded by using Lyapunov stability analysis. Finally, the effectiveness of the developed stabilization scheme is illustrated by simulation studies.

9. Huang M, Liu C, He X, Ma L, Lu Z, Su H. Reinforcement Learning-Based Control for Nonlinear Discrete-Time Systems with Unknown Control Directions and Control Constraints. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.03.061]

10. Guo X, Yan W, Cui R. Event-Triggered Reinforcement Learning-Based Adaptive Tracking Control for Completely Unknown Continuous-Time Nonlinear Systems. IEEE Trans Cybern 2020; 50:3231-3242. [PMID: 30946687] [DOI: 10.1109/tcyb.2019.2903108]
Abstract
In this paper, event-triggered reinforcement learning-based adaptive tracking control is developed for the continuous-time nonlinear system with unknown dynamics and external disturbances. The critic and action neural networks are designed to approximate an unknown long-term performance index and controller, respectively. The dead-zone event-triggered condition is developed to reduce communication and computational costs. Rigorous theoretical analysis is provided to show that the closed-loop system can be stabilized. The weight errors and the filtered tracking error are all uniformly ultimately bounded. Finally, to demonstrate the developed controller, the simulation results are provided using an autonomous underwater vehicle model.

11. Rizvi SAA, Lin Z. Output Feedback Q-Learning Control for the Discrete-Time Linear Quadratic Regulator Problem. IEEE Trans Neural Netw Learn Syst 2019; 30:1523-1536. [PMID: 30296242] [DOI: 10.1109/tnnls.2018.2870075]
Abstract
Approximate dynamic programming (ADP) and reinforcement learning (RL) have emerged as important tools in the design of optimal and adaptive control systems. Most of the existing RL and ADP methods make use of full-state feedback, a requirement that is often difficult to satisfy in practical applications. As a result, output feedback methods are more desirable as they relax this requirement. In this paper, we present a new output feedback-based Q-learning approach to solving the linear quadratic regulation (LQR) control problem for discrete-time systems. The proposed scheme is completely online in nature and works without requiring the system dynamics information. More specifically, a new representation of the LQR Q-function is developed in terms of the input-output data. Based on this new Q-function representation, output feedback LQR controllers are designed. We present two output feedback iterative Q-learning algorithms based on the policy iteration and the value iteration methods. This scheme has the advantage that it does not incur any excitation noise bias, and therefore, the need of using discounted cost functions is circumvented, which in turn ensures closed-loop stability. It is shown that the proposed algorithms converge to the solution of the LQR Riccati equation. A comprehensive simulation study is carried out, which illustrates the proposed scheme.
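The policy-iteration flavour of Q-learning for the LQR described in this entry can be sketched in a state-feedback simplification (the paper's actual contribution is the output-feedback form, which replaces states with input-output histories). Everything below — the example system, the initial gain, and the sample counts — is illustrative and not taken from the paper; the model matrices are used only to generate data, never inside the learning update:

```python
import numpy as np

# Illustrative system (not from the paper): a discretized double integrator.
A = np.array([[1.0, 0.1],
              [0.0, 1.0]])
B = np.array([[0.0],
              [0.1]])
Qc, Rc = np.eye(2), np.array([[1.0]])   # stage cost x'Qx + u'Ru
n, m = 2, 1

def riccati_gain(A, B, Q, R, iters=500):
    """Model-based baseline: iterate the Riccati recursion for the LQR gain."""
    P = Q.copy()
    for _ in range(iters):
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ A - A.T @ P @ B @ K
    return K                             # optimal policy is u = -K x

def features(z):
    """Quadratic features z_i z_j (i <= j) parameterizing z' H z."""
    d = len(z)
    return np.array([z[i] * z[j] for i in range(d) for j in range(i, d)])

def theta_to_H(theta, d):
    """Recover the symmetric Q-function matrix H from the feature weights."""
    H, k = np.zeros((d, d)), 0
    for i in range(d):
        for j in range(i, d):
            if i == j:
                H[i, i] = theta[k]
            else:
                H[i, j] = H[j, i] = theta[k] / 2.0
            k += 1
    return H

def q_learning_lqr(A, B, Q, R, K0, pi_steps=10, samples=400, seed=0):
    """Policy-iteration Q-learning: the model is used only to generate data."""
    rng, K = np.random.default_rng(seed), K0
    for _ in range(pi_steps):
        Phi, cost = [], []
        for _ in range(samples):
            x = rng.standard_normal(n)
            u = rng.standard_normal(m)      # exploratory (exciting) input
            xp = A @ x + B @ u              # one-step transition (x, u, x+)
            z, zp = np.r_[x, u], np.r_[xp, -K @ xp]
            # Bellman equation for the policy's Q-function:
            #   z' H z - zp' H zp = x'Qx + u'Ru
            Phi.append(features(z) - features(zp))
            cost.append(x @ Q @ x + u @ R @ u)
        theta = np.linalg.lstsq(np.array(Phi), np.array(cost), rcond=None)[0]
        H = theta_to_H(theta, n + m)
        # Greedy policy improvement from the Q-function blocks: K = Huu^-1 Hux.
        K = np.linalg.solve(H[n:, n:], H[n:, :n])
    return K

K0 = np.array([[1.0, 2.0]])                 # a stabilizing initial gain
K_rl = q_learning_lqr(A, B, Qc, Rc, K0)
```

Because the exploratory input enters the data directly rather than as additive noise on a fixed policy, the least-squares step incurs no excitation bias — the property the paper emphasizes — and the learned gain matches the Riccati solution.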

12. Zhu J, Zhu J, Wang Z, Guo S, Xu C. Hierarchical Decision and Control for Continuous Multitarget Problem: Policy Evaluation With Action Delay. IEEE Trans Neural Netw Learn Syst 2019; 30:464-473. [PMID: 29994732] [DOI: 10.1109/tnnls.2018.2844466]
Abstract
This paper proposes a hierarchical decision-making and control algorithm for the shepherd game, the seventh mission in the International Aerial Robotics Competition (IARC). In this game, the agent (a multirotor aerial robot) is required to contact targets (ground vehicles) sequentially and drive them to a certain boundary to earn points. During the 10-min game, the agent should be fully autonomous without any human interference. Owing to the lower-level controller and dynamics of the agent, each action takes a duration of time to accomplish. This action duration, denoted as an action delay in this paper, is nonconstant and is related to the final reward. Therefore, the challenge is to make the agent "aware of time" when applying a certain action. We solve this problem with two approaches: deep Q-networks and a lookup table. The action delay predictor in the decision level is fitted by a lower-level controller. Through simulations of the shepherd game, the effectiveness and efficiency of this approach are validated. This approach helped our team win first prize in IARC 2017 and holds the best record for this mission since it was released in 2013.

13. Adaptive neural network tracking control-based reinforcement learning for wheeled mobile robots with skidding and slipping. Neurocomputing 2018. [DOI: 10.1016/j.neucom.2017.12.051]

14. Adaptive tracking control for a class of continuous-time uncertain nonlinear systems using the approximate solution of HJB equation. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2017.04.043]

15. Mu C, Ni Z, Sun C, He H. Data-Driven Tracking Control With Adaptive Dynamic Programming for a Class of Continuous-Time Nonlinear Systems. IEEE Trans Cybern 2017; 47:1460-1470. [PMID: 27116758] [DOI: 10.1109/tcyb.2016.2548941]
Abstract
A data-driven adaptive tracking control approach is proposed for a class of continuous-time nonlinear systems using a recently developed goal representation heuristic dynamic programming (GrHDP) architecture. The major focus of this paper is on designing a multivariable tracking scheme, including the filter-based action network (FAN) architecture, and on the stability analysis in a continuous-time setting. In this design, the FAN is used to observe the system function and then generates the corresponding control action together with the reference signals. The goal network provides an internal reward signal adaptively based on the current system states and the control action. This internal reward signal is assigned as the input for the critic network, which approximates the cost function over time. We demonstrate its improved tracking performance in comparison with the existing heuristic dynamic programming (HDP) approach under the same parameter and environment settings. Simulation results for multivariable tracking control on two examples are presented to show that the proposed scheme achieves better control in terms of learning speed and overall performance.

16. Mu C, Ni Z, Sun C, He H. Air-Breathing Hypersonic Vehicle Tracking Control Based on Adaptive Dynamic Programming. IEEE Trans Neural Netw Learn Syst 2017; 28:584-598. [PMID: 26863677] [DOI: 10.1109/tnnls.2016.2516948]
Abstract
In this paper, we propose a data-driven supplementary control approach with adaptive learning capability for air-breathing hypersonic vehicle tracking control based on action-dependent heuristic dynamic programming (ADHDP). The control action is generated by the combination of sliding mode control (SMC) and the ADHDP controller to track the desired velocity and the desired altitude. In particular, the ADHDP controller observes the differences between the actual velocity/altitude and the desired velocity/altitude, and then provides a supplementary control action accordingly. The ADHDP controller does not rely on an accurate mathematical model and is data driven. Meanwhile, it is capable of adjusting its parameters online over time under various working conditions, which is very suitable for hypersonic vehicle systems with parameter uncertainties and disturbances. We verify the adaptive supplementary control approach against traditional SMC in cruising flight, and provide three simulation studies to illustrate the improved performance of the proposed approach.

17. Gorban AN, Tyukin IY, Prokhorov DV, Sofeikov KI. Approximation with random bases: Pro et Contra. Inf Sci (N Y) 2016. [DOI: 10.1016/j.ins.2015.09.021]

18. Luo B, Wu HN, Huang T, Liu D. Reinforcement learning solution for HJB equation arising in constrained optimal control problem. Neural Netw 2015; 71:150-158. [DOI: 10.1016/j.neunet.2015.08.007]

19. Esfandiari K, Abdollahi F, Talebi HA. Adaptive control of uncertain nonaffine nonlinear systems with input saturation using neural networks. IEEE Trans Neural Netw Learn Syst 2015; 26:2311-2322. [PMID: 25532213] [DOI: 10.1109/tnnls.2014.2378991]
Abstract
This paper presents a tracking control methodology for a class of uncertain nonlinear systems subject to an input saturation constraint and external disturbances. Unlike most previous approaches to saturated systems, which assumed affine nonlinear systems, in this paper, the tracking control problem is solved for uncertain nonaffine nonlinear systems with input saturation. To deal with the saturation constraint, an auxiliary system is constructed and a modified tracking error is defined. Then, by employing the implicit function theorem, the mean value theorem, and the modified tracking error, updating rules are derived based on the well-known back-propagation (BP) algorithm, which has been proven to be the most relevant updating rule for control problems. However, most previous approaches based on the BP algorithm suffer from a lack of stability analysis. By injecting a damping term into the standard BP algorithm, uniform ultimate boundedness of all signals of the closed-loop system is ensured via Lyapunov's direct method. Furthermore, the presented approach employs nonlinear-in-parameter neural networks. Hence, the proposed scheme is applicable to systems with higher degrees of nonlinearity. Using a high-gain observer to reconstruct the states of the system, an output feedback controller is also presented. Finally, simulation results for a Duffing-Holmes chaotic system, a generalized pendulum-type system, and a numerical system are presented to demonstrate the effectiveness of the suggested state and output feedback control schemes.

20. Zhong X, He H, Zhang H, Wang Z. A neural network based online learning and control approach for Markov jump systems. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2014.01.060]

21. Liu YJ, Tang L, Tong S, Chen CLP, Li DJ. Reinforcement learning design-based adaptive tracking control with less learning parameters for nonlinear discrete-time MIMO systems. IEEE Trans Neural Netw Learn Syst 2015; 26:165-176. [PMID: 25438326] [DOI: 10.1109/tnnls.2014.2360724]
Abstract
Based on the neural network (NN) approximator, an online reinforcement learning algorithm is proposed for a class of affine multiple-input and multiple-output (MIMO) nonlinear discrete-time systems with unknown functions and disturbances. In the design procedure, two networks are provided: one is an action network to generate an optimal control signal, and the other is a critic network to approximate the cost function. An optimal control signal and adaptation laws can be generated based on the two NNs. In previous approaches, the weights of the critic and action networks are updated based on the gradient descent rule, and the estimates of the optimal weight vectors are directly adjusted in the design. Consequently, compared with the existing results, the main contributions of this paper are: 1) only two parameters need to be adjusted, and thus the number of adaptation laws is smaller than in previous results; and 2) the updated parameters do not depend on the number of subsystems for MIMO systems, and the tuning rules are replaced by adjusting the norms of the optimal weight vectors in both the action and critic networks. It is proven that the tracking errors, the adaptation laws, and the control inputs are uniformly bounded using the Lyapunov analysis method. Simulation examples are employed to illustrate the effectiveness of the proposed algorithm.

22. Yang X, Liu D, Wang D, Wei Q. Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning. Neural Netw 2014; 55:30-41. [DOI: 10.1016/j.neunet.2014.03.008]

23. Masaud K, Macnab C. Preventing bursting in adaptive control using an introspective neural network algorithm. Neurocomputing 2014. [DOI: 10.1016/j.neucom.2014.01.002]

24.

25. Ni Z, He H, Wen J. Adaptive learning in tracking control based on the dual critic network design. IEEE Trans Neural Netw Learn Syst 2013; 24:913-928. [PMID: 24808473] [DOI: 10.1109/tnnls.2013.2247627]
Abstract
In this paper, we present a new adaptive dynamic programming approach that integrates a reference network providing an internal goal representation to help the system's learning and optimization. Specifically, we build the reference network on top of the critic network to form a dual critic network design that contains the detailed internal goal representation to help approximate the value function. This internal goal signal, working as the reinforcement signal for the critic network in our design, is adaptively generated by the reference network and can also be adjusted automatically. In this way, we provide an alternative to crafting the reinforcement signal manually from prior knowledge. In this paper, we adopt the online action-dependent heuristic dynamic programming (ADHDP) design and provide the detailed design of the dual critic network structure. A detailed Lyapunov stability analysis for our proposed approach is presented to support the proposed structure from a theoretical point of view. Furthermore, we also develop a virtual reality platform to demonstrate real-time simulation of our approach under different disturbance situations. The overall adaptive learning performance has been tested on two tracking control benchmarks with a tracking filter. For comparative studies, we also present the tracking performance with the typical ADHDP, and the simulation results justify the improved performance of our approach.

26. Yang Q, Jagannathan S. Reinforcement Learning Controller Design for Affine Nonlinear Discrete-Time Systems Using Online Approximators. IEEE Trans Syst Man Cybern B Cybern 2012; 42:377-390. [DOI: 10.1109/tsmcb.2011.2166384]

27. Yang L, Si J, Tsakalis K, Rodriguez A. Direct Heuristic Dynamic Programming for Nonlinear Tracking Control With Filtered Tracking Error. IEEE Trans Syst Man Cybern B Cybern 2009; 39:1617-1622. [DOI: 10.1109/tsmcb.2009.2021950]

28. Shih P, Kaul B, Jagannathan S, Drallmeier J. Reinforcement-Learning-Based Output-Feedback Control of Nonstrict Nonlinear Discrete-Time Systems With Application to Engine Emission Control. IEEE Trans Syst Man Cybern B Cybern 2009; 39:1162-1179. [DOI: 10.1109/tsmcb.2009.2013272]
|
29
|
Zhang Y, Liang X, Yang P, Chen Z, Yuan Z. Modeling and Control of Nonlinear Discrete-time Systems Based on Compound Neural Networks. Chin J Chem Eng 2009. [DOI: 10.1016/s1004-9541(08)60230-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
30
|
|
31
|
Shih P, Kaul BC, Jagannathan S, Drallmeier JA. Reinforcement-learning-based dual-control methodology for complex nonlinear discrete-time systems with application to spark engine EGR operation. IEEE Trans Neural Netw 2008; 19:1369-88. [PMID: 18701368 DOI: 10.1109/tnn.2008.2000452] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
A novel reinforcement-learning-based dual-control adaptive neural network (NN) methodology is developed to deliver a desired tracking performance for a class of complex feedback nonlinear discrete-time systems, consisting of a second-order nonlinear discrete-time system in nonstrict-feedback form and an affine nonlinear discrete-time system, in the presence of bounded and unknown disturbances. For example, the exhaust gas recirculation (EGR) operation of a spark ignition (SI) engine can be modeled by such a complex nonlinear discrete-time system. A dual-controller approach is undertaken: a primary adaptive critic NN controller is designed for the nonstrict-feedback nonlinear discrete-time system, and a secondary one for the affine nonlinear discrete-time system; together, the controllers deliver the desired performance. The primary adaptive critic NN controller includes an NN observer for estimating the states and output, an NN critic, and two action NNs for generating the virtual and actual control inputs for the nonstrict-feedback nonlinear discrete-time system, whereas an additional critic NN and an action NN are included for the affine nonlinear discrete-time system under the assumption of state availability. All NN weights adapt online toward minimization of a certain performance index, using a gradient-descent-based rule. Using Lyapunov theory, the uniform ultimate boundedness (UUB) of the closed-loop tracking error, weight estimates, and observer estimates is shown. The adaptive critic NN controller's performance is evaluated on an SI engine operating with high EGR levels, where the controller objective is to reduce cyclic dispersion in heat release while minimizing fuel intake. Simulation and experimental results indicate that engine-out emissions drop significantly at 20% EGR owing to the reduction in heat-release dispersion, thus verifying the dual-control approach.
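The dual-controller wiring can be sketched as follows, with the NN approximators replaced by known closed-form terms so the structure is visible: the primary loop issues a virtual command v, and the secondary controller realises v on the affine subsystem. The dynamics f(z), g(z) and the proportional primary law are stand-in assumptions, not the SI-engine EGR model or the paper's adaptive critic design.

```python
import math

def f(z):
    # stand-in drift term of the affine subsystem
    return 0.5 * math.sin(z)

def g(z):
    # stand-in control gain; kept strictly positive so the control direction is known
    return 1.0 + 0.1 * math.cos(z)

def affine_subsystem(z, u):
    # one step of z(k+1) = f(z(k)) + g(z(k)) * u(k)
    return f(z) + g(z) * u

def primary_controller(x, x_ref, k1=0.8):
    # virtual control v: in the paper an adaptive critic NN; here a simple
    # proportional pull of the nonstrict-feedback state toward the reference
    return x_ref - k1 * (x - x_ref)

def secondary_controller(z, v):
    # actual control u chosen so the affine subsystem tracks the virtual
    # command exactly: z(k+1) = v
    return (v - f(z)) / g(z)
```

With known f and g the secondary loop inverts the affine dynamics exactly; the NN versions in the paper learn these terms online instead.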
Collapse
Affiliation(s)
- Peter Shih
- Department of Electrical and Computer Engineering, Missouri University of Science and Technology, Rolla, MO 65409, USA.
Collapse
|
32
|
Al-Tamimi A, Lewis F, Abu-Khalaf M. Discrete-Time Nonlinear HJB Solution Using Approximate Dynamic Programming: Convergence Proof. IEEE Trans Syst Man Cybern B Cybern 2008; 38:943-9. [DOI: 10.1109/tsmcb.2008.926614] [Citation(s) in RCA: 702] [Impact Index Per Article: 43.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
33
|
Wang L, Wan C. Comments on "The Extreme Learning Machine". IEEE Trans Neural Netw 2008; 19:1494-5; author reply 1495-6. [DOI: 10.1109/tnn.2008.2002273] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
34
|
Lewis FL, Huang J, Parisini T, Prokhorov DV, Wunsch DC. Special issue on neural networks for feedback control systems. IEEE Trans Neural Netw 2007; 18:969-72. [PMID: 17668654 DOI: 10.1109/tnn.2007.902966] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
35
|
He P, Jagannathan S. Reinforcement Learning Neural-Network-Based Controller for Nonlinear Discrete-Time Systems With Input Constraints. IEEE Trans Syst Man Cybern B Cybern 2007; 37:425-36. [PMID: 17416169 DOI: 10.1109/tsmcb.2006.883869] [Citation(s) in RCA: 177] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
A novel adaptive-critic-based neural network (NN) controller in discrete time is designed to deliver a desired tracking performance for a class of nonlinear systems in the presence of actuator constraints. The actuator constraints are treated in the controller design as a saturation nonlinearity. The adaptive critic NN controller architecture, based on state feedback, includes two NNs: the critic NN approximates the "strategic" utility function, whereas the action NN is employed to minimize both the strategic utility function and the unknown nonlinear dynamic estimation errors. The critic and action NN weight updates are derived by minimizing certain quadratic performance indexes. Using the Lyapunov approach and the novel weight updates, the uniform ultimate boundedness of the closed-loop tracking error and weight estimates is shown in the presence of NN approximation errors and bounded unknown disturbances. The proposed NN controller works in the presence of multiple nonlinearities, unlike other schemes that normally approximate a single nonlinearity. Moreover, the adaptive critic NN controller does not require an explicit offline training phase, and the NN weights can be initialized at zero or randomly. Simulation results justify the theoretical analysis.
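The saturation treatment and the two-network layout can be sketched as follows: the action network's raw output passes through a smooth saturation so the constraint is part of the design, and the action update descends the critic's utility estimate. The feature maps, gains, and simplified updates are illustrative assumptions rather than the paper's exact weight-tuning laws.

```python
import numpy as np

rng = np.random.default_rng(1)
U_MAX = 1.0   # actuator limit (illustrative)

def saturate(u_raw):
    # actuator constraint modelled as a smooth saturation nonlinearity
    return U_MAX * np.tanh(u_raw / U_MAX)

def features(x):
    # simple basis standing in for the action NN's hidden layer
    return np.array([x, np.sin(x), np.cos(x), 1.0])

w_action = rng.normal(0.0, 0.2, 4)   # action weights (linear-in-features for brevity)
w_critic = rng.normal(0.0, 0.2, 4)   # critic weights over (x, u) features [x, u, x*u, 1]

def control(x):
    return saturate(float(w_action @ features(x)))

def action_update(x, lr=0.05):
    # descend the critic's utility estimate with respect to the action weights
    z = features(x)
    u = control(x)
    dq_du = w_critic[1] + w_critic[2] * x     # dQ/du for the bilinear critic features
    du_duraw = 1.0 - (u / U_MAX) ** 2         # derivative of the tanh saturation
    w_action[:] -= lr * dq_du * du_duraw * z
```

Because the saturation is differentiable, its derivative simply scales the action-weight gradient, which is how the constraint enters the update rather than being clipped after the fact.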
Collapse
Affiliation(s)
- Pingan He
- Department of Electrical and Computer Engineering, University of Missouri-Rolla, Rolla, MO 65409, USA.
Collapse
|