1
|
Zhu P, Jin S, Bu X, Hou Z. Improved Model-Free Adaptive Control for MIMO Nonlinear Systems With Event-Triggered Transmission Scheme and Quantization. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:5867-5880. [PMID: 36170394 DOI: 10.1109/tcyb.2022.3203036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
In this article, an improved model-free adaptive control (iMFAC) is proposed for discrete-time multi-input multioutput (MIMO) nonlinear systems with an event-triggered transmission scheme and quantization (ETQ). First, an event-triggered scheme is designed, and the structure of the uniform quantizer with an encoding-decoding mechanism is given. With the concept of partial form dynamic linearization based on event-triggered and quantization (PFDL-ETQ), a linearized data model of the MIMO nonlinear system is constructed. Then, an improved model-free adaptive controller with the ETQ process is designed. By this design, the update of the pseudo partitioned Jacobean matrix (PPJM) estimates and control inputs occurs only when the trigger conditions are met, which reduces the network transmission burden and saves the computing resources. Theoretical analysis shows that the proposed iMFAC with the ETQ process can achieve a bounded convergence of tracking error. Finally, a numerical simulation and a biaxial gantry motor contour tracking control system simulation are given to illustrate the feasibility of the proposed iMFAC method with the ETQ process.
Collapse
|
2
|
Zhang X, Li B, Li Z, Yang C, Chen X, Su CY. Adaptive Neural Digital Control of Hysteretic Systems With Implicit Inverse Compensator and Its Application on Magnetostrictive Actuator. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; 33:667-680. [PMID: 33079682 DOI: 10.1109/tnnls.2020.3028500] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Hysteresis is a complex nonlinear effect in smart materials-based actuators, which degrades the positioning performance of the actuator, especially when the hysteresis shows asymmetric characteristics. In order to mitigate the asymmetric hysteresis effect, an adaptive neural digital dynamic surface control (DSC) scheme with the implicit inverse compensator is developed in this article. The implicit inverse compensator for the purpose of compensating for the hysteresis effect is applied to find the compensation signal by searching the optimal control laws from the hysteresis output, which avoids the construction of the inverse hysteresis model. The adaptive neural digital controller is achieved by using a discrete-time neural network controller to realize the discretization of time and quantizing the control signal to realize the discretization of the amplitude. The adaptive neural digital controller ensures the semiglobally uniformly ultimately bounded (SUUB) of all signals in the closed-loop control system. The effectiveness of the proposed approach is validated via the magnetostrictive-actuated system.
Collapse
|
3
|
Zhang P, Yuan Y, Guo L. Fault-Tolerant Optimal Control for Discrete-Time Nonlinear System Subjected to Input Saturation: A Dynamic Event-Triggered Approach. IEEE TRANSACTIONS ON CYBERNETICS 2021; 51:2956-2968. [PMID: 31265427 DOI: 10.1109/tcyb.2019.2923011] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
This paper investigates the dynamic event-triggered fault-tolerant optimal control strategy for a class of output feedback nonlinear discrete-time systems subject to actuator faults and input saturations. To save the communication resources between the sensor and the controller, the so-called dynamic event-triggered mechanism is adopted to schedule the measurement signal. A neural network-based observer is first designed to provide both the system states and fault information. Then, with consideration of the actuator saturation phenomenon, the adaptive dynamic programming (ADP) algorithm is designed based on the estimates provided by the observer. To reduce the computational burden, the optimal control strategy is implemented via the single network adaptive critic architecture. The sufficient conditions are provided to guarantee the boundedness of the overall closed-loop systems. Finally, the numerical simulations on a two-link flexible manipulator system are provided to verify the validity of the proposed control strategy.
Collapse
|
4
|
Xu W, Liu X, Wang H, Zhou Y. Event-based optimal output-feedback control of nonlinear discrete-time systems. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.05.098] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
5
|
Zhao B, Liu D, Luo C. Reinforcement Learning-Based Optimal Stabilization for Unknown Nonlinear Systems Subject to Inputs With Uncertain Constraints. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020; 31:4330-4340. [PMID: 31899437 DOI: 10.1109/tnnls.2019.2954983] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
This article presents a novel reinforcement learning strategy that addresses an optimal stabilizing problem for unknown nonlinear systems subject to uncertain input constraints. The control algorithm is composed of two parts, i.e., online learning optimal control for the nominal system and feedforward neural networks (NNs) compensation for handling uncertain input constraints, which are considered as the saturation nonlinearities. Integrating the input-output data and recurrent NN, a Luenberger observer is established to approximate the unknown system dynamics. For nominal systems without input constraints, the online learning optimal control policy is derived by solving Hamilton-Jacobi-Bellman equation via a critic NN alone. By transforming the uncertain input constraints to saturation nonlinearities, the uncertain input constraints can be compensated by employing a feedforward NN compensator. The convergence of the closed-loop system is guaranteed to be uniformly ultimately bounded by using the Lyapunov stability analysis. Finally, the effectiveness of the developed stabilization scheme is illustrated by simulation studies.
Collapse
|
6
|
Discrete-Time Neural Control of Quantized Nonlinear Systems with Delays: Applied to a Three-Phase Linear Induction Motor. ELECTRONICS 2020. [DOI: 10.3390/electronics9081274] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
This work introduces a neural-feedback control scheme for discrete-time quantized nonlinear systems with time delay. Traditionally, a feedback controller is designed under ideal assumptions that are unrealistic for real-work problems. Among these assumptions, they consider a perfect communication channel for controller inputs and outputs; such a perfect channel does not consider delays, or noise introduced by the sensors and actuators even if such undesired phenomena are well-known sources of bad performance in the systems. Moreover, traditional controllers are also designed based on an ideal plant model without considering uncertainties, disturbances, sensors, actuators, and other unmodeled dynamics, which for real-life applications are effects that are constantly present and should be considered. Furthermore, control system design implemented with digital processors implies sampling and holding processes that can affect the performance; considering and compensating quantization effects of measured signals is a problem that has attracted the attention of control system researchers. In this paper, a neural controller is proposed to overcome the problems mentioned above. This controller is designed based on a neural model using an inverse optimal approach. The neural model is obtained from available measurements of the state variables and system outputs; therefore, uncertainties, disturbances, and unmodeled dynamics can be implicitly considered from the available measurements. This paper shows the performance and effectiveness of the proposed controller presenting real-time results obtained on a linear induction motor prototype. Also, this work includes stability proof for the whole scheme using the Lyapunov approach.
Collapse
|
7
|
Ding D, Wang Z, Han QL. Neural-Network-Based Consensus Control for Multiagent Systems With Input Constraints: The Event-Triggered Case. IEEE TRANSACTIONS ON CYBERNETICS 2020; 50:3719-3730. [PMID: 31329155 DOI: 10.1109/tcyb.2019.2927471] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
In this paper, the neural-network (NN)-based consensus control problem is investigated for a class of discrete-time nonlinear multiagent systems (MASs) with a leader subject to input constraints. Relative measurements related to local tracking errors are collected via some smart sensors. A local nonquadratic cost function is first introduced to evaluate the control performance with input constraints. Then, in view of the relative measurements, an NN-based observer under the event-triggered mechanism is designed to reconstruct the dynamics of the local tracking errors, where the adopted event-triggered condition has a time-dependent threshold and the weight of NNs is updated via a new adaptive tuning law catering to the employed event-triggered mechanism. Furthermore, an ideal control policy is developed for the addressed consensus control problem while minimizing the prescribed local nonquadratic cost function. Moreover, an actor-critic NN scheme with online learning is employed to realize the obtained control policy, where the critic NN is a three-layer structure with powerful approximation capability. Through extensive mathematical analysis, the consensus condition is established for the underlying MAS, and the boundedness of the estimated errors is proven for actor and critic NN weights. In addition, the effect from the adopted event-triggered mechanism on the local cost is thoroughly discussed, and the upper bound of the corresponding increment is derived in comparison with time-triggered cases. Finally, a simulation example is utilized to illustrate the usefulness of the proposed controller design scheme.
Collapse
|
8
|
Event-triggered constrained control with DHP implementation for nonaffine discrete-time systems. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.01.020] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
9
|
Huo S, Zhang Y. H ∞ consensus of Markovian jump multi-agent systems under multi-channel transmission via output feedback control strategy. ISA TRANSACTIONS 2020; 99:28-36. [PMID: 31561874 DOI: 10.1016/j.isatra.2019.09.016] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 09/16/2019] [Accepted: 09/16/2019] [Indexed: 06/10/2023]
Abstract
This paper studies the H∞ consensus problem of discrete-time multi-agent systems with Markovian jump parameters and exogenous disturbances via static output feedback control strategy. Each agent's measurement information is transmitted to its neighbors through the multiple quantized channels in which each channel obtains transmission permission based on a Markovian jump dispatcher. A static output feedback consensus controller is employed. In terms of an appropriate Lyapunov function, some sufficient conditions are derived to assure stochastic consensus for multi-agent systems subject to a specified H∞ performance. At length, a simulation example is provided to illustrate the correctness of the result.
Collapse
Affiliation(s)
- Shicheng Huo
- School of Automation, Southeast University, Nanjing 210096, China; Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education, Nanjing, 210096, China.
| | - Ya Zhang
- School of Automation, Southeast University, Nanjing 210096, China; Key Laboratory of Measurement and Control of Complex Systems of Engineering, Ministry of Education, Nanjing, 210096, China.
| |
Collapse
|
10
|
Liu Y, Liu X, Jing Y, Chen X, Qiu J. Direct Adaptive Preassigned Finite-Time Control With Time-Delay and Quantized Input Using Neural Network. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2020; 31:1222-1231. [PMID: 31247570 DOI: 10.1109/tnnls.2019.2919577] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
This paper investigates an adaptive finite-time control (FTC) problem for a class of strict-feedback nonlinear systems with both time-delays and quantized input from a new point of view. First, a new concept, called preassigned finite-time performance function (PFTF), is defined. Then, another novel notion, called practically preassigned finite-time stability (PPFTS), is introduced. With PFTF and PPFTS in hand, a novel sufficient condition of the FTC is given by using the neural network (NN) control and direct adaptive backstepping technique, which is different from the existing results. In addition, a modified barrier function is first introduced in this work. Moreover, this work is first to focus on the FTC for the situation that the time-delay and quantized input simultaneously exist in the nonlinear systems. Finally, simulation results are carried out to illustrate the effectiveness of the proposed scheme.
Collapse
|
11
|
Treesatayapun C. Knowledge-based reinforcement learning controller with fuzzy-rule network: experimental validation. Neural Comput Appl 2019. [DOI: 10.1007/s00521-019-04509-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
12
|
Zhang P, Yuan Y, Yang H, Liu H. Near-Nash Equilibrium Control Strategy for Discrete-Time Nonlinear Systems With Round-Robin Protocol. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2019; 30:2478-2492. [PMID: 30602423 DOI: 10.1109/tnnls.2018.2884674] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
In this paper, the near-Nash equilibrium (NE) control strategies are investigated for a class of discrete-time nonlinear systems subjected to the round-robin protocol (RRP). In the studied systems, three types of complexities, namely, the additive nonlinearities, the RRP, and the output feedback form of controllers, are simultaneously taken into consideration. To tackle these complexities, an approximate dynamic programing (ADP) algorithm is first developed for NE control strategies by solving the coupled Bellman's equation. Then, a Luenberger-type observer is designed under the RRP scheduling to estimate the system states. The near-NE control strategies are implemented via the actor-critic neural networks. More importantly, the stability analysis of the closed-loop system is conducted to guarantee that the studied system with the proposed control strategies is bounded stable. Finally, simulation results are provided to demonstrate the validity of the proposed method.
Collapse
|
13
|
Ding D, Wang Z, Han QL, Wei G. Neural-Network-Based Output-Feedback Control Under Round-Robin Scheduling Protocols. IEEE TRANSACTIONS ON CYBERNETICS 2019; 49:2372-2384. [PMID: 29994553 DOI: 10.1109/tcyb.2018.2827037] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The neural-network (NN)-based output-feedback control is considered for a class of stochastic nonlinear systems under round-Robin (RR) scheduling protocols. For the purpose of effectively mitigating data congestions and saving energies, the RR protocols are implemented and the resulting nonlinear systems become the so-called protocol-induced periodic ones. Taking such a periodic characteristic into account, an NN-based observer is first proposed to reconstruct the system states where a novel adaptive tuning law on NN weights is adopted to cater to the requirement of performance analysis. In addition, with the established boundedness of the periodic systems in the mean-square sense, the desired observer gain is obtained by solving a set of matrix inequalities. Then, an actor-critic NN scheme with a time-varying step length in adaptive law is developed to handle the considered control problem with terminal constraints over finite-horizon. Some sufficient conditions are derived to guarantee the boundedness of estimation errors of critic and actor NN weights. In view of these conditions, some key parameters in adaptive tuning laws are easily determined via elementary algebraic operations. Furthermore, the stability in the mean-square sense is investigated for the discussed issue in infinite horizon. Finally, a simulation example is utilized to illustrate the applicability of the proposed control scheme.
Collapse
|
14
|
Liang Y, Zhang H, Cai Y, Sun S. A neural network-based approach for solving quantized discrete-time H∞ optimal control with input constraint over finite-horizon. Neurocomputing 2019. [DOI: 10.1016/j.neucom.2018.12.031] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
15
|
Talaei B, Jagannathan S, Singler J. Output Feedback-Based Boundary Control of Uncertain Coupled Semilinear Parabolic PDE Using Neurodynamic Programming. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018; 29:1263-1274. [PMID: 28287988 DOI: 10.1109/tnnls.2017.2669941] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
In this paper, neurodynamic programming-based output feedback boundary control of distributed parameter systems governed by uncertain coupled semilinear parabolic partial differential equations (PDEs) under Neumann or Dirichlet boundary control conditions is introduced. First, Hamilton-Jacobi-Bellman (HJB) equation is formulated in the original PDE domain and the optimal control policy is derived using the value functional as the solution of the HJB equation. Subsequently, a novel observer is developed to estimate the system states given the uncertain nonlinearity in PDE dynamics and measured outputs. Consequently, the suboptimal boundary control policy is obtained by forward-in-time estimation of the value functional using a neural network (NN)-based online approximator and estimated state vector obtained from the NN observer. Novel adaptive tuning laws in continuous time are proposed for learning the value functional online to satisfy the HJB equation along system trajectories while ensuring the closed-loop stability. Local uniformly ultimate boundedness of the closed-loop system is verified by using Lyapunov theory. The performance of the proposed controller is verified via simulation on an unstable coupled diffusion reaction process.
Collapse
|
16
|
Wei Q, Liu D, Lin Q. Discrete-Time Local Value Iteration Adaptive Dynamic Programming: Admissibility and Termination Analysis. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2017; 28:2490-2502. [PMID: 27529879 DOI: 10.1109/tnnls.2016.2593743] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
In this paper, a novel local value iteration adaptive dynamic programming (ADP) algorithm is developed to solve infinite horizon optimal control problems for discrete-time nonlinear systems. The focuses of this paper are to study admissibility properties and the termination criteria of discrete-time local value iteration ADP algorithms. In the discrete-time local value iteration ADP algorithm, the iterative value functions and the iterative control laws are both updated in a given subset of the state space in each iteration, instead of the whole state space. For the first time, admissibility properties of iterative control laws are analyzed for the local value iteration ADP algorithm. New termination criteria are established, which terminate the iterative local ADP algorithm with an admissible approximate optimal control law. Finally, simulation results are given to illustrate the performance of the developed algorithm.In this paper, a novel local value iteration adaptive dynamic programming (ADP) algorithm is developed to solve infinite horizon optimal control problems for discrete-time nonlinear systems. The focuses of this paper are to study admissibility properties and the termination criteria of discrete-time local value iteration ADP algorithms. In the discrete-time local value iteration ADP algorithm, the iterative value functions and the iterative control laws are both updated in a given subset of the state space in each iteration, instead of the whole state space. For the first time, admissibility properties of iterative control laws are analyzed for the local value iteration ADP algorithm. New termination criteria are established, which terminate the iterative local ADP algorithm with an admissible approximate optimal control law. Finally, simulation results are given to illustrate the performance of the developed algorithm.
Collapse
Affiliation(s)
- Qinglai Wei
- The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, China
| | - Derong Liu
- School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing, China
| | - Qiao Lin
- The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, China
| |
Collapse
|
17
|
Luo B, Liu D, Huang T, Yang X, Ma H. Multi-step heuristic dynamic programming for optimal control of nonlinear discrete-time systems. Inf Sci (N Y) 2017. [DOI: 10.1016/j.ins.2017.05.005] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
18
|
Zhong X, He H. An Event-Triggered ADP Control Approach for Continuous-Time System With Unknown Internal States. IEEE TRANSACTIONS ON CYBERNETICS 2017; 47:683-694. [PMID: 26929082 DOI: 10.1109/tcyb.2016.2523878] [Citation(s) in RCA: 84] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
This paper proposes a novel event-triggered adaptive dynamic programming (ADP) control method for nonlinear continuous-time system with unknown internal states. Comparing with the traditional ADP design with a fixed sample period, the event-triggered method samples the state and updates the controller only when it is necessary. Therefore, the computation cost and transmission load are reduced. Usually, the event-triggered method is based on the system entire state which is either infeasible or very difficult to obtain in practice applications. This paper integrates a neural-network-based observer to recover the system internal states from the measurable feedback. Both the proposed observer and the controller are aperiodically updated according to the designed triggering condition. Neural network techniques are applied to estimate the performance index and help calculate the control action. The stability analysis of the proposed method is also demonstrated by Lyapunov construct for both the continuous and jump dynamics. The simulation results verify the theoretical analysis and justify the efficiency of the proposed method.
Collapse
|