1
|
Yue H, Xia J, Zhang J, Park JH, Xie X. Event-based adaptive fixed-time optimal control for saturated fault-tolerant nonlinear multiagent systems via reinforcement learning algorithm. Neural Netw 2025; 183:106952. [PMID: 39626531 DOI: 10.1016/j.neunet.2024.106952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Revised: 08/13/2024] [Accepted: 11/21/2024] [Indexed: 01/22/2025]
Abstract
This article investigates the problem of adaptive fixed-time optimal consensus tracking control for nonlinear multiagent systems (MASs) affected by actuator faults and input saturation. To achieve optimal control, reinforcement learning (RL) algorithm which is implemented based on neural network (NN) is employed. Under the actor-critic structure, an innovative simple positive definite function is constructed to obtain the upper bound of the estimation error of the actor-critic NN updating law, which is crucial for analyzing fixed-time stabilization. Furthermore, auxiliary functions and estimation laws are designed to eliminate the coupling effects resulting from actuator faults and input saturation. Meanwhile, a novel event-triggered mechanism (ETM) that incorporates the consensus tracking errors into the threshold is proposed, thereby effectively conserving communication resources. Based on this, a fixed-time event-triggered control scheme grounded in RL is proposed through the integration of the backstepping technique and fixed-time theory. It is demonstrated that the consensus tracking errors converge to a specified range in a fixed time and all signals within the closed-loop systems are bounded. Finally, simulation results are provided to verify the effectiveness of the proposed control strategy.
Collapse
Affiliation(s)
- Huarong Yue
- School of Mathematics Science, Liaocheng University, Liaocheng 252000, China.
| | - Jianwei Xia
- School of Mathematics Science, Liaocheng University, Liaocheng 252000, China.
| | - Jing Zhang
- School of Mathematics Science, Liaocheng University, Liaocheng 252000, China.
| | - Ju H Park
- Department of Electrical Engineering, Yeungnam University, Kyongsan, 38541, Republic of Korea.
| | - Xiangpeng Xie
- School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210023, China.
| |
Collapse
|
2
|
Wang T, Niu B, Xu N, Zhang L. ADP-based online compensation hierarchical sliding-mode control for partially unknown switched nonlinear systems with actuator failures. ISA TRANSACTIONS 2024; 155:69-81. [PMID: 39304368 DOI: 10.1016/j.isatra.2024.09.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 07/27/2024] [Accepted: 09/06/2024] [Indexed: 09/22/2024]
Abstract
This article investigates an adaptive dynamic programming-based online compensation hierarchical sliding-mode control problem for a class of partially unknown switched nonlinear systems with actuator failures and uncertain perturbations under an identifier-critic neural networks architecture. Firstly, by introducing a cost function related to hierarchical sliding-mode surfaces for the nominal system, the original control problem is equivalently converted into an optimal control problem. To obtain this optimal control policy, the Hamilton-Jacobi-Bellman equation is solved through an adaptive dynamic programming method. Compared with conventional adaptive dynamic programming methods, the identifier-critic network architecture not only overcomes the limitation on the unknown internal dynamic but also eliminates the approximation error arising from the actor network. The weights in the critic network are tuned via the gradient descent approach and the experience replay technology, such that the persistence of excitation condition can be relaxed. Then, a compensation term containing hierarchical sliding-mode surfaces is used to offset uncertain actuator failures without the fault detection and isolation unit. Based on the Lyapunov stability theory, all states of the closed-loop nonlinear system are stable in the sense of uniformly ultimately boundedness. Finally, numerical and practical examples are given to demonstrate the effectiveness of our presented online compensation control strategy.
Collapse
Affiliation(s)
- Tengda Wang
- College of Control Science and Engineering, Bohai University, Jinzhou 121013, Liaoning, China
| | - Ben Niu
- Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, Liaoning, 116024, China
| | - Ning Xu
- College of Information Science and Technology, Bohai University, Jinzhou 121013, Liaoning, China.
| | - Liang Zhang
- College of Control Science and Engineering, Bohai University, Jinzhou 121013, Liaoning, China
| |
Collapse
|
3
|
Zhao H, Wang H, Niu B, Zhao X, Alharbi KH. Event-triggered fault-tolerant control for input-constrained nonlinear systems with mismatched disturbances via adaptive dynamic programming. Neural Netw 2023; 164:508-520. [PMID: 37201311 DOI: 10.1016/j.neunet.2023.05.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 02/26/2023] [Accepted: 05/01/2023] [Indexed: 05/20/2023]
Abstract
In this paper, the issue of event-triggered optimal fault-tolerant control is investigated for input-constrained nonlinear systems with mismatched disturbances. To eliminate the effect of abrupt faults and ensure the optimal performance of general nonlinear dynamics, an adaptive dynamic programming (ADP) algorithm is employed to develop a sliding mode fault-tolerant control strategy. When the system trajectories converge to the sliding-mode surface, the equivalent sliding mode dynamics is transformed into a reformulated auxiliary system with a modified cost function. Then, a single critic neural network (NN) is adopted to solve the modified Hamilton-Jacobi-Bellman (HJB) equation. In order to overcome the difficulty that arises from the persistence of excitation (PE) condition, the experience replay technique is utilized to update the critic weights. In this study, a novel control method is proposed, which can effectively eliminate the effects of abrupt faults while achieving optimal control with the minimum cost under a single network architecture. Furthermore, the closed-loop nonlinear system is proved to be uniformly ultimate boundedness based on Lyapunov stability theory. Finally, three examples are presented to verify the validity of the control strategy.
Collapse
Affiliation(s)
- Heng Zhao
- College of Control Science and Engineering, Bohai University, Jinzhou, Liaoning 121013, China.
| | - Huanqing Wang
- College of Mathematical Science, Bohai University, Jinzhou, Liaoning 121013, China
| | - Ben Niu
- School of Information Science and Engineering, Shandong Normal University, Jinan 250014, China
| | - Xudong Zhao
- College of Control Science and Engineering, Bohai University, Jinzhou, Liaoning 121013, China
| | - Khalid H Alharbi
- Communication Systems and Networks Research Group, Department of Electrical and Computer Engineering, Faculty of Engineering, King Abdulaziz University, Jeddah Saudi Arabia
| |
Collapse
|
4
|
Li M, Wang D, Zhao M, Qiao J. Event-triggered constrained neural critic control of nonlinear continuous-time multiplayer nonzero-sum games. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.02.081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/05/2023]
|
5
|
Adaptive dynamic event-triggered control for constrained modular reconfigurable robot. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109624] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
6
|
Xia L, Li Q, Song R, Ge SS. Distributed optimized dynamic event-triggered control for unknown heterogeneous nonlinear MASs with input-constrained. Neural Netw 2022; 154:1-12. [PMID: 35839533 DOI: 10.1016/j.neunet.2022.06.033] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 05/27/2022] [Accepted: 06/25/2022] [Indexed: 11/27/2022]
Abstract
The distributed optimized dynamic event-triggered controller is investigated for completely unknown heterogeneous nonlinear multi-agent systems (MASs) on a directed graph subject to input-constrained. First, the distributed observer is designed to estimate the information of the leader for each follower, and a network of the augmented system is constructed by employing the dynamics of the followers and the observers. An identifier with a compensator is designed to approximate the unknown augmented system (agent) with an arbitrarily small identifier error. Then, consider that the input-constrained optimal controller, along with Hamilton-Jacobi-Bellman (HJB) equation, is under pressure to execute in certain systems associated with bottlenecks such as communication and computing burdens. A critic-actor-based optimized dynamic event-triggered controller, which tunes the parameters of critic-actor neural networks (NNs) by the dynamic triggering mechanism, is leveraged to determine the rule of aperiodic sampling and maintain the desired synchronization service. In addition, the existence of a positive minimum inter-event time (MIET) between consecutive events is also proved. Finally, the applications in non-identical nonlinear MAS and 2-DOF robots illustrate the availability of the proposed theoretical results.
Collapse
Affiliation(s)
- Lina Xia
- School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing 100083, China; Key Laboratory of Knowledge Automation for Industrial Processes, Ministry of Education, Beijing, China; The Department of Electrical and Computer Engineering, National University of Singapore, 117576, Singapore.
| | - Qing Li
- School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing 100083, China; Key Laboratory of Knowledge Automation for Industrial Processes, Ministry of Education, Beijing, China.
| | - Ruizhuo Song
- School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing 100083, China; Beijing Engineering Research Center of Industrial Spectrum Imaging, Beijing, China.
| | - Shuzhi Sam Ge
- The Department of Electrical and Computer Engineering, National University of Singapore, 117576, Singapore.
| |
Collapse
|
7
|
Wang N, Gao Y, Yang C, Zhang X. Reinforcement learning-based finite-time tracking control of an unknown unmanned surface vehicle with input constraints. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2021.04.133] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
8
|
Mani P, Joo YH. Fuzzy-logic-based event-triggered H∞ control for networked systems and its application to wind turbine systems. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2021.11.039] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
9
|
Event-triggered optimal control for discrete-time multi-player non-zero-sum games using parallel control. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2021.10.073] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|