1
|
Yang X, Wang D. Reinforcement Learning for Robust Dynamic Event-Driven Constrained Control. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:6067-6079. [PMID: 38700967 DOI: 10.1109/tnnls.2024.3394251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2024]
Abstract
We consider a robust dynamic event-driven control (EDC) problem of nonlinear systems having both unmatched perturbations and unknown styles of constraints. Specifically, the constraints imposed on the nonlinear systems' input could be symmetric or asymmetric. Initially, to tackle such constraints, we construct a novel nonquadratic cost function for the constrained auxiliary system. Then, we propose a dynamic event-triggering mechanism relied on the time-based variable and the system states simultaneously for cutting down the computational load. Meanwhile, we show that the robust dynamic EDC of original nonlinear-constrained systems could be acquired by solving the event-driven optimal control problem of the constrained auxiliary system. After that, we develop the corresponding event-driven Hamilton-Jacobi-Bellman equation, and then solve it through a unique critic neural network (CNN) in the reinforcement learning framework. To relax the persistence of excitation condition in tuning CNN's weights, we incorporate experience replay into the gradient descent method. With the aid of Lyapunov's approach, we prove that the closed-loop auxiliary system and the weight estimation error are uniformly ultimately bounded stable. Finally, two examples, including a nonlinear plant and the pendulum system, are utilized to validate the theoretical claims.
Collapse
|
2
|
Yue H, Xia J, Zhang J, Park JH, Xie X. Event-based adaptive fixed-time optimal control for saturated fault-tolerant nonlinear multiagent systems via reinforcement learning algorithm. Neural Netw 2025; 183:106952. [PMID: 39626531 DOI: 10.1016/j.neunet.2024.106952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Revised: 08/13/2024] [Accepted: 11/21/2024] [Indexed: 01/22/2025]
Abstract
This article investigates the problem of adaptive fixed-time optimal consensus tracking control for nonlinear multiagent systems (MASs) affected by actuator faults and input saturation. To achieve optimal control, reinforcement learning (RL) algorithm which is implemented based on neural network (NN) is employed. Under the actor-critic structure, an innovative simple positive definite function is constructed to obtain the upper bound of the estimation error of the actor-critic NN updating law, which is crucial for analyzing fixed-time stabilization. Furthermore, auxiliary functions and estimation laws are designed to eliminate the coupling effects resulting from actuator faults and input saturation. Meanwhile, a novel event-triggered mechanism (ETM) that incorporates the consensus tracking errors into the threshold is proposed, thereby effectively conserving communication resources. Based on this, a fixed-time event-triggered control scheme grounded in RL is proposed through the integration of the backstepping technique and fixed-time theory. It is demonstrated that the consensus tracking errors converge to a specified range in a fixed time and all signals within the closed-loop systems are bounded. Finally, simulation results are provided to verify the effectiveness of the proposed control strategy.
Collapse
Affiliation(s)
- Huarong Yue
- School of Mathematics Science, Liaocheng University, Liaocheng 252000, China.
| | - Jianwei Xia
- School of Mathematics Science, Liaocheng University, Liaocheng 252000, China.
| | - Jing Zhang
- School of Mathematics Science, Liaocheng University, Liaocheng 252000, China.
| | - Ju H Park
- Department of Electrical Engineering, Yeungnam University, Kyongsan, 38541, Republic of Korea.
| | - Xiangpeng Xie
- School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing 210023, China.
| |
Collapse
|
3
|
Luo A, Zhou Q, Ma H, Li H. Observer-Based Consensus Control for MASs With Prescribed Constraints via Reinforcement Learning Algorithm. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:17281-17291. [PMID: 37603472 DOI: 10.1109/tnnls.2023.3301538] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/23/2023]
Abstract
In this article, an adaptive optimal consensus control problem is studied for multiagent systems (MASs) with external disturbances, unmeasurable states, and prescribed constraints. First, by using neural networks (NNs), a composite observer is constructed to estimate the unmeasurable states and disturbances simultaneously. Then, the consensus error is guaranteed within a prescribed boundary by presenting an improved prescribed performance control (PPC) technique, and the initial conditions for the error are eliminated. In addition, the updating laws of actor-critic NNs are established by using a simplified reinforcement learning (RL) algorithm based on the uniqueness of optimal solution, and the asymmetric input saturation is resolved by designing auxiliary system instead of using nonquadratic cost functions in other optimal control methods. Finally, the boundedness of all signals in the closed-loop system is proved by using Lyapunov stability theory. The effectiveness of the proposed control method is verified by a simulation example.
Collapse
|
4
|
Yang T, Sun N, Liu Z, Fang Y. Concurrent Learning-Based Adaptive Control of Underactuated Robotic Systems With Guaranteed Transient Performance for Both Actuated and Unactuated Motions. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:18133-18144. [PMID: 37721889 DOI: 10.1109/tnnls.2023.3311927] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/20/2023]
Abstract
With the wide applications of underactuated robotic systems, more complex tasks and higher safety demands are put forward. However, it is still an open issue to utilize "fewer" control inputs to satisfy control accuracy and transient performance with theoretical and practical guarantee, especially for unactuated variables. To this end, for underactuated robotic systems, this article designs an adaptive tracking controller to realize exponential convergence results, rather than only asymptotic stability or boundedness; meanwhile, unactuated states exponentially converge to a small enough bound, which is adjustable by control gains. The maximum motion ranges and convergence speed of all variables both exhibit satisfactory performance with higher safety and efficiency. Here, a data-driven concurrent learning (CL) method is proposed to compensate for unknown dynamics/disturbances and improve the estimate accuracy of parameters/weights, without the need for persistency of excitation or linear parametrization (LP) conditions. Then, a disturbance judgment mechanism is utilized to eliminate the detrimental impacts of external disturbances. As far as we know, for general underactuated systems with uncertainties/disturbances, it is the first time to theoretically and practically ensure transient performance and exponential convergence speed for unactuated states, and simultaneously obtain the exponential tracking result of actuated motions. Both theoretical analysis and hardware experiment results illustrate the effectiveness of the designed controller.
Collapse
|
5
|
Zhang H, Wang A, Ji W, Qiu J, Yan H. Optimal Consensus Control for Continuous-Time Linear Multiagent Systems: A Dynamic Event-Triggered Approach. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:14449-14457. [PMID: 37279126 DOI: 10.1109/tnnls.2023.3279137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
This article investigates the optimal consensus problem for general linear multiagent systems (MASs) via a dynamic event-triggered approach. First, a modified interaction-related cost function is proposed. Second, a dynamic event-triggered approach is developed by constructing a new distributed dynamic triggering function and a new distributed event-triggered consensus protocol. Consequently, the modified interaction-related cost function can be minimized by applying the distributed control laws, which overcomes the difficulty in the optimal consensus problem that seeking the interaction-related cost function needs all agents' information. Then, some sufficient conditions are obtained to guarantee optimality. It is shown that the developed optimal consensus gain matrices are only related to the designed triggering parameters and the desirable modified interaction-related cost function, relaxing the constraint that the controller design requires the knowledge of system dynamics, initial states, and network scale. Meanwhile, the tradeoff between optimal consensus performance and event-triggered behavior is also considered. Finally, a simulation example is provided to verify the validity of the designed distributed event-triggered optimal controller.
Collapse
|
6
|
Yan Y, Zhang H, Zhao W, Li M. Fault Reconstruction Algorithm for Fractional-Order Nonlinear Switching Systems Based on Optimal Fault-Tolerant Control. IEEE TRANSACTIONS ON CYBERNETICS 2024; 54:6158-6168. [PMID: 39159028 DOI: 10.1109/tcyb.2024.3426622] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/21/2024]
Abstract
In this article, a novel fault reconstruction algorithms for fractional-order nonlinear switching systems (FONSSs) with actuator and sensor faults are investigated. First, fractional-order nonlinear system (FONS) with faults, is transformed into two fast and slow subsystems using global differential homogeneous transformation, one of which is unaffected by the fault and the state is partially observable; the other subsystem is affected by the fault but the state is fully observable. After that, it is introduced for the first time that persistent dwell-time (PDT) switching is taken into consideration in the design process of the observer for FONSS, which overcomes the transient problem of the switching moment and ensures the stability of the error dynamics equations of the two fast and slow subsystems. In addition, to eliminate the impact of faults, an optimal adaptive fault-tolerant control strategy based on actor-critic architecture NN are designed to effectively compensate. Finally, the effectiveness of the proposed control strategy is verified by simulation results.
Collapse
|
7
|
Liu L, Cao J, Alsaadi FE. Aperiodically Intermittent Event-Triggered Optimal Average Consensus for Nonlinear Multi-Agent Systems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:10338-10352. [PMID: 37022883 DOI: 10.1109/tnnls.2023.3240427] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
This article is concerned with average consensus of multi-agent systems via intermittent event-triggered strategy. First, a novel intermittent event-triggered condition is designed and the corresponding piecewise differential inequality for the condition is established. Using the established inequality, several criteria on average consensus are obtained. Second, the optimality has been investigated based on average consensus. The optimal intermittent event-triggered strategy in the sense of Nash equilibrium and corresponding local Hamilton-Jacobi-Bellman equation are derived. Third, the adaptive dynamic programming algorithm for the optimal strategy and its neural network implementation with actor-critic architecture are also given. Finally, two numerical examples are presented to show the feasibility and effectiveness of our strategies.
Collapse
|
8
|
Wang W, Li Y, Tong S. Distributed Estimator-Based Event-Triggered Neuro-Adaptive Control for Leader-Follower Consensus of Strict-Feedback Nonlinear Multiagent Systems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:10713-10725. [PMID: 37027774 DOI: 10.1109/tnnls.2023.3243627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
This article investigates the leader-follower consensus problem for strict-feedback nonlinear multiagent systems under a dual-terminal event-triggered mechanism. Compared with the existing event-triggered recursive consensus control design, the primary contribution of this article is the development of a distributed estimator-based event-triggered neuro-adaptive consensus control methodology. In particular, by introducing a dynamic event-triggered communication mechanism without continuous monitoring neighbors' information, a novel distributed event-triggered estimator in chain form is constructed to provide the leader's information to the followers. Subsequently, the distributed estimator is utilized to consensus control via backstepping design. To further decrease information transmission, a neuro-adaptive control and an event-triggered mechanism setting on the control channel are codesigned via the function approximate approach. A theoretical analysis shows that all the closed-loop signals are bounded under the developed control methodology, and the estimation of the tracking error asymptotically converges to zero, i.e., the leader-follower consensus is guaranteed. Finally, simulation studies and comparisons are conducted to verify the effectiveness of the proposed control method.
Collapse
|
9
|
Lu K, Liu Z, Yu H, Chen CLP, Zhang Y. Inverse Optimal Adaptive Neural Control for State-Constrained Nonlinear Systems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:10617-10628. [PMID: 37027622 DOI: 10.1109/tnnls.2023.3243084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Optimizing a performance objective during control operation while also ensuring constraint satisfactions at all times is important in practical applications. Existing works on solving this problem usually require a complicated and time-consuming learning procedure by employing neural networks, and the results are only applicable for simple or time-invariant constraints. In this work, these restrictions are removed by a newly proposed adaptive neural inverse approach. In our approach, a new universal barrier function, which is able to handle various dynamic constraints in a unified manner, is proposed to transform the constrained system into an equivalent one with no constraint. Based on this transformation, a switched-type auxiliary controller and a modified criterion for inverse optimal stabilization are proposed to design an adaptive neural inverse optimal controller. It is proven that optimal performance is achieved with a computationally attractive learning mechanism, and all the constraints are never violated. Besides, improved transient performance is obtained in the sense that the bound of the tracking error could be explicitly designed by users. An illustrative example verifies the proposed methods.
Collapse
|
10
|
Cheng S, Xin B, Wang Q, Chen J, Deng F. Command Filtered Neuroadaptive Fault-Tolerant Control for Nonlinear Systems With Input Saturation and Unknown Control Direction. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:7963-7973. [PMID: 36423316 DOI: 10.1109/tnnls.2022.3222464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
This article studies the tracking control of a class of nonlinear systems with input saturation, subject to nonaffine faults and unknown control direction. A fault-tolerant command filtered control (CFC) method based on adaptive neural networks (NNs) is proposed for this kind of nonlinear system. First, the combination of CFC and error compensation overcomes the "explosion of complexity" issue and alleviates the impact of filter errors. Then, a set of radial basis function NNs is constructed to approximate the unknown nonlinear items containing the nonaffine fault function. Additionally, the issue of unknown control direction in the system is effectively resolved by using Nussbaum gain technology. It is proven that the designed controller can ensure that all signals in the closed-loop system are bounded and convergent, and the upper bound of the absolute value of system tracking error is given. Finally, three comparative simulation results are illustrated to show the effectiveness of the proposed method.
Collapse
|
11
|
Zhou Y, Zhang H, Mu Y, Wang Y. Cooperative Containment Control for Multiagent Systems With Reduced-Order Protocols. IEEE TRANSACTIONS ON CYBERNETICS 2024; 54:3823-3831. [PMID: 37099465 DOI: 10.1109/tcyb.2023.3266888] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
This article addresses the problem of containment control for continuous-time multiagent systems. A containment error is first given to show the coordination between the outputs of leaders and followers. Then, an observer is designed based on the neighbor observable convex hull state. Under the assumption that the designed reduced-order observer is subject to external disturbances, a reduced-order protocol is designed to realize the containment coordination. In order to ensure the designed control protocol can achieve the effect of the main theories, a corresponding Sylvester equation is given with a novel approach which proves that the Sylvester equation is solvable. Finally, a numerical example is given to verify the validity of the main results.
Collapse
|
12
|
Xiao L, Zhang Y, Huang W, Jia L, Gao X. A Dynamic Parameter Noise-Tolerant Zeroing Neural Network for Time-Varying Quaternion Matrix Equation With Applications. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:8205-8214. [PMID: 37015615 DOI: 10.1109/tnnls.2022.3225309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
As a common and significant problem in the field of industrial information, the time-varying quaternion matrix equation (TV-QME) is considered in this article and addressed by an improved zeroing neural network (ZNN) method based on the real representation of the quaternion. In the light of an improved dynamic parameter (IDP) and an innovative activation function (IAF), a dynamic parameter noise-tolerant ZNN (DPNTZNN) model is put forward for solving the TV-QME. The presented IDP with the character of changing with the residual error and the proposed IAF with the remarkable performance can strongly enhance the convergence and robustness of the DPNTZNN model. Therefore, the DPNTZNN model possesses fast predefined-time convergence and superior robustness under different noise environments, which are theoretically analyzed in detail. Besides, the provided simulative experiments verify the advantages of the DPNTZNN model for solving the TV-QME, especially compared with other ZNN models. Finally, the DPNTZNN model is applied to image restoration, which further illustrates the practicality of the DPNTZNN model.
Collapse
|
13
|
Zhang JX, Yang T, Chai T. Neural Network Control of Underactuated Surface Vehicles With Prescribed Trajectory Tracking Performance. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:8026-8039. [PMID: 37015439 DOI: 10.1109/tnnls.2022.3223666] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
This article is concerned with the fast and accurate trajectory tracking control problem for a sort of underactuated surface vehicle under model uncertainties and environmental disturbances. A novel neural networks (NNs)-based prescribed performance control strategy is proposed to solve the problem. In the control design, a new type of performance function is constructed which provides a way to predefine the settling time and accuracy, straightforward. Then, a pair of barrier functions are employed to combat not only the position error but also the virtual control input. This evades the possible singularity or discontinuity of the control solution. Next, an initialization technique is exploited, removing the requirement for the initial condition of the control system. Finally, two NNs are employed to deal with the unknown ship nonlinearities. The performance analysis not only demonstrates the effectiveness of the proposed approach but also reveals its robustness against disturbances and unknown reference trajectory derivatives. There is, thus, no need to acquire such knowledge or employ specialized tools to handle disturbances. The theoretical findings are illustrated by a simulation study.
Collapse
|
14
|
Chu L, Li J, Guo Z, Jiang Z, Li S, Du W, Wang Y, Guo C. RBS and ABS Coordinated Control Strategy Based on Explicit Model Predictive Control. SENSORS (BASEL, SWITZERLAND) 2024; 24:3076. [PMID: 38793935 PMCID: PMC11124922 DOI: 10.3390/s24103076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/06/2024] [Revised: 05/04/2024] [Accepted: 05/09/2024] [Indexed: 05/26/2024]
Abstract
During the braking process of electric vehicles, both the regenerative braking system (RBS) and anti-lock braking system (ABS) modulate the hydraulic braking force, leading to control conflict that impacts the effectiveness and real-time capability of coordinated control. Aiming to enhance the coordinated control effectiveness of RBS and ABS within the electro-hydraulic composite braking system, this paper proposes a coordinated control strategy based on explicit model predictive control (eMPC-CCS). Initially, a comprehensive braking control framework is established, combining offline adaptive control law generation, online optimized control law application, and state compensation to effectively coordinate braking force through the electro-hydraulic system. During offline processing, eMPC generates a real-time-oriented state feedback control law based on real-world micro trip segments, improving the adaptiveness of the braking strategy across different driving conditions. In the online implementation, the developed three-dimensional eMPC control laws, corresponding to current driving conditions, are invoked, thereby enhancing the potential for real-time braking strategy implementation. Moreover, the state error compensator is integrated into eMPC-CCS, yielding a state gain matrix that optimizes the vehicle braking status and ensures robustness across diverse braking conditions. Lastly, simulation evaluation and hardware-in-the-loop (HIL) testing manifest that the proposed eMPC-CCS effectively coordinates the regenerative and hydraulic braking systems, outperforming other CCSs in terms of braking energy recovery and real-time capability.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Chong Guo
- State Key Laboratory of Automotive Simulation and Control, Jilin University, Changchun 130022, China; (L.C.); (J.L.); (Z.G.); (Z.J.); (S.L.); (W.D.); (Y.W.)
| |
Collapse
|
15
|
Shao S, Chen M, Zheng S, Lu S, Zhao Q. Event-Triggered Fractional-Order Tracking Control for an Uncertain Nonlinear System With Output Saturation and Disturbances. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:5857-5869. [PMID: 36331647 DOI: 10.1109/tnnls.2022.3212281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
In this article, an event-triggered (ET) fractional-order adaptive tracking control scheme (ATCS) is studied for the uncertain nonlinear system with the output saturation and the external disturbances by using the nonlinear disturbance observer (NDO) and the neural networks (NNs). Based on NNs, the system uncertainties are approximated. An NN-based NDO is designed to estimate the bounded disturbances. Combining the NNs, the output of the designed NDO, the fractional-order theory, and the ET mechanism, an ATCS is proposed under the output saturation. According to the stability analysis, all the closed-loop signals are semiglobally uniformly ultimately bounded based on the investigative ATCS. The simulation results and the comparative experiment verifications are shown to indicate the viability of the developed control scheme.
Collapse
|
16
|
Yuan Y, Xu X, Yang C, Luo B, Dubljevic S. Concurrent Learning Robust Adaptive Fault Tolerant Boundary Regulation of Hyperbolic Distributed Parameter Systems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:6286-6300. [PMID: 36449581 DOI: 10.1109/tnnls.2022.3224245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
This article develops a robust adaptive boundary output regulation approach for a class of complex anticollocated hyperbolic partial differential equations subjected to multiplicative unknown faults in both the boundary sensor and actuator. The regulator design is based on the internal model principle, which amounts to stabilize a coupled cascade system, which consists of a finite-dimensional internal model driven by a hyperbolic distributed parameter system (DPS). To this end, a systematic sliding mode equipped with a backstepping approach is developed such that the robust state feedback control can be realized. Moreover, since the available information is a faulty boundary measurement at the right side point, state estimation is required. However, due to the presence of boundary unknown faults, we need to solve an issue of joint fault-state estimation. Restrictive persistent excitation conditions are usually required to guarantee the exact estimation of faults but are unrealistic in practice. To this end, a novel concurrent learning (CL) adaptive observer is proposed so that exponential convergence is obtained. It is the first time that the spirit of CL is introduced to the field of DPSs. Consequently, the observer-based adaptive boundary fault tolerant control scheme is developed, and rigorous theoretical analysis is given such that the exponential output regulation can be achieved. Finally, the effectiveness of the proposed methodology is demonstrated via comparative simulations.
Collapse
|
17
|
Ma H, Ren H, Zhou Q, Li H, Wang Z. Observer-Based Neural Control of N-Link Flexible-Joint Robots. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:5295-5305. [PMID: 36107896 DOI: 10.1109/tnnls.2022.3203074] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
This article concentrates on the adaptive neural control approach of n -link flexible-joint electrically driven robots. The presented control method only needs to know the position and armature current information of the flexible-joint manipulator. An adaptive observer is designed to estimate the velocities of links and motors, and radial basis function neural networks are applied to approximate the unknown nonlinearities. Based on the backstepping technique and the Lyapunov stability theory, the observer-based neural control issue is addressed by relying on uplink-event-triggered states only. It is demonstrated that all signals are semi-globally ultimately uniformly bounded and the tracking errors can converge to a small neighborhood of zero. Finally, simulation results are shown to validate the designed event-triggered control strategy.
Collapse
|
18
|
Qiao J, Li M, Wang D. Asymmetric Constrained Optimal Tracking Control With Critic Learning of Nonlinear Multiplayer Zero-Sum Games. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:5671-5683. [PMID: 36191112 DOI: 10.1109/tnnls.2022.3208611] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
By utilizing a neural-network-based adaptive critic mechanism, the optimal tracking control problem is investigated for nonlinear continuous-time (CT) multiplayer zero-sum games (ZSGs) with asymmetric constraints. Initially, we build an augmented system with the tracking error system and the reference system. Moreover, a novel nonquadratic function is introduced to address asymmetric constraints. Then, we derive the tracking Hamilton-Jacobi-Isaacs (HJI) equation of the constrained nonlinear multiplayer ZSG. However, it is extremely hard to get the analytical solution to the HJI equation. Hence, an adaptive critic mechanism based on neural networks is established to estimate the optimal cost function, so as to obtain the near-optimal control policy set and the near worst disturbance policy set. In the process of neural critic learning, we only utilize one critic neural network and develop a new weight updating rule. After that, by using the Lyapunov approach, the uniform ultimate boundedness stability of the tracking error in the augmented system and the weight estimation error of the critic network is verified. Finally, two simulation examples are provided to demonstrate the efficacy of the established mechanism.
Collapse
|
19
|
Luo A, Zhou Q, Ren H, Ma H, Lu R. Reinforcement learning-based consensus control for MASs with intermittent constraints. Neural Netw 2024; 172:106105. [PMID: 38232428 DOI: 10.1016/j.neunet.2024.106105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 11/01/2023] [Accepted: 01/04/2024] [Indexed: 01/19/2024]
Abstract
In this article, an adaptive optimal consensus control problem is studied for multiagent systems in the strict-feedback structure with intermittent constraints (the constraints appear intermittently). More specifically, by designing a novel switch-like function and an improved coordinate transformation, the constrained states are converted into unconstrained states, and the problem of intermittent constraints is resolved without requiring "feasibility conditions". In addition, using the composite learning algorithm and neural networks to construct the identifier, a simplified identifier-actor-critic-based reinforcement learning strategy is proposed to obtain the approximate optimal controller under the framework of backstepping. Meanwhile, with the aid of the nonlinear dynamic surface control technique, the issue of "explosion of complexity" in backstepping is removed, and the requirements for filter parameters are loosened. Based on Lyapunov stability theory, it is demonstrated that all signals in the closed-loop system are bounded. Finally, two simulation examples are used to verify the effectiveness of the proposed method.
Collapse
Affiliation(s)
- Ao Luo
- School of Automation, Guangdong Provincial Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, 510000, Guangdong, China
| | - Qi Zhou
- School of Automation, Guangdong Provincial Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, 510000, Guangdong, China.
| | - Hongru Ren
- School of Automation, Guangdong Provincial Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, 510000, Guangdong, China
| | - Hui Ma
- School of Mathematics and Statistics, Guangdong University of Technology, Guangzhou, 510000, Guangdong, China
| | - Renquan Lu
- School of Automation, Guangdong Provincial Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou, 510000, Guangdong, China
| |
Collapse
|
20
|
Liu J, Sun J, Zhang H, Xu S, Zou Z. N-Level Hierarchy-Based Optimal Control to Develop Therapeutic Strategies for Ecological Evolutionary Dynamics Systems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:3953-3963. [PMID: 36083959 DOI: 10.1109/tnnls.2022.3201517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
This article mainly proposes an evolutionary algorithm and its first application to develop therapeutic strategies for ecological evolutionary dynamics systems (EEDS), obtaining the balance between tumor cells and immune cells by rationally arranging chemotherapeutic drugs and immune drugs. First, an EEDS nonlinear kinetic model is constructed to describe the relationship between tumor cells, immune cells, dose, and drug concentration. Second, the N-level hierarchy optimization (NLHO) algorithm is designed and compared with five algorithms on 20 benchmark functions, which proves the feasibility and effectiveness of NLHO. Finally, we apply NLHO into EEDS to give a dynamic adaptive optimal control policy and develop therapeutic strategies to reduce tumor cells, while minimizing the harm of chemotherapy drugs and immune drugs to the human body. The experimental results prove the validity of the research method.
Collapse
|
21
|
Zhang J, Liu S, Zhang X, Xia J. Event-Triggered-Based Distributed Consensus Tracking for Nonlinear Multiagent Systems With Quantization. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:1501-1511. [PMID: 35737607 DOI: 10.1109/tnnls.2022.3183639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
In this article, an observer-based adaptive neural network (NN) event-triggered distributed consensus tracking problem is investigated for nonlinear multiagent systems with quantization. In the first place, the limited capacity of the communication channel between agents is considered. The event-trigger mechanism and dynamic uniform quantizers are set up to reduce information transmission. The next NN is utilized to handle the unknown nonlinear functions. Finally, in order to estimate the unmeasurable states, an NN-based state observer is designed for each agent by using a dynamic gain function. To settle the difficulty caused by the coupling effects of event-triggered conditions and the scaling function in dynamic uniform quantizers and observers, a distributed control protocol with estimated information of its neighbors is designed, which ensures distributed consensus tracking of the nonlinear multiagent systems without incurring the Zeno behavior. The effectiveness of the control protocol is illustrated by a simulation example.
Collapse
|
22
|
Song R, Yang G, Lewis FL. Nearly Optimal Control for Mixed Zero-Sum Game Based on Off-Policy Integral Reinforcement Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2024; 35:2793-2804. [PMID: 35877793 DOI: 10.1109/tnnls.2022.3191847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
In this article, we solve a class of mixed zero-sum game with unknown dynamic information of nonlinear system. A policy iterative algorithm that adopts integral reinforcement learning (IRL), which does not depend on system information, is proposed to obtain the optimal control of competitor and collaborators. An adaptive update law that combines critic-actor structure with experience replay is proposed. The actor function not only approximates optimal control of every player but also estimates auxiliary control, which does not participate in the actual control process and only exists in theory. The parameters of the actor-critic structure are simultaneously updated. Then, it is proven that the parameter errors of the polynomial approximation are uniformly ultimately bounded. Finally, the effectiveness of the proposed algorithm is verified by two given simulations.
Collapse
|
23
|
Sun J, Ming Z. Cooperative Differential Game-Based Distributed Optimal Synchronization Control of Heterogeneous Nonlinear Multiagent Systems. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:7933-7942. [PMID: 37022861 DOI: 10.1109/tcyb.2023.3240983] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
This article presents an online off-policy policy iteration (PI) algorithm using reinforcement learning (RL) to optimize the distributed synchronization problem for nonlinear multiagent systems (MASs). First, considering that not every follower can directly obtain the leader's information, a novel adaptive model-free observer based on neural networks (NNs) is designed. Moreover the feasibility of the observer is strictly proved. Subsequently, combined with the observer and follower dynamics, an augmented system and a distributed cooperative performance index with discount factors are established. On this basis, the optimal distributed cooperative synchronization problem changes into solving the numerical solution of the Hamilton-Jacobian-Bellman (HJB) equation. Finally, an online off-policy algorithm is proposed, which can be used to optimize the distributed synchronization problem of the MASs in real time based on measured data. In order to prove the stability and convergence of the online off-policy algorithm more conveniently, an offline on-policy algorithm whose stability and convergence are proved is given before the online off-policy algorithm is proposed. We give a novel mathematical analysis method for establishing the stability of the algorithm. The effectiveness of the theory is verified by simulation results.
Collapse
|
24
|
Boulkroune A, Haddad M, Li H. Adaptive fuzzy control design for nonlinear systems with actuation and state constraints: An approach with no feasibility condition. ISA TRANSACTIONS 2023; 142:1-11. [PMID: 37604741 DOI: 10.1016/j.isatra.2023.07.040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 07/28/2023] [Accepted: 07/28/2023] [Indexed: 08/23/2023]
Abstract
In this article, an adaptive fuzzy tracking control scheme is developed for a class of pure-feedback uncertain nonlinear systems in the presence of time-varying full-state constraints (TFSCs), actuators' nonlinearities and external disturbances. Fuzzy logic systems (FLSs) are employed as universal approximators to online estimate unknown nonlinear functions A barrier Lyapunov function (BLF) is used to deal with the state constraint problem. In contrast to numerous adjacent studies, this research diligently tackles the open problem relating to the virtual control laws (VCLs) feasibility in the BLF-based backstepping control design. The resolution to this problem involves formulating VCLs with predefined bounds. The utilization of disturbance observers within a backstepping framework allows for effective compensation of estimation errors arising from the implementation of a predefined bounded VCL. This approach also helps prevent the occurrence of the "complexity explosion", making it a practical solution. The control strategy being proposed guarantees that the output tracking error will effectively approach a small region near the origin. Additionally, all signals of the closed-loop system will remain uniformly ultimately bounded (UUB), and there will be adherence to all state-constraints, ensuring no violations occur. Ultimately, an illustrative simulation example is provided to demonstrate the efficacy of the theoretical findings.
Collapse
Affiliation(s)
| | - Mohammed Haddad
- University of Jijel, LAJ, BP. 98, Ouled-Aissa, 18000 Jijel, Algeria; Research Centre in Industrial Technologies (CRTI), P.O. Box 64, Cheraga, 16014 Algiers, Algeria.
| | - Hongyi Li
- School of Automation and the Guangdong Province Key Laboratory of Intelligent Decision and Cooperative Control, Guangdong University of Technology, Guangzhou 510006, China.
| |
Collapse
|
25
|
Truong HVA, Nguyen MH, Tran DT, Ahn KK. A novel adaptive neural network-based time-delayed estimation control for nonlinear systems subject to disturbances and unknown dynamics. ISA TRANSACTIONS 2023; 142:214-227. [PMID: 37543485 DOI: 10.1016/j.isatra.2023.07.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Revised: 07/09/2023] [Accepted: 07/21/2023] [Indexed: 08/07/2023]
Abstract
This paper presents an adaptive backstepping-based model-free control (BSMFC) for general high-order nonlinear systems (HNSs) subject to disturbances and unstructured uncertainties to enhance the system tracking performance. The proposed methodology is constructed based on the backstepping control (BSC) with radial basis function neural network (RBFNN) -based time-delayed estimation (TDE) to overcome the obstacle of unknown system dynamics. Additionally, a command-filtered (CF) approach is involved to address the complexity explosion of the BSC design. As the errors arising from approximation, new control laws are established to reduce the effects in this regard. The stability of the closed-loop system is guaranteed through the Lyapunov theorem and the superiority of the proposed methodology is confirmed through a comparative simulation with other model-free approaches.
Collapse
Affiliation(s)
- Hoai Vu Anh Truong
- Department of Mechanical Engineering, Pohang University of Science and Technology, Gyeongbuk 37673, South Korea.
| | - Manh Hung Nguyen
- School of Mechanical Engineering, University of Ulsan, Ulsan, 44610, South Korea.
| | - Duc Thien Tran
- Automatic Control Department, Ho Chi Minh city University of Technology and Education, Ho Chi Minh city 700000, Viet Nam.
| | - Kyoung Kwan Ahn
- School of Mechanical Engineering, University of Ulsan, Ulsan, 44610, South Korea.
| |
Collapse
|
26
|
Wu W, Tong S. Observer-Based Fixed-Time Adaptive Fuzzy Consensus DSC for Nonlinear Multiagent Systems. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:5881-5891. [PMID: 36170390 DOI: 10.1109/tcyb.2022.3204806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
This article studies the output-feedback fixed-time fuzzy consensus control problem for nonlinear multiagent systems (MASs) under the directed communication topologies. Since the controlled systems contain the unmeasurable states and unknown dynamics, the unmeasurable states are reconstructed via linear state observers, and fuzzy logic systems are utilized to identify the unknown internal dynamics. By constructing the integral type Lyapunov function, a fixed-time adaptive fuzzy consensus control scheme is developed by introducing the nonlinear filter technique into the backstepping recursive technique adaptive control algorithm. The presented consensus control method can not only guarantee the controlled system is semi-global practical fixed-time stable (SGPFTS), but also avoid the singular problem in existing backstepping recursive control design methods. Finally, an application of unmanned surface vehicles is provided to verify the effectiveness of the presented fixed-time fuzzy consensus control method.
Collapse
|
27
|
Zong G, Xu Q, Zhao X, Su SF, Song L. Output-Feedback Adaptive Neural Network Control for Uncertain Nonsmooth Nonlinear Systems With Input Deadzone and Saturation. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:5957-5969. [PMID: 36417717 DOI: 10.1109/tcyb.2022.3222351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Nonsmooth nonlinear systems can model many practical processes with discontinuous property and are difficult to be stabilized by classical control methods like smooth nonlinear systems. This article considers the output-feedback adaptive neural network (NN) control problem for nonsmooth nonlinear systems with input deadzone and saturation. First, the nonsmooth input deadzone and saturation is converted to a smooth function of affine form with bounded estimation error by means of the mean-value theorem. Second, with the help of approximation theorem and Filippov's differential inclusion theory, the given nonsmooth system is converted to an equivalent smooth system model. Then, by introducing a proper logarithmic barrier Lyapunov function (BLF), an output-feedback adaptive NN strategy is set up by constructing an appropriate observer and adopting the adaptive backstepping technique. A new stability criterion is established to guarantee that all the signals in the closed-loop system are semiglobally uniformly ultimately bounded (SGUUB). Finally, comparative simulations through Chua's oscillator are offered to verify the effectiveness of the proposed control algorithm.
Collapse
|
28
|
Fan QY, Jiang H, Song X, Xu B. Composite robust control of uncertain nonlinear systems with unmatched disturbances using policy iteration. ISA TRANSACTIONS 2023; 138:432-441. [PMID: 37019705 DOI: 10.1016/j.isatra.2023.03.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/28/2021] [Revised: 12/31/2022] [Accepted: 03/18/2023] [Indexed: 06/16/2023]
Abstract
In this paper, the composite robust control problem of uncertain nonlinear systems with unmatched disturbances is investigated. In order to improve the robust control performance, the integral sliding mode control method is considered together with H∞ control for nonlinear systems. By designing a disturbance observer with a new structure, the estimations of disturbances can be obtained with small errors, which are used to construct sliding mode control policy and avoid high gains. On the basis of ensuring the accessibility of specified sliding surface, the guaranteed cost control problem of nonlinear sliding mode dynamics is considered. To overcome the difficulty of robust control design caused by nonlinear characteristics, a modified policy iteration method based on sum of squares is proposed to solve the H∞ control policy of the nonlinear sliding mode dynamics. Finally, the effectiveness of the proposed robust control method is verified by simulation tests.
Collapse
Affiliation(s)
- Quan-Yong Fan
- Northwestern Polytechnical University, 127 West Youyi Road, Beilin District, Xi'an, 710072, Shanxi, China.
| | - Hongru Jiang
- Northwestern Polytechnical University, 127 West Youyi Road, Beilin District, Xi'an, 710072, Shanxi, China.
| | - Xuekui Song
- Ansteel Engineering Technology Corporation Limited, 1 Huangang Road, Tiexi District, Anshan, 114021, Liaoning, China.
| | - Bin Xu
- Northwestern Polytechnical University, 127 West Youyi Road, Beilin District, Xi'an, 710072, Shanxi, China.
| |
Collapse
|
29
|
Li M, Wang D, Zhao M, Qiao J. Event-triggered constrained neural critic control of nonlinear continuous-time multiplayer nonzero-sum games. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.02.081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/05/2023]
|
30
|
Zhang Y, Wang G, Sun J, Li H, He W. Distributed Observer-Based Adaptive Fuzzy Consensus of Nonlinear Multiagent Systems Under DoS Attacks and Output Disturbance. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:1994-2004. [PMID: 36149992 DOI: 10.1109/tcyb.2022.3200403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
This article studies the adaptive output-feedback consensus control problem of nonlinear multiagent systems (MASs) against denial-of-service (DoS) attacks. The attacks on the edges instead of nodes are considered, where we allow different attack intensities but at least one edge is connected in each attacking interval. Affected by output disturbance, the sensor feedback signal of every agent is inaccurate, which will reduce the approximation accuracy of the observer. Then, we design a signal to revise the sensor feedback signal subject to disturbance. Meanwhile, a prescribed performance function is used to ensure the transient and steady-state performance of error. Leveraging the Lyapunov stability theory and the backstepping technique, a distributed output-feedback control scheme subject to asymmetric saturation nonlinearity is designed. For the asymmetric input saturation, an auxiliary signal is designed to simplify the designed progress of controller input. To deal with the inherent problem of "explosion of complexity" emerging with backstepping, dynamic surface control is utilized. It is proved that the consensus errors converge to small neighborhoods of the origin, and all signals within the closed-loop system are bounded. Finally, simulation results are offered to demonstrate the effectiveness of the proposed method.
Collapse
|
31
|
Interval type-2 fuzzy neural network-based adaptive compensation control for omni-directional mobile robot. Neural Comput Appl 2023. [DOI: 10.1007/s00521-023-08309-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
|
32
|
Zhao S, Wang J, Xu H, Wang B. Composite Observer-Based Optimal Attitude-Tracking Control With Reinforcement Learning for Hypersonic Vehicles. IEEE TRANSACTIONS ON CYBERNETICS 2023; 53:913-926. [PMID: 35969557 DOI: 10.1109/tcyb.2022.3192871] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
This article proposes an observer-based reinforcement learning (RL) control approach to address the optimal attitude-tracking problem and application for hypersonic vehicles in the reentry phase. Due to the unknown uncertainty and nonlinearity caused by parameter perturbation and external disturbance, accurate model information of hypersonic vehicles in the reentry phase is generally unavailable. For this reason, a novel synchronous estimation is proposed to construct a composite observer for hypersonic vehicles, which consists of a neural-network (NN)-based Luenberger-type observer and a synchronous disturbance observer. This solves the identification problem of nonlinear dynamics in the reference control and realizes the estimation of the system state when unknown nonlinear dynamics and unknown disturbance exist at the same time. By synthesizing the information from the composite observer, an RL tracking controller is developed to solve the optimal attitude-tracking control problem. To improve the convergence performance of critic network weights, concurrent learning is employed to replace the traditional persistent excitation condition with a historical experience replay manner. In addition, this article proves that the weight estimation error is bounded when the learning rate satisfies the given sufficient condition. Finally, the numerical simulation demonstrates the effectiveness and superiority of the proposed approaches to attitude-tracking control systems for hypersonic vehicles.
Collapse
|
33
|
Liu B, Yu D, Zeng X, Dong D, He X, Li X. Practical discontinuous tracking control for a permanent magnet synchronous motor. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:3793-3810. [PMID: 36899605 DOI: 10.3934/mbe.2023178] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
In this paper, the practical discontinuous control algorithm is used in the tracking controller design for a permanent magnet synchronous motor (PMSM). Although the theory of discontinuous control has been studied intensely, it is seldom applied to the actual systems, which encourages us to spread the discontinuous control algorithm to motor control. Due to the constraints of physical conditions, the input of the system is limited. Hence, we design the practical discontinuous control algorithm for PMSM with input saturation. To achieve the tracking control of PMSM, we define the error variables of the tracking control, and the sliding mode control method is introduced to complete the design of the discontinuous controller. Based on the Lyapunov stability theory, the error variables are guaranteed to converge to zero asymptotically, and the tracking control of the system is realized. Finally, the validity of the proposed control method is verified by a simulation example and the experimental platform.
Collapse
Affiliation(s)
- Bin Liu
- School of Mechanical Engineering, Northwestern Polytechnical University, Xi'an 710072, China
| | - Dengxiu Yu
- Unmanned System Research Institute, Northwestern Polytechnical University, Xi'an 710072, China
| | - Xing Zeng
- School of Mechanical Engineering, Northwestern Polytechnical University, Xi'an 710072, China
| | - Dianbiao Dong
- School of Mechanical Engineering, Northwestern Polytechnical University, Xi'an 710072, China
| | - Xinyi He
- School of Mathematics and Statistics, Shandong Normal University, Ji'nan 250014, China
| | - Xiaodi Li
- School of Mathematics and Statistics, Shandong Normal University, Ji'nan 250014, China
- Department of Automation, Tsinghua University, Beijing 100084, China
| |
Collapse
|
34
|
Wang D, Ren J, Ha M. Discounted linear Q-learning control with novel tracking cost and its stability. Inf Sci (N Y) 2023. [DOI: 10.1016/j.ins.2023.01.030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
|
35
|
Huang Z, Bai W, Li T, Long Y, Chen CP, Liang H, Yang H. Adaptive Reinforcement Learning Optimal Tracking Control for Strict-Feedback Nonlinear Systems with Prescribed Performance. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.11.109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
36
|
Composite adaptive fuzzy backstepping control of uncertain fractional-order nonlinear systems with quantized input. INT J MACH LEARN CYB 2022. [DOI: 10.1007/s13042-022-01666-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
|
37
|
Wei SY, Li YX. Finite-time adaptive neural network command filtered controller design for nonlinear system with time-varying full-state constraints and input quantization. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.08.114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
38
|
Pei X, Li K, Li Y. A survey of adaptive optimal control theory. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2022; 19:12058-12072. [PMID: 36653986 DOI: 10.3934/mbe.2022561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
This paper makes a survey about the recent development of optimal control based on adaptive dynamic programming (ADP). First of all, based on DP algorithm and reinforcement learning (RL) algorithm, the origin and development of the optimization idea and its application in the control field are introduced. The second part introduces achievements in the optimal control direction, then we classify and summarize the research results of optimization method, constraint problem, structure design in control algorithm and practical engineering process based on optimal control. Finally, the possible future research topics are discussed. Through a comprehensive and complete investigation of its application in many existing fields, this survey fully demonstrates that the optimal control algorithms via ADP with critic-actor neural network (NN) structure, which also have a broad application prospect, and some developed optimal control design algorithms have been applied to practical engineering fields.
Collapse
Affiliation(s)
- Xiaoxuan Pei
- College of Science, Liaoning University of Technology, Jinzhou 121001, China
| | - Kewen Li
- College of Science, Liaoning University of Technology, Jinzhou 121001, China
| | - Yongming Li
- College of Science, Liaoning University of Technology, Jinzhou 121001, China
| |
Collapse
|
39
|
Finite-Time Disturbance Observer of Nonlinear Systems. Symmetry (Basel) 2022. [DOI: 10.3390/sym14081704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
In practical applications, for highly nonlinear systems, how to implement control tasks for dynamic systems with uncertain parameters is still a hot research issue. Aiming at the internal parameter fluctuations and external unknown disturbances in nonlinear system, this paper proposes an adaptive dynamic terminal sliding mode control (ADTSMC) based on a finite-time disturbance observer (FTDO) for nonlinear systems. A finite-time disturbance observer is designed to compensate for the unknown uncertainties and a dynamic terminal sliding mode control (DTSMC) method is developed to achieve finite time convergence and weaken system chattering. Moreover, a dual hidden layer recurrent neural network (DHLRNN) estimator is proposed to approximate the sliding mode gain, so that the switching item gain is not overestimated and optimal value is obtained. Finally, simulation experiments of an active power filter model verify the designed ADTSMC method has better steady-state and dynamic-steady compensation effects with at least 1% THD reduction in the presence of nonlinear load and disturbances compared with the simple adaptive DTSMC law.
Collapse
|
40
|
Wen G, Niu B. Optimized tracking control based on reinforcement learning for a class of high-order unknown nonlinear dynamic systems. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.05.048] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
41
|
Zhou S, Sui S, Tong S. Adaptive Neural Networks Optimal Control of Permanent Magnet Synchronous Motor System with State Constraints. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.06.114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
42
|
Li D, Han HG, Qiao JF. Adaptive NN Controller of Nonlinear State-Dependent Constrained Systems With Unknown Control Direction. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; PP:913-922. [PMID: 35675237 DOI: 10.1109/tnnls.2022.3177839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Various constraints commonly exist in most physical systems; however, traditional constraint control methods consider the constraint boundaries only relying on constant or time variable, which greatly restricts applying constraint control to practical systems. To avoid such conservatism, this study develops a new adaptive neural controller for the nonlinear strict-feedback systems subject to state-dependent constraint boundaries. The nonlinear state-dependent mapping is employed in each step of backstepping procedure, and the prescribed transient performance on tracking error and the constraints on system states are ensured without repeatedly verifying the feasibility conditions on virtual controllers. The radial basis function neural network (NN) with less parameters approach is introduced as an identifier to estimate the unknown system dynamics and reduce computation burden. For removing the effect of unknown control direction, the Nussbaum gain technique is integrated into controller design. Based on the Lyapunov analysis, the developed control strategy can ensure that all the closed-loop signals are bounded, and the constraints on full system states and tracking error are achieved. The simulation examples are used to illustrate the effectiveness of the developed control strategy.
Collapse
|
43
|
Yang X, Xu M, Wei Q. Dynamic Event-Sampled Control of Interconnected Nonlinear Systems Using Reinforcement Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; PP:923-937. [PMID: 35666792 DOI: 10.1109/tnnls.2022.3178017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
We develop a decentralized dynamic event-based control strategy for nonlinear systems subject to matched interconnections. To begin with, we introduce a dynamic event-based sampling mechanism, which relies on the system's states and the variables generated by time-based differential equations. Then, we prove that the decentralized event-based controller for the whole system is composed of all the optimal event-based control policies of nominal subsystems. To derive these optimal event-based control policies, we design a critic-only architecture to solve the related event-based Hamilton-Jacobi-Bellman equations in the reinforcement learning framework. The implementation of such an architecture uses only critic neural networks (NNs) with their weight vectors being updated through the gradient descent method together with concurrent learning. After that, we demonstrate that the asymptotic stability of closed-loop nominal subsystems and the uniformly ultimate boundedness stability of critic NNs' weight estimation errors are guaranteed by using Lyapunov's approach. Finally, we provide simulations of a matched nonlinear-interconnected plant to validate the present theoretical claims.
Collapse
|
44
|
Ma J, Wang H, Qiao J. Adaptive Neural Fixed-Time Tracking Control for High-Order Nonlinear Systems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2022; PP:708-717. [PMID: 35666791 DOI: 10.1109/tnnls.2022.3176625] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
The problem of adaptive neural fixed-time tracking control for high-order systems is addressed in this article. In order to handle the difficulties from the uncertain nonlinearities within the original systems, the radial basis function neural networks (RBF NNs) are introduced to approximate the unknown nonlinear functions, and the adding a power integrator is applied to overcome the obstacle from high-order terms. It is proven that all signals in the closed-loop system are bounded and the output signal can eventually converge to a small neighborhood of the reference signal. Simulation results further verify the approaches developed.
Collapse
|
45
|
Adaptive neural network decentralized fault-tolerant control for nonlinear interconnected fractional-order systems. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.02.078] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
46
|
Liu Y, Zhu Q, Liu Z. Event-based adaptive neural network asymptotic control design for nonstrict feedback nonlinear system with state constraints. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07247-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
47
|
Adaptive NN Control of Electro-Hydraulic System with Full State Constraints. ELECTRONICS 2022. [DOI: 10.3390/electronics11091483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
This paper presents an adaptive neural network (NN) control approach for an electro-hydraulic system. The friction and internal leakage are nonlinear uncertainties, and the states in the considered electro-hydraulic system are fully constrained. In the control design, the NNs are utilized to approximate the nonlinear uncertainties. Then, by constructing barrier Lyapunov functions and based on the adaptive backstepping control design technique, a novel adaptive NN control scheme is formulated. It has been proven that the developed adaptive NN control scheme can sustain the controlled electro-hydraulic system to be stable and make the system output track the desired reference signal. Furthermore, the system states do not surpass the given bounds. The computer simulation results verify the effectiveness of the proposed controller.
Collapse
|
48
|
Xue G, Lin F, Liu H, Li S. Composite learning sliding mode control of uncertain nonlinear systems with prescribed performance. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2022. [DOI: 10.3233/jifs-211310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
This paper explores the prescribed performance tracking control problem of nonlinear systems with triangular structure. To obtain the desired transient performance and precise estimations of uncertain terms, the techniques of neural network control, sliding mode control and composite learning control are incorporated into the proposed control method. The presented control strategy can ensure the tracking error converges to a prescribed small residual set. Compared with the persistent excitation condition required in the conventional adaptive control, the interval excitation condition needed in the proposed control approach is weak, which guarantees that the radial basis function neural networks approximate the unknown nonlinear terms more accurately. Finally, two simulation examples are exploited to manifest the effectiveness of the proposed approach.
Collapse
Affiliation(s)
- Guangming Xue
- School of Mathematics and Statistics, Shaanxi Normal University, Xi’an, China
- School of Information and Statistics, Guangxi University of Finance and Economics, Nanning, China
| | - Funing Lin
- School of Information and Statistics, Guangxi University of Finance and Economics, Nanning, China
| | - Heng Liu
- School of Mathematics and Physics, Guangxi University for Nationalities, Nanning, China
| | - Shenggang Li
- School of Mathematics and Statistics, Shaanxi Normal University, Xi’an, China
| |
Collapse
|
49
|
Luo Y, Yu X, Yang D, Zhou B. A survey of intelligent transmission line inspection based on unmanned aerial vehicle. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10189-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
|
50
|
Adaptive Neural Partial State Tracking Control for Full-State-Constrained Uncertain Singularly Perturbed Nonlinear Systems and Its Applications to Electric Circuit. ELECTRONICS 2022. [DOI: 10.3390/electronics11081209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
This paper is concerned with the adaptive neural network (NN) partial tracking control problem for a class of completely unknown multi-input multi-output (MIMO) singularly perturbed nonlinear systems possessing time-varying asymmetric state constraints. To satisfy the constraints, we utilize the state-depended transformation technique to convert the original state-constrained system to an equivalent unconstrained one, then the state constraint problem can be solved by ensuring its stability. Partial state tracking can be achieved without the violation of state constraints. The adaptive tracking controllers are designed by using singular perturbation theory and the adaptive control method, in which NNs are used to approximate unknown nonlinear functions. The ill-conditioned numerical problems lurking in the controller design process are averted and the closed-loop system stability can be guaranteed by introducing an appropriate Lyapunov function with singular perturbation parameter. Finally, a practical example is given to demonstrate the effectiveness of our proposed adaptive NN tracking control scheme.
Collapse
|