1
|
Yan B, Shi P, Lim CP, Sun Y, Agarwal RK. Security and Safety-Critical Learning-Based Collaborative Control for Multiagent Systems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2025; 36:2777-2788. [PMID: 38277245 DOI: 10.1109/tnnls.2024.3350679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2024]
Abstract
This article presents a novel learning-based collaborative control framework to ensure communication security and formation safety of nonlinear multiagent systems (MASs) subject to denial-of-service (DoS) attacks, model uncertainties, and barriers in environments. The framework has a distributed and decoupled design at the cyber-layer and the physical layer. A resilient control Lyapunov function-quadratic programming (RCLF-QP)-based observer is first proposed to achieve secure reference state estimation under DoS attacks at the cyber-layer. Based on deep reinforcement learning (RL) and control barrier function (CBF), a safety-critical formation controller is designed at the physical layer to ensure safe collaborations between uncertain agents in dynamic environments. The framework is applied to autonomous vehicles for area scanning formations with barriers in environments. The comparative experimental results demonstrate that the proposed framework can effectively improve the resilience and robustness of the system.
Collapse
|
2
|
Qin C, Jiang K, Zhang J, Zhu T. Critic Learning-Based Safe Optimal Control for Nonlinear Systems with Asymmetric Input Constraints and Unmatched Disturbances. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1101. [PMID: 37510048 PMCID: PMC10378920 DOI: 10.3390/e25071101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 07/01/2023] [Accepted: 07/07/2023] [Indexed: 07/30/2023]
Abstract
In this paper, the safe optimal control method for continuous-time (CT) nonlinear safety-critical systems with asymmetric input constraints and unmatched disturbances based on the adaptive dynamic programming (ADP) is investigated. Initially, a new non-quadratic form function is implemented to effectively handle the asymmetric input constraints. Subsequently, the safe optimal control problem is transformed into a two-player zero-sum game (ZSG) problem to suppress the influence of unmatched disturbances, and a new Hamilton-Jacobi-Isaacs (HJI) equation is introduced by integrating the control barrier function (CBF) with the cost function to penalize unsafe behavior. Moreover, a damping factor is embedded in the CBF to balance safety and optimality. To obtain a safe optimal controller, only one critic neural network (CNN) is utilized to tackle the complex HJI equation, leading to a decreased computational load in contrast to the utilization of the conventional actor-critic network. Then, the system state and the parameters of the CNN are uniformly ultimately bounded (UUB) through the application of the Lyapunov stability method. Lastly, two examples are presented to confirm the efficacy of the presented approach.
Collapse
Affiliation(s)
- Chunbin Qin
- School of Artificial Intelligence, Henan University, Zhengzhou 450000, China
| | - Kaijun Jiang
- School of Artificial Intelligence, Henan University, Zhengzhou 450000, China
| | - Jishi Zhang
- School of Software, Henan University, Kaifeng 475000, China
| | - Tianzeng Zhu
- School of Artificial Intelligence, Henan University, Zhengzhou 450000, China
| |
Collapse
|
3
|
Liang Q, Wang Z, Yin Y, Xiong W, Zhang J, Yang Z. Autonomous aerial obstacle avoidance using LiDAR sensor fusion. PLoS One 2023; 18:e0287177. [PMID: 37379288 PMCID: PMC10306222 DOI: 10.1371/journal.pone.0287177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 05/30/2023] [Indexed: 06/30/2023] Open
Abstract
The obstacle avoidance problem of unmanned aerial vehicle (UAV) mainly refers to the design of a method that can safely reach the target point from the starting point in an unknown flight environment. In this paper, we mainly propose an obstacle avoidance method composed of three modules: environment perception, algorithm obstacle avoidance and motion control. Our method realizes the function of reasonable and safe obstacle avoidance of UAV in low-altitude complex environments. Firstly, we use the light detection and ranging (LiDAR) sensor to perceive obstacles around the environment. Next, the sensor data is processed by the vector field histogram (VFH) algorithm to output the desired speed of drone flight. Finally, the expected speed value is sent to the quadrotor flight control and realizes autonomous obstacle avoidance flight of the drone. We verify the effectiveness and feasibility of the proposed method in 3D simulation environment.
Collapse
Affiliation(s)
- Qing Liang
- School Of Electronic Engineering, Xi’an University Of Posts And Telecommunications, Xi’an, Shaanxi Province, China
| | - Zilong Wang
- School Of Electronic Engineering, Xi’an University Of Posts And Telecommunications, Xi’an, Shaanxi Province, China
| | - Yafang Yin
- School Of Electronic Engineering, Xi’an University Of Posts And Telecommunications, Xi’an, Shaanxi Province, China
| | - Wei Xiong
- School Of Information Engineering, Xi’an FANYI University, Xi’an, Shaanxi Province, China
| | - Jingjing Zhang
- School Of Electronic Engineering, Xi’an University Of Posts And Telecommunications, Xi’an, Shaanxi Province, China
| | - Ziyi Yang
- School Of Electronic Engineering, Xi’an University Of Posts And Telecommunications, Xi’an, Shaanxi Province, China
| |
Collapse
|
4
|
Nagy DJ, Milton JG, Insperger T. Controlling stick balancing on a linear track: Delayed state feedback or delay-compensating predictor feedback? BIOLOGICAL CYBERNETICS 2023; 117:113-127. [PMID: 36943486 PMCID: PMC10160210 DOI: 10.1007/s00422-023-00957-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/26/2022] [Accepted: 02/18/2023] [Indexed: 05/06/2023]
Abstract
A planar stick balancing task was investigated using stabilometry parameters (SP); a concept initially developed to assess the stability of human postural sway. Two subject groups were investigated: 6 subjects (MD) with many days of balancing a 90 cm stick on a linear track and 25 subjects (OD) with only one day of balancing experience. The underlying mechanical model is a pendulum-cart system. Two control force models were investigated by means of numerical simulations: (1) delayed state feedback (DSF); and (2) delay-compensating predictor feedback (PF). Both models require an internal model and are subject to certainty thresholds with delayed switching. Measured and simulated time histories were compared quantitatively using a cost function in terms of some essential SPs for all subjects. Minimization of the cost function showed that the control strategy of both OD and MD subjects can better be described by DSF. The control mechanism for the MD subjects was superior in two aspects: (1) they devoted less energy to controlling the cart's position; and (2) their perception threshold for the stick's angular velocity was found to be smaller. Findings support the concept that when sufficient sensory information is readily available, a delay-compensating PF strategy is not necessary.
Collapse
Affiliation(s)
- Dalma J Nagy
- Department of Applied Mechanics, Faculty of Mechanical Engineering, Budapest University of Technology and Economics, Budapest, Hungary
| | - John G Milton
- W. M. Keck Science Center, Claremont Colleges, Claremont, CA, 91711, USA
| | - Tamas Insperger
- Department of Applied Mechanics, Faculty of Mechanical Engineering, Budapest University of Technology and Economics, Budapest, Hungary.
- ELKH-BME Dynamics of Machines Research Group, Budapest, Hungary.
| |
Collapse
|
5
|
Qin C, Qiao X, Wang J, Zhang D. Robust Trajectory Tracking Control for Continuous-Time Nonlinear Systems with State Constraints and Uncertain Disturbances. ENTROPY 2022; 24:e24060816. [PMID: 35741537 PMCID: PMC9222594 DOI: 10.3390/e24060816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Revised: 06/07/2022] [Accepted: 06/07/2022] [Indexed: 02/04/2023]
Abstract
In this paper, a robust trajectory tracking control method with state constraints and uncertain disturbances on the ground of adaptive dynamic programming (ADP) is proposed for nonlinear systems. Firstly, the augmented system consists of the tracking error and the reference trajectory, and the tracking control problems with uncertain disturbances is described as the problem of robust control adjustment. In addition, considering the nominal system of the augmented system, the guaranteed cost tracking control problem is transformed into the optimal control problem by using the discount coefficient in the nominal system. A new safe Hamilton-Jacobi-Bellman (HJB) equation is proposed by combining the cost function with the control barrier function (CBF), so that the behavior of violating the safety regulations for the system states will be punished. In order to solve the new safe HJB equation, a critic neural network (NN) is used to approximate the solution of the safe HJB equation. According to the Lyapunov stability theory, in the case of state constraints and uncertain disturbances, the system states and the parameters of the critic neural network are guaranteed to be uniformly ultimately bounded (UUB). At the end of this paper, the feasibility of the proposed method is verified by a simulation example.
Collapse
|
6
|
Singletary A, Guffey W, Molnar TG, Sinnet R, Ames AD. Safety-Critical Manipulation for Collision-Free Food Preparation. IEEE Robot Autom Lett 2022. [DOI: 10.1109/lra.2022.3192634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Andrew Singletary
- Department of Mechanical and Civil Engineering, California Institute of Technology, Pasadena, CA, USA
| | | | - Tamas G. Molnar
- Department of Mechanical and Civil Engineering, California Institute of Technology, Pasadena, CA, USA
| | | | - Aaron D. Ames
- Department of Mechanical and Civil Engineering, California Institute of Technology, Pasadena, CA, USA
| |
Collapse
|
7
|
Akella P, Ames AD. A Barrier-Based Scenario Approach to Verifying Safety-Critical Systems. IEEE Robot Autom Lett 2022. [DOI: 10.1109/lra.2022.3192805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
8
|
Selim M, Alanwar A, Kousik S, Gao G, Pavone M, H. Johansson K. Safe Reinforcement Learning Using Black-Box Reachability Analysis. IEEE Robot Autom Lett 2022. [DOI: 10.1109/lra.2022.3192205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
| | | | | | - Grace Gao
- Stanford University, Stanford, CA, USA
| | | | | |
Collapse
|