Safe adaptive learning algorithm with neural network implementation for H∞ control of nonlinear safety‐critical system

In this article, the H∞$$ {H}_{\infty } $$ safe control problem of continuous‐time affine nonlinear safety‐critical systems is studied based on the barrier function (BF) and adaptive dynamic programming (ADP). We show that the safety constraints in this article occur under the condition that the sys...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of robust and nonlinear control Vol. 33; no. 1; pp. 372 - 391
Main Authors Qin, Chunbin, Wang, Jinguang, Zhu, Heyang, Xiao, Qiyang, Zhang, Dehua
Format Journal Article
LanguageEnglish
Published Bognor Regis Wiley Subscription Services, Inc 10.01.2023
Subjects
Online AccessGet full text
ISSN1049-8923
1099-1239
DOI10.1002/rnc.6452

Cover

More Information
Summary:In this article, the H∞$$ {H}_{\infty } $$ safe control problem of continuous‐time affine nonlinear safety‐critical systems is studied based on the barrier function (BF) and adaptive dynamic programming (ADP). We show that the safety constraints in this article occur under the condition that the system initial state is unsafe and have not been adequately addressed in the existing work. First, the H∞$$ {H}_{\infty } $$ control problem is transformed into a zero‐sum game problem, and a new safe Hamilton–Jacobi–Isaacs equation is proposed by combining with the BF, which makes the unsafe behavior be punished in the learning process. In addition, a damping coefficient is introduced into the BF to punish the unsafe behavior more flexibly. Aiming at the requirement that the initial system state must be strictly constrained in the safe set, a new weight updating method based on ADP is proposed, which can reasonably avoid the influence of the BF on neural network (NN) parameters when the initial system state is unsafe. Furthermore, based on the Lyapunov stability theory, it is proved that the system states of safety‐critical systems and the NN parameters are uniformly ultimately bounded under the safety constraints and disturbances effect. Finally, the effectiveness of the proposed method is verified by two simulation examples.
Bibliography:Funding information
Key Projects of Henan Province Colleges, Grant/Award Number: 22A416004; National Nature Science Foundation, Grant/Award Number: U1504615; Science and Technology Research Project of the Henan Province, Grant/Award Numbers: 222102220028, 222102240014; Young Elite Scientist Sponsorship Program by Henan Association for Science and Technology, Grant/Award Number: 2021HYTP014; Youth Backbone Teachers in Colleges and Universities of Henan Province, Grant/Award Number: 2018GGJS017
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1049-8923
1099-1239
DOI:10.1002/rnc.6452