Adaptive Dynamics Programming for H∞ Control of Continuous-Time Unknown Nonlinear Systems via Generalized Fuzzy Hyperbolic Models

In this paper, a novel adaptive dynamic programming (ADP) algorithm is developed for the infinite-horizon (<inline-formula> <tex-math notation="LaTeX">H_{\infty} </tex-math></inline-formula>) optimal control problems with unknown continuous-time (CT) nonlinear syste...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on systems, man, and cybernetics. Systems Vol. 50; no. 11; pp. 3996 - 4008
Main Authors Su, Hanguang, Zhang, Huaguang, Gao, David Wenzhong, Luo, Yanhong
Format Journal Article
LanguageEnglish
Published New York IEEE 01.11.2020
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text
ISSN2168-2216
2168-2232
DOI10.1109/TSMC.2019.2900750

Cover

More Information
Summary:In this paper, a novel adaptive dynamic programming (ADP) algorithm is developed for the infinite-horizon (<inline-formula> <tex-math notation="LaTeX">H_{\infty} </tex-math></inline-formula>) optimal control problems with unknown continuous-time (CT) nonlinear systems subject to external disturbances. To facilitate the implementation of the algorithm, generalized fuzzy hyperbolic models (GFHMs) are utilized to establish an identifier-critic architecture, where the identifier is designed to reconstruct the unknown system dynamics, and the GFHM-based critic network is employed to approximate the value functions. The CT <inline-formula> <tex-math notation="LaTeX">H_{\infty} </tex-math></inline-formula> optimal control issue is converted into a two-player zero-sum game and the corresponding Hamilton-Jacobi-Isaacs equation is derived. The learning procedure of the critic design is adaptively implemented with the help of the reconstructed model, thus the requirement of the complete system dynamics is relaxed. Furthermore, by the means of Lyapunov direct method, the uniform ultimate boundedness stability analysis of the closed-loop control system is explicitly provided. Finally, to compare the control performances and disturbance attenuation properties of the proposed method and the existing ADP algorithms, two numerical examples are given.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2168-2216
2168-2232
DOI:10.1109/TSMC.2019.2900750