Stochastic Two-Player Zero-Sum Learning Differential Games

The two-player zero-sum differential game has been extensively studied, partially because its solution implies the H ∞ optimality. Existing studies on zero-sum differential games either assume deterministic dynamics or the dynamics corrupted by additive noise. In realistic environments, highdimensio...

Full description

Saved in:
Bibliographic Details
Published inIEEE International Conference on Control and Automation (Print) pp. 1038 - 1043
Main Authors Liu, Mushuang, Wan, Yan, Lewis, Frank L., Lopez, Victor G.
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.07.2019
Subjects
Online AccessGet full text
ISSN1948-3457
DOI10.1109/ICCA.2019.8899568

Cover

More Information
Summary:The two-player zero-sum differential game has been extensively studied, partially because its solution implies the H ∞ optimality. Existing studies on zero-sum differential games either assume deterministic dynamics or the dynamics corrupted by additive noise. In realistic environments, highdimensional environmental uncertainties often modulate system dynamics in a more complicated fashion. In this paper, we study the stochastic two-player zero-sum differential game governed by more general uncertain linear dynamics. We show that the optimal control policies for this game can be found by solving the Hamilton-Jacobi-Bellman (HJB) equation. We prove that with the derived optimal control policies, the system is asymptotically stable in the mean, and reaches the Nash equilibrium. To solve the stochastic two-player zero-sum game online, we design a new policy iteration (PI) algorithm that integrates the integral reinforcement learning (IRL) and an efficient uncertainty evaluation method-multivariate probabilistic collocation method (MPCM). This algorithm provides a fast online solution for the stochastic two-player zero-sum differential game subject to multiple uncertainties in the system dynamics.
ISSN:1948-3457
DOI:10.1109/ICCA.2019.8899568