Stochastic Two-Player Zero-Sum Learning Differential Games

The two-player zero-sum differential game has been extensively studied, partially because its solution implies the H ∞ optimality. Existing studies on zero-sum differential games either assume deterministic dynamics or the dynamics corrupted by additive noise. In realistic environments, highdimensio...

Full description

Saved in:

Bibliographic Details
Published in	IEEE International Conference on Control and Automation (Print) pp. 1038 - 1043
Main Authors	Liu, Mushuang, Wan, Yan, Lewis, Frank L., Lopez, Victor G.
Format	Conference Proceeding
Language	English
Published	IEEE 01.07.2019
Subjects	Games Heuristic algorithms Optimal control System dynamics Uncertainty
Online Access	Get full text
ISSN	1948-3457
DOI	10.1109/ICCA.2019.8899568

Cover

More Information
Summary:	The two-player zero-sum differential game has been extensively studied, partially because its solution implies the H ∞ optimality. Existing studies on zero-sum differential games either assume deterministic dynamics or the dynamics corrupted by additive noise. In realistic environments, highdimensional environmental uncertainties often modulate system dynamics in a more complicated fashion. In this paper, we study the stochastic two-player zero-sum differential game governed by more general uncertain linear dynamics. We show that the optimal control policies for this game can be found by solving the Hamilton-Jacobi-Bellman (HJB) equation. We prove that with the derived optimal control policies, the system is asymptotically stable in the mean, and reaches the Nash equilibrium. To solve the stochastic two-player zero-sum game online, we design a new policy iteration (PI) algorithm that integrates the integral reinforcement learning (IRL) and an efficient uncertainty evaluation method-multivariate probabilistic collocation method (MPCM). This algorithm provides a fast online solution for the stochastic two-player zero-sum differential game subject to multiple uncertainties in the system dynamics.
ISSN:	1948-3457
DOI:	10.1109/ICCA.2019.8899568