Stochastic Two-Player Zero-Sum Learning Differential Games
The two-player zero-sum differential game has been extensively studied, partially because its solution implies the H ∞ optimality. Existing studies on zero-sum differential games either assume deterministic dynamics or the dynamics corrupted by additive noise. In realistic environments, highdimensio...
Saved in:
| Published in | IEEE International Conference on Control and Automation (Print) pp. 1038 - 1043 |
|---|---|
| Main Authors | , , , |
| Format | Conference Proceeding |
| Language | English |
| Published |
IEEE
01.07.2019
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 1948-3457 |
| DOI | 10.1109/ICCA.2019.8899568 |
Cover
| Summary: | The two-player zero-sum differential game has been extensively studied, partially because its solution implies the H ∞ optimality. Existing studies on zero-sum differential games either assume deterministic dynamics or the dynamics corrupted by additive noise. In realistic environments, highdimensional environmental uncertainties often modulate system dynamics in a more complicated fashion. In this paper, we study the stochastic two-player zero-sum differential game governed by more general uncertain linear dynamics. We show that the optimal control policies for this game can be found by solving the Hamilton-Jacobi-Bellman (HJB) equation. We prove that with the derived optimal control policies, the system is asymptotically stable in the mean, and reaches the Nash equilibrium. To solve the stochastic two-player zero-sum game online, we design a new policy iteration (PI) algorithm that integrates the integral reinforcement learning (IRL) and an efficient uncertainty evaluation method-multivariate probabilistic collocation method (MPCM). This algorithm provides a fast online solution for the stochastic two-player zero-sum differential game subject to multiple uncertainties in the system dynamics. |
|---|---|
| ISSN: | 1948-3457 |
| DOI: | 10.1109/ICCA.2019.8899568 |