Stochastic Two-Player Zero-Sum Learning Differential Games

Bibliographic Details
Published in IEEE International Conference on Control and Automation (Print) pp. 1038-1043
Main Authors Liu, Mushuang; Wan, Yan; Lewis, Frank L.; Lopez, Victor G.
Format Conference Proceeding
Language English
Published IEEE 01.07.2019
ISSN 1948-3457
DOI 10.1109/ICCA.2019.8899568

Abstract The two-player zero-sum differential game has been extensively studied, partly because its solution implies H∞ optimality. Existing studies on zero-sum differential games assume either deterministic dynamics or dynamics corrupted by additive noise. In realistic environments, high-dimensional environmental uncertainties often modulate system dynamics in a more complicated fashion. In this paper, we study the stochastic two-player zero-sum differential game governed by more general uncertain linear dynamics. We show that the optimal control policies for this game can be found by solving the Hamilton-Jacobi-Bellman (HJB) equation. We prove that, with the derived optimal control policies, the system is asymptotically stable in the mean and reaches the Nash equilibrium. To solve the stochastic two-player zero-sum game online, we design a new policy iteration (PI) algorithm that integrates integral reinforcement learning (IRL) with an efficient uncertainty evaluation method, the multivariate probabilistic collocation method (MPCM). This algorithm provides a fast online solution for the stochastic two-player zero-sum differential game subject to multiple uncertainties in the system dynamics.
Author Liu, Mushuang
Lewis, Frank L.
Wan, Yan
Lopez, Victor G.
Author_xml – sequence: 1
  givenname: Mushuang
  surname: Liu
  fullname: Liu, Mushuang
  organization: University of Texas at Arlington, Department of Electrical Engineering, Arlington, TX, USA, 76019
– sequence: 2
  givenname: Yan
  surname: Wan
  fullname: Wan, Yan
  organization: University of Texas at Arlington, Department of Electrical Engineering, Arlington, TX, USA, 76019
– sequence: 3
  givenname: Frank L.
  surname: Lewis
  fullname: Lewis, Frank L.
  organization: UTA Research Institute, University of Texas at Arlington, Fort Worth, Texas, USA
– sequence: 4
  givenname: Victor G.
  surname: Lopez
  fullname: Lopez, Victor G.
  organization: University of Texas at Arlington, Department of Electrical Engineering, Arlington, TX, USA, 76019
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ICCA.2019.8899568
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISBN 9781728111643
1728111641
EISSN 1948-3457
EndPage 1043
ExternalDocumentID 8899568
Genre orig-research
GroupedDBID 6IE
6IF
6IK
6IL
6IN
AAJGR
AAWTH
ACGFS
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IPLJI
M43
OCL
RIE
RIL
RNS
ID FETCH-LOGICAL-i203t-9f3e0a994301ae20a56a4767c78e4ec3acb78d5f0b1a421583216bc7ef9f6dee3
IEDL.DBID RIE
IngestDate Wed Aug 27 02:44:32 EDT 2025
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i203t-9f3e0a994301ae20a56a4767c78e4ec3acb78d5f0b1a421583216bc7ef9f6dee3
PageCount 6
ParticipantIDs ieee_primary_8899568
PublicationCentury 2000
PublicationDate 2019-July
PublicationDateYYYYMMDD 2019-07-01
PublicationDate_xml – month: 07
  year: 2019
  text: 2019-July
PublicationDecade 2010
PublicationTitle IEEE International Conference on Control and Automation (Print)
PublicationTitleAbbrev ICCA
PublicationYear 2019
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0001085600
Score 1.7216903
SourceID ieee
SourceType Publisher
StartPage 1038
SubjectTerms Games
Heuristic algorithms
Optimal control
System dynamics
Uncertainty
Title Stochastic Two-Player Zero-Sum Learning Differential Games
URI https://ieeexplore.ieee.org/document/8899568
hasFullText 1
inHoldings 1
isFullTextHit
isPrint