Stochastic Two-Player Zero-Sum Learning Differential Games

Bibliographic Details
Published in	IEEE International Conference on Control and Automation (Print) pp. 1038 - 1043
Main Authors	Liu, Mushuang, Wan, Yan, Lewis, Frank L., Lopez, Victor G.
Format	Conference Proceeding
Language	English
Published	IEEE 01.07.2019
Subjects	Games Heuristic algorithms Optimal control System dynamics Uncertainty
ISSN	1948-3457
DOI	10.1109/ICCA.2019.8899568

Abstract	The two-player zero-sum differential game has been extensively studied, in part because its solution implies H∞ optimality. Existing studies on zero-sum differential games assume either deterministic dynamics or dynamics corrupted by additive noise. In realistic environments, high-dimensional environmental uncertainties often modulate system dynamics in a more complicated fashion. In this paper, we study the stochastic two-player zero-sum differential game governed by more general uncertain linear dynamics. We show that the optimal control policies for this game can be found by solving the Hamilton-Jacobi-Bellman (HJB) equation. We prove that, with the derived optimal control policies, the system is asymptotically stable in the mean and reaches the Nash equilibrium. To solve the stochastic two-player zero-sum game online, we design a new policy iteration (PI) algorithm that integrates integral reinforcement learning (IRL) with an efficient uncertainty evaluation method, the multivariate probabilistic collocation method (MPCM). This algorithm provides a fast online solution for the stochastic two-player zero-sum differential game subject to multiple uncertainties in the system dynamics.
Author Affiliations	Liu, Mushuang (University of Texas at Arlington, Department of Electrical Engineering, Arlington, TX, USA, 76019); Wan, Yan (University of Texas at Arlington, Department of Electrical Engineering, Arlington, TX, USA, 76019); Lewis, Frank L. (UTA Research Institute, University of Texas at Arlington, Fort Worth, Texas, USA); Lopez, Victor G. (University of Texas at Arlington, Department of Electrical Engineering, Arlington, TX, USA, 76019)
Discipline	Engineering
EISBN	9781728111643 1728111641
PageCount	6
URI	https://ieeexplore.ieee.org/document/8899568