Stochastic Two-Player Zero-Sum Learning Differential Games
The two-player zero-sum differential game has been extensively studied, partially because its solution implies the H ∞ optimality. Existing studies on zero-sum differential games either assume deterministic dynamics or the dynamics corrupted by additive noise. In realistic environments, highdimensio...
        Saved in:
      
    
          | Published in | IEEE International Conference on Control and Automation (Print) pp. 1038 - 1043 | 
|---|---|
| Main Authors | , , , | 
| Format | Conference Proceeding | 
| Language | English | 
| Published | 
            IEEE
    
        01.07.2019
     | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 1948-3457 | 
| DOI | 10.1109/ICCA.2019.8899568 | 
Cover
| Abstract | The two-player zero-sum differential game has been extensively studied, partially because its solution implies the H ∞ optimality. Existing studies on zero-sum differential games either assume deterministic dynamics or the dynamics corrupted by additive noise. In realistic environments, highdimensional environmental uncertainties often modulate system dynamics in a more complicated fashion. In this paper, we study the stochastic two-player zero-sum differential game governed by more general uncertain linear dynamics. We show that the optimal control policies for this game can be found by solving the Hamilton-Jacobi-Bellman (HJB) equation. We prove that with the derived optimal control policies, the system is asymptotically stable in the mean, and reaches the Nash equilibrium. To solve the stochastic two-player zero-sum game online, we design a new policy iteration (PI) algorithm that integrates the integral reinforcement learning (IRL) and an efficient uncertainty evaluation method-multivariate probabilistic collocation method (MPCM). This algorithm provides a fast online solution for the stochastic two-player zero-sum differential game subject to multiple uncertainties in the system dynamics. | 
    
|---|---|
| AbstractList | The two-player zero-sum differential game has been extensively studied, partially because its solution implies the H ∞ optimality. Existing studies on zero-sum differential games either assume deterministic dynamics or the dynamics corrupted by additive noise. In realistic environments, highdimensional environmental uncertainties often modulate system dynamics in a more complicated fashion. In this paper, we study the stochastic two-player zero-sum differential game governed by more general uncertain linear dynamics. We show that the optimal control policies for this game can be found by solving the Hamilton-Jacobi-Bellman (HJB) equation. We prove that with the derived optimal control policies, the system is asymptotically stable in the mean, and reaches the Nash equilibrium. To solve the stochastic two-player zero-sum game online, we design a new policy iteration (PI) algorithm that integrates the integral reinforcement learning (IRL) and an efficient uncertainty evaluation method-multivariate probabilistic collocation method (MPCM). This algorithm provides a fast online solution for the stochastic two-player zero-sum differential game subject to multiple uncertainties in the system dynamics. | 
    
| Author | Liu, Mushuang Lewis, Frank L. Wan, Yan Lopez, Victor G.  | 
    
| Author_xml | – sequence: 1 givenname: Mushuang surname: Liu fullname: Liu, Mushuang organization: University of Texas at Arlington,Department of Electrical Engineering,Arlington,TX,USA,76019 – sequence: 2 givenname: Yan surname: Wan fullname: Wan, Yan organization: University of Texas at Arlington,Department of Electrical Engineering,Arlington,TX,USA,76019 – sequence: 3 givenname: Frank L. surname: Lewis fullname: Lewis, Frank L. organization: UTA Research Institute, University of Texas at Arlington,Fort Worth,Texas,USA – sequence: 4 givenname: Victor G. surname: Lopez fullname: Lopez, Victor G. organization: University of Texas at Arlington,Department of Electrical Engineering,Arlington,TX,USA,76019  | 
    
| BookMark | eNotz8tKw0AUgOFRFKw1DyBu8gKp52Qmc3FXYq2FgELrxk05Sc_oSC4yiUjf3oVd_bsP_mtx0Q89C3GLsEAEd78py-UiB3QLa50rtD0TiTMWTW4RUSt5LmbolM2kKsyVSMbxCwAQbKEBZuJhOw3NJ41TaNLd75C9tnTkmL5zHLLtT5dWTLEP_Uf6GLznyP0UqE3X1PF4Iy49tSMnp87F29NqVz5n1ct6Uy6rLOQgp8x5yUDOKQlInAMVmpTRpjGWFTeSmtrYQ-GhRlI5FlbmqOvGsHdeH5jlXNz9u4GZ998xdBSP-9Ot_AMXhkmo | 
    
| ContentType | Conference Proceeding | 
    
| DBID | 6IE 6IL CBEJK RIE RIL  | 
    
| DOI | 10.1109/ICCA.2019.8899568 | 
    
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present  | 
    
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Engineering | 
    
| EISBN | 9781728111643 1728111641  | 
    
| EISSN | 1948-3457 | 
    
| EndPage | 1043 | 
    
| ExternalDocumentID | 8899568 | 
    
| Genre | orig-research | 
    
| GroupedDBID | 6IE 6IF 6IK 6IL 6IN AAJGR AAWTH ACGFS ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IPLJI M43 OCL RIE RIL RNS  | 
    
| ID | FETCH-LOGICAL-i203t-9f3e0a994301ae20a56a4767c78e4ec3acb78d5f0b1a421583216bc7ef9f6dee3 | 
    
| IEDL.DBID | RIE | 
    
| IngestDate | Wed Aug 27 02:44:32 EDT 2025 | 
    
| IsPeerReviewed | false | 
    
| IsScholarly | false | 
    
| Language | English | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-i203t-9f3e0a994301ae20a56a4767c78e4ec3acb78d5f0b1a421583216bc7ef9f6dee3 | 
    
| PageCount | 6 | 
    
| ParticipantIDs | ieee_primary_8899568 | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2019-July | 
    
| PublicationDateYYYYMMDD | 2019-07-01 | 
    
| PublicationDate_xml | – month: 07 year: 2019 text: 2019-July  | 
    
| PublicationDecade | 2010 | 
    
| PublicationTitle | IEEE International Conference on Control and Automation (Print) | 
    
| PublicationTitleAbbrev | ICCA | 
    
| PublicationYear | 2019 | 
    
| Publisher | IEEE | 
    
| Publisher_xml | – name: IEEE | 
    
| SSID | ssj0001085600 | 
    
| Score | 1.7216903 | 
    
| Snippet | The two-player zero-sum differential game has been extensively studied, partially because its solution implies the H ∞ optimality. Existing studies on zero-sum... | 
    
| SourceID | ieee | 
    
| SourceType | Publisher | 
    
| StartPage | 1038 | 
    
| SubjectTerms | Games Heuristic algorithms Optimal control System dynamics Uncertainty  | 
    
| Title | Stochastic Two-Player Zero-Sum Learning Differential Games | 
    
| URI | https://ieeexplore.ieee.org/document/8899568 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3LS8MwGP_YdtKLj01804NHs6Vr1qTeZDqnMBlsg-FlpOlXFecqo0XwrzdfWzcVD95CIWnJg1-a_B4AZ54ywg15xHQQaUZxFoxyReyK9w0XscBORALnwb3fn4i7aWdagfOVFgYRc_IZNqmY3-VHicnoqKylFOkwVRWqUvmFVmt9nmL3Dha8y4tLlwet2273krhbNBnyej8CVHL86G3B4OvNBW3kpZmlYdN8_DJl_O-nbUNjrdRzhisM2oEKLnZh85vJYB0uRmlinjT5MTvj94QN59pus50HXCZslL06pcPqo3NVZqXYNT93bog924BJ73rc7bMyMYE9t7mXsiD2kOuALNVdjW2uO74W0pdGKhRoPG1CqaJOzENXCwv2FFPkh0ZiHMR-hOjtQW2RLHAfHGnI2S438oyEsj9pQkjJ3VDbdpSn8ADq1Auzt8IUY1Z2wOHfj49gg0ai4LkeQy1dZnhi0TwNT_Nh_ARQNp59 | 
    
| linkProvider | IEEE | 
    
| linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3LT8IwHP4F8aBefKDx7Q4eLXSsWztvBkVQICRAQryQrvtNjcgM2WLiX2-7TVDjwVuzpN3SR76u_R4A545QzA5oSKQfSmLiLIjJFdEr3lOURQzd0Aicuz2vNWJ3Y3dcgouFFgYRM_IZVk0xu8sPY5Wao7KaEEaHKVZg1WWMublaa3mioncPGr6Lq0ub-rV2o3Fl2FtmOmQ1f0SoZAjS3ITu17tz4shLNU2Cqvr4Zcv434_bgt2lVs_qL1BoG0o424GNbzaDFbgcJLF6ksaR2Rq-x6Q_lXqjbT3gPCaD9NUqPFYfresiLUWv-ql1a_izuzBq3gwbLVJkJpDnOnUS4kcOUukbU3VbYp1K15OMe1xxgQyVI1XARehGNLAl03Bvgoq8QHGM_MgLEZ09KM_iGe6DxZXxtsusPEMm9G8aY5xTO5C6HeEIPICK6YXJW26LMSk64PDvx2ew1hp2O5NOu3d_BOtmVHLW6zGUk3mKJxrbk-A0G9JPz6Shyg | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=IEEE+International+Conference+on+Control+and+Automation+%28Print%29&rft.atitle=Stochastic+Two-Player+Zero-Sum+Learning+Differential+Games&rft.au=Liu%2C+Mushuang&rft.au=Wan%2C+Yan&rft.au=Lewis%2C+Frank+L.&rft.au=Lopez%2C+Victor+G.&rft.date=2019-07-01&rft.pub=IEEE&rft.eissn=1948-3457&rft.spage=1038&rft.epage=1043&rft_id=info:doi/10.1109%2FICCA.2019.8899568&rft.externalDocID=8899568 |