First-order algorithm with O ( rm ln ( 1 / epsilon ) ) convergence for epsilon -equilibrium in two-person zero-sum games
We propose an iterated version of Nesterov's first-order smoothing method for the two-person zero-sum game equilibrium problem min x [isin] Q 1 max y [isin] Q 2 x rm T Ay = max y [isin] Q 2 min x [isin] Q 1 x rm T Ay . This formulation applies to matrix games as well as sequential games. Our ne...
Saved in:
| Published in | Mathematical programming Vol. 133; no. 1-2; pp. 279 - 298 |
|---|---|
| Main Authors | , , |
| Format | Journal Article |
| Language | English |
| Published |
01.06.2012
|
| Subjects | |
| Online Access | Get full text |
| ISSN | 0025-5610 1436-4646 |
| DOI | 10.1007/s10107-010-0430-2 |
Cover
| Summary: | We propose an iterated version of Nesterov's first-order smoothing method for the two-person zero-sum game equilibrium problem min x [isin] Q 1 max y [isin] Q 2 x rm T Ay = max y [isin] Q 2 min x [isin] Q 1 x rm T Ay . This formulation applies to matrix games as well as sequential games. Our new algorithmic scheme computes an epsilon -equilibrium to this min-max problem in O ( [par] A [par] delta ( A ) rm ln ( 1 / epsilon ) ) first-order iterations, where delta (A) is a certain condition measure of the matrix A. This improves upon the previous first-order methods which required O ( 1 / epsilon ) iterations, and it matches the iteration complexity bound of interior-point methods in terms of the algorithm's dependence on epsilon . Unlike interior-point methods that are inapplicable to large games due to their memory requirements, our algorithm retains the small memory requirements of prior first-order methods. Our scheme supplements Nesterov's method with an outer loop that lowers the target epsilon between iterations (this target affects the amount of smoothing in the inner loop). Computational experiments both in matrix games and sequential games show that a significant speed improvement is obtained in practice as well, and the relative speed improvement increases with the desired accuracy (as suggested by the complexity bounds). |
|---|---|
| Bibliography: | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 |
| ISSN: | 0025-5610 1436-4646 |
| DOI: | 10.1007/s10107-010-0430-2 |