First-order algorithm with convergence for -equilibrium in two-person zero-sum games

We propose an iterated version of Nesterov’s first-order smoothing method for the two-person zero-sum game equilibrium problem This formulation applies to matrix games as well as sequential games. Our new algorithmic scheme computes an -equilibrium to this min-max problem in first-order iterations,...

Full description

Saved in:

Bibliographic Details
Published in	Mathematical programming Vol. 133; no. 1-2; pp. 279 - 298
Main Authors	Gilpin, Andrew, Peña, Javier, Sandholm, Tuomas
Format	Journal Article
Language	English
Published	Berlin/Heidelberg Springer-Verlag 01.06.2012 Springer Nature B.V
Subjects	Algorithms Calculus of Variations and Optimal Control; Optimization Combinatorics Convex analysis Equilibrium Full Length Paper Game theory Games Linear programming Mathematical and Computational Physics Mathematical Methods in Physics Mathematics Mathematics and Statistics Mathematics of Computing Methods Numerical Analysis Optimization Theoretical 90C52 91A05 90C47 90C33
Online Access	Get full text
ISSN	0025-5610 1436-4646
DOI	10.1007/s10107-010-0430-2

Cover

More Information
Summary:	We propose an iterated version of Nesterov’s first-order smoothing method for the two-person zero-sum game equilibrium problem This formulation applies to matrix games as well as sequential games. Our new algorithmic scheme computes an -equilibrium to this min-max problem in first-order iterations, where δ ( A ) is a certain condition measure of the matrix A . This improves upon the previous first-order methods which required iterations, and it matches the iteration complexity bound of interior-point methods in terms of the algorithm’s dependence on . Unlike interior-point methods that are inapplicable to large games due to their memory requirements, our algorithm retains the small memory requirements of prior first-order methods. Our scheme supplements Nesterov’s method with an outer loop that lowers the target between iterations (this target affects the amount of smoothing in the inner loop). Computational experiments both in matrix games and sequential games show that a significant speed improvement is obtained in practice as well, and the relative speed improvement increases with the desired accuracy (as suggested by the complexity bounds).
Bibliography:	SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14
ISSN:	0025-5610 1436-4646
DOI:	10.1007/s10107-010-0430-2