Abstract
Solving zero-sum matrix games is polynomial, because it boils down to linear programming. The approximate solving is sublinear by randomized algorithms on machines with random access memory. Algorithms working separately and independently on columns and rows have been proposed, with the same performance; these versions are compliant with matrix games with stochastic reward. (Flory and Teytaud, 2011) has proposed a new version, empirically performing better on sparse problems, i.e. cases in which the Nash equilibrium has small support. In this paper, we propose a variant, similar to their work, also dedicated to sparse problems, with provably better bounds than existing methods. We then experiment the method on a card game.
Original language | English |
---|---|
Pages (from-to) | 173-188 |
Number of pages | 16 |
Journal | Journal of Machine Learning Research |
Volume | 39 |
Issue number | 2014 |
Publication status | Published - 2014 |
Externally published | Yes |
Event | 6th Asian Conference on Machine Learning, ACML 2014 - Nha Trang, Viet Nam Duration: 26 Nov 2014 → 28 Nov 2014 |
Bibliographical note
Publisher Copyright:© 2014 D. Auger, J. Liu, S. Ruette, D.L. Saint-Pierre & O. Teytaud.
Keywords
- Bandit algorithms
- Sparsity
- Zero-sum matrix games