Iterative Empirical Game Solving via Single Policy Best Response

M Smith, T Anthony, and MP Wellman 9th International Conference on Learning Representations (ICLR), Spotlight Presentation, May 2021. Abstract Policy-Space Response Oracles (PSRO) is a general algorithmic framework for learning policies in…