Iterative Empirical Game Solving via Single Policy Best Response

M Smith, T Anthony, and MP Wellman 9th International Conference on Learning Representations (ICLR), Spotlight Presentation, May 2021 (forthcoming). Abstract Policy-Space Response Oracles (PSRO) is a general algorithmic framework for learning…