Completed
Intro
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning
Automatically move to the next video in the Classroom when playback concludes
- 1 Intro
- 2 Motivations
- 3 Policy-Space Response Oracles (PSRO) [Lanctot et. al '17] • Maintains a pool of strategies for each player, and iteratively.
- 4 Motivated Example: "Deal-or-No-Deal"[1]
- 5 Example: Bach or Stravinsky
- 6 PSRO on games beyond purely adversarial domains (no search)
- 7 Extending AlphaZero to Large Imperfect Information
- 8 MCTS in PSRO: A Bayesian Interpretation