Combining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning

GERAD Research Center via YouTube

Classroom Contents

  1. Intro
  2. Motivations
  3. Policy-Space Response Oracles (PSRO) [Lanctot et al. '17]: Maintains a pool of strategies for each player, and iteratively.
  4. Motivated Example: "Deal-or-No-Deal" [1]
  5. Example: Bach or Stravinsky
  6. PSRO on games beyond purely adversarial domains (no search)
  7. Extending AlphaZero to Large Imperfect Information
  8. MCTS in PSRO: A Bayesian Interpretation
