Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Mastering Atari with Discrete World Models - Machine Learning Research Paper Explained

Yannic Kilcher via YouTube

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a comprehensive video lecture on the Dreamer v2 algorithm, a groundbreaking approach in model-based reinforcement learning for mastering Atari games. Delve into the intricacies of world models, discrete representations, and latent space predictions as the speaker breaks down this collaborative research from Google AI, DeepMind, and the University of Toronto. Learn about the innovative use of discrete and stochastic latent states, the architecture of the world model learner, and the application of actor-critic learning in dream space. Gain insights into advanced concepts such as KL balancing, straight-through estimators, and the challenges of incomplete world models. Understand the experimental results, limitations, and potential applications of this state-of-the-art algorithm that achieves human-level performance on the Atari benchmark using a single GPU.

Syllabus

- Intro & Overview
- Short Recap of Reinforcement Learning
- Problems with Model-Free Reinforcement Learning
- How World Models Help
- World Model Learner Architecture
- Deterministic & Stochastic Hidden States
- Latent Categorical Variables
- Categorical Variables and Multi-Modality
- Sampling & Stochastic State Prediction
- Actor-Critic Learning in Dream Space
- The Incompleteness of Learned World Models
- How General is this Algorithm?
- World Model Loss Function
- KL Balancing
- Actor-Critic Loss Function
- Straight-Through Estimators for Sampling Backpropagation
- Experimental Results
- Where Does It Fail?
- Conclusion

Taught by

Yannic Kilcher

Reviews

Start your review of Mastering Atari with Discrete World Models - Machine Learning Research Paper Explained

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.