Overview
Syllabus
- Intro & Overview
- Short Recap of Reinforcement Learning
- Problems with Model-Free Reinforcement Learning
- How World Models Help
- World Model Learner Architecture
- Deterministic & Stochastic Hidden States
- Latent Categorical Variables
- Categorical Variables and Multi-Modality
- Sampling & Stochastic State Prediction
- Actor-Critic Learning in Dream Space
- The Incompleteness of Learned World Models
- How General is this Algorithm?
- World Model Loss Function
- KL Balancing
- Actor-Critic Loss Function
- Straight-Through Estimators for Sampling Backpropagation
- Experimental Results
- Where Does It Fail?
- Conclusion
Taught by
Yannic Kilcher