Completed
Policy improvement
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Maximum Entropy Reinforcement Learning
Automatically move to the next video in the Classroom when playback concludes
- 1 Intro
- 2 Maximum Entropy RL
- 3 Reinforcement Learning
- 4 Encouraging Stochasticity
- 5 Optimal Policy
- 6 Q-function
- 7 Greedy Policy
- 8 Greedy Value function
- 9 Soft Q-Value Iteration
- 10 Soft Q-learning
- 11 Soft Policy Iteration
- 12 Policy improvement
- 13 Inequality derivation
- 14 Proof derivation
- 15 Soft Actor-Critic
- 16 Soft Actor Critic (SAC)
- 17 Empirical Results
- 18 Robustness to Environment Changes