Maximum Entropy Reinforcement Learning

Maximum Entropy Reinforcement Learning

Pascal Poupart via YouTube Direct link

Greedy Policy

7 of 18

7 of 18

Greedy Policy

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Maximum Entropy Reinforcement Learning

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 Maximum Entropy RL
  3. 3 Reinforcement Learning
  4. 4 Encouraging Stochasticity
  5. 5 Optimal Policy
  6. 6 Q-function
  7. 7 Greedy Policy
  8. 8 Greedy Value function
  9. 9 Soft Q-Value Iteration
  10. 10 Soft Q-learning
  11. 11 Soft Policy Iteration
  12. 12 Policy improvement
  13. 13 Inequality derivation
  14. 14 Proof derivation
  15. 15 Soft Actor-Critic
  16. 16 Soft Actor Critic (SAC)
  17. 17 Empirical Results
  18. 18 Robustness to Environment Changes

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.