Classroom Contents
Understanding Q* (Q-star) - From Q-Learning to Maximum Entropy Reinforcement Learning
- 1 What is Q?
- 2 Q function explained
- 3 Q-learning update rule (Bellman)
- 4 Markov Decision Process
- 5 We compute Q
- 6 Residual Q-Learning (Oct 2023)
- 7 Policy customization, multi-task
- 8 Residual Soft Actor Critic
- 9 Residual Max-Entropy MC
- 10 Q*: a soft Q-function (Oct 2023)
- 11 Q* in Max Entropy RL
- 12 Q* dev by OpenAI & Berkeley
- 13 Maximum Entropy Policies w/ Q star