Understanding Q* (Q-star) - From Q-Learning to Maximum Entropy Reinforcement Learning

Understanding Q* (Q-star) - From Q-Learning to Maximum Entropy Reinforcement Learning

Discover AI via YouTube Direct link

Markov Decision Process

4 of 13

4 of 13

Markov Decision Process

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Understanding Q* (Q-star) - From Q-Learning to Maximum Entropy Reinforcement Learning

Automatically move to the next video in the Classroom when playback concludes

  1. 1 What is Q?
  2. 2 Q function explained
  3. 3 Q-learning update rule Bellman
  4. 4 Markov Decision Process
  5. 5 We compute Q
  6. 6 Residual Q-Learning Oct 2023
  7. 7 Policy customization, multi tasks
  8. 8 Residual Soft Actor Critic
  9. 9 Residual Max-Entropy MC
  10. 10 Q* a soft Q-function Oct 2023
  11. 11 Q* in Max Entropy RL
  12. 12 Q* dev by OpenAI & Berkeley
  13. 13 Maximum Entropy Policies w/ Q star

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.