Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Understanding Q* (Q-star) - From Q-Learning to Maximum Entropy Reinforcement Learning

Discover AI via YouTube

Overview

Learn about the technical foundations and recent developments of Q* (Q-star) in this 20-minute educational video that demystifies complex reinforcement learning concepts. Explore the evolution from basic Q-functions to advanced Q* applications, covering fundamental topics like Bellman equations, Markov Decision Processes, and entropy-based reinforcement learning. Delve into recent developments including Residual Q-Learning, policy customization, and maximum entropy policies, with particular focus on collaborative work between OpenAI and UC Berkeley. Gain clear explanations of how Q* relates to physics principles and agent behavior through imitation learning, dispelling common misconceptions about its connection to artificial general intelligence (AGI). Master technical concepts through structured segments that progress from basic Q-functions to advanced applications in maximum entropy reinforcement learning.

Syllabus

What is Q?
Q function explained
Q-learning update rule Bellman
Markov Decision Process
We compute Q
Residual Q-Learning Oct 2023
Policy customization, multi tasks
Residual Soft Actor Critic
Residual Max-Entropy MC
Q* a soft Q-function Oct 2023
Q* in Max Entropy RL
Q* dev by OpenAI & Berkeley
Maximum Entropy Policies w/ Q star

Taught by

Discover AI

Reviews

Start your review of Understanding Q* (Q-star) - From Q-Learning to Maximum Entropy Reinforcement Learning

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.