DQN - Playing Atari with Deep Reinforcement Learning - RL Paper Explained
Aleksa Gordić - The AI Epiphany via YouTube
Overview
Syllabus
High-level overview of the paper
Experience replay buffer
Difficulties with RL: correlations, non-stationary distributions
DQN is very general
MDP formalism and optimal Q function
Function approximators
The loss function explained
The deadly triad
Algorithm walk-through
Preprocessing and architecture details
Additional details - normalizing score, schedule, etc.
Agent training metrics
Results
Taught by
Aleksa Gordić - The AI Epiphany