Deep Reinforcement Learning in the Real World - Sergey Levine
Institute for Advanced Study via YouTube
Overview
Syllabus
Intro
Deep learning helps us handle unstructured environments
Reinforcement learning provides a formalism for behavior
RL has a big problem
Off-policy RL with large datasets
Off-policy model-free learning
How to solve for the Q-function?
QT-Opt: off-policy Q-learning at scale
Grasping with QT-Opt
Emergent grasping strategies
So what's the problem?
How to stop training on garbage?
How well does it work?
Off-policy model-based reinforcement learning
High-level algorithm outline
Model-based RL for dexterous manipulation
Q-Functions (can) learn models
Temporal difference models
Optimizing over valid states
Taught by
Institute for Advanced Study