Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Off-Policy Policy Optimization

Simons Institute via YouTube

Overview

Explore off-policy policy optimization in reinforcement learning with Dale Schuurmans from Google Brain and the University of Alberta in this 53-minute lecture. Delve into key concepts including the RL problem, batch policy optimization, and optimization objectives. Compare supervised and reinforcement learning approaches, and examine missing data inference in the context of sequential decision making. Gain insights into the emerging challenges in deep learning as applied to reinforcement learning algorithms and policy optimization techniques.

Syllabus

Intro
The RL problem
Batch policy optimization
Optimization objectives
Supervised vs reinforcement learning
Missing data inference
Sequential decision making
Sequential RL

Taught by

Simons Institute

Reviews

Start your review of Off-Policy Policy Optimization

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.