Off-Policy Policy Optimization

Simons Institute via YouTube

Batch policy optimization

3 of 8

Classroom Contents

  1. Intro
  2. The RL problem
  3. Batch policy optimization
  4. Optimization objectives
  5. Supervised vs reinforcement learning
  6. Missing data inference
  7. Sequential decision making
  8. Sequential RL
