Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Beyond Worst-Case: Instance-Dependent Optimality in Reinforcement Learning

Simons Institute via YouTube

Overview

Explore instance-dependent optimality in reinforcement learning through this 44-minute lecture by Martin Wainwright from UC Berkeley. Delve into topics such as global minimax, policy evaluation, and the optimality of TD learning. Learn how to develop instance-optimal algorithms and understand useful bounds in the context of reinforcement learning from batch data and simulation. Gain insights into decoupling techniques and optimal site selection. Conclude with a summary and engage in a Q&A session to deepen your understanding of advanced reinforcement learning concepts.

Syllabus

Introduction
Problem Statement
Global Minimax
Policy Evaluation
Is TD Learning Optimal
How to Get an InstanceOptimal Algorithm
Useful Bounds
Summary
Questions
Decoupling
Optimal Site
QA

Taught by

Simons Institute

Reviews

Start your review of Beyond Worst-Case: Instance-Dependent Optimality in Reinforcement Learning

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.