Completed
Regret analysis of UCRL-VTR
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Model-Based Reinforcement Learning with Value-Targeted Regression
Automatically move to the next video in the Classroom when playback concludes
- 1 Intro
- 2 Model-Based Reinforcement Learning
- 3 Episodic Reinforcement Learning
- 4 Upper Confidence Model-Based RL (UCRL)
- 5 The class of deterministic continuous systems . Consider a deterministic system
- 6 A Simple Metric-Based RL Algorithm
- 7 Doubling Dimension d
- 8 Feature space embedding of transition model
- 9 The MatrixRL Algorithm
- 10 From Feature to Kernel Embedding of Transition Model
- 11 A motivating example: MuZero
- 12 Assumption of Value-Targeted Regression
- 13 Value-Targeted Regression (VTR) for Confidence Set Construction
- 14 Full Algorithm of UCRL-VTR
- 15 Regret analysis of UCRL-VTR
- 16 A Special Case