Completed
Unbiased Policy Evaluation for General RL in Short Horizons
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Towards Fast Autonomous Learners: Advances in Reinforcement Learning - 2015
Automatically move to the next video in the Classroom when playback concludes
- 1 Intro
- 2 Markov Decision Process (MDP)
- 3 Reinforcement Learning
- 4 Unbiased Policy Evaluation for General RL in Short Horizons
- 5 Queue-based Offline Evaluation of Online Bandit Algorithms
- 6 Our Queue Approach Can Sometimes Evaluate Algorithms that Use Deterministic Policies for Many More Time Steps than Rejection
- 7 Sample Complexity of RL
- 8 Provably More Efficient Learners
- 9 Fast, Better Policy Search using Bayesian Optimization
- 10 Black Box Optimization
- 11 Opening the Box: Leverage Offline Policy Evaluation
- 12 Personalization & Transfer Learning for Sequential Decision Making Tasks
- 13 Latent Variable Modeling Background
- 14 Diameter Assumption: Needed for Sample Complexity Improvement in Transfer?
- 15 Active Set is Models Compatible with Current Task's Data
- 16 More Data Efficient Learning In Domains Where It Matters