Safe and Efficient Exploration in Reinforcement Learning

Safe and Efficient Exploration in Reinforcement Learning

Fields Institute via YouTube Direct link

Action penalty effect

25 of 33

25 of 33

Action penalty effect

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Safe and Efficient Exploration in Reinforcement Learning

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 RL beyond simulated environments?
  3. 3 Tuning the Swiss Free Electron Laser [with Kirschner, Muty, Hiller, Ischebeck et al.]
  4. 4 Challenge: Safety Constraints
  5. 5 Safe optimization
  6. 6 Safe Bayesian optimization
  7. 7 Illustration of Gaussian Process Inference [cf, Rasmussen & Williams 2006]
  8. 8 Plausible maximizers
  9. 9 Certifying Safety
  10. 10 Confidence intervals for GPS?
  11. 11 Online tuning of 24 parameters
  12. 12 Shortcomings of Safe Opt
  13. 13 Safe learning for dynamical systems Koller, Berkenkamp, Turchetta, K CDC 18, 19
  14. 14 Stylized task
  15. 15 Planning with confidence bounds Koller, Berkenkamp, Turchetta, K CDC 18, 19
  16. 16 Forwards-propagating uncertain, nonlinear dynamics
  17. 17 Challenges with long-term action dependencies
  18. 18 Safe learning-based MPC
  19. 19 Experimental illustration
  20. 20 Scaling up: Efficient Optimistic Exploration in Deep Model based Reinforcement Learning
  21. 21 Optimism in Model-based Deep RL
  22. 22 Deep Model-based RL with Confidence: H-UCRL [Curi, Berkenkamp, K, Neurips 20]
  23. 23 Illustration on Inverted Pendulum
  24. 24 Deep RL: Mujoco Half-Cheetah
  25. 25 Action penalty effect
  26. 26 What about safety?
  27. 27 Safety-Gym Benchmark Suite
  28. 28 Which priors to choose? → PAC-Bayesian Meta Learning [Rothfuss, Fortuin, Josifoski, K, ICML 2021]
  29. 29 Experiments - Predictive accuracy (Regression)
  30. 30 Meta-Learned Priors for Bayesian Optimization
  31. 31 Meta-Learned Priors for Sequential Decision Making
  32. 32 Safe and efficient exploration in real-world RL
  33. 33 Acknowledgments

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.