On the Hardness of Reinforcement Learning With Value-Function Approximation

On the Hardness of Reinforcement Learning With Value-Function Approximation

Simons Institute via YouTube Direct link

Proving the conjecture: Attempt 1

15 of 18

15 of 18

Proving the conjecture: Attempt 1

Class Central Classrooms beta

YouTube playlists curated by Class Central.

Classroom Contents

On the Hardness of Reinforcement Learning With Value-Function Approximation

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 Reinforcement Learning (RL) Applications
  3. 3 Value-function Approximation
  4. 4 Comparison between SL and RL
  5. 5 Markov Decision Process (MDP)
  6. 6 Batch learning in MDPS
  7. 7 Example: Video game playing
  8. 8 Batch learning in large MDPS
  9. 9 Assumption on data (?)
  10. 10 Assumption on data & MDP dynamics
  11. 11 Algorithm for batch RL
  12. 12 How things go wrong (w/ restricted class)
  13. 13 Fix using a strong assumption ("completeness")
  14. 14 Realizability alone is insufficient?
  15. 15 Proving the conjecture: Attempt 1
  16. 16 Checklist for a plausible construction
  17. 17 Importance of the conjecture
  18. 18 Importance of the construction

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.