Can Wikipedia Help Offline Reinforcement Learning - Author Interview

Can Wikipedia Help Offline Reinforcement Learning - Author Interview

Yannic Kilcher via YouTube Direct link

- How are tokens distributed in Reinforcement Learning?

8 of 14

8 of 14

- How are tokens distributed in Reinforcement Learning?

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Can Wikipedia Help Offline Reinforcement Learning - Author Interview

Automatically move to the next video in the Classroom when playback concludes

  1. 1 - Intro
  2. 2 - Brief paper, setup & idea recap
  3. 3 - Main experimental results & high standard deviations
  4. 4 - Why is there no clear winner?
  5. 5 - Why are bigger models not a lot better?
  6. 6 - What’s behind the name ChibiT?
  7. 7 - Why is iGPT underperforming?
  8. 8 - How are tokens distributed in Reinforcement Learning?
  9. 9 - What other domains could have good properties to transfer?
  10. 10 - A deeper dive into the models' attention patterns
  11. 11 - Codebase, model sizes, and compute requirements
  12. 12 - Scaling behavior of pre-trained models
  13. 13 - What did not work out in this project?
  14. 14 - How can people get started and where to go next?

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.