Neural Nets for NLP 2019 - Reinforcement Learning

Graham Neubig via YouTube

Classroom Contents

  1. Intro
  2. What is Reinforcement Learning?
  3. Why Reinforcement Learning in NLP?
  4. Supervised Learning
  5. Self Training
  6. Policy Gradient/REINFORCE
  7. Credit Assignment for Rewards
  8. Problems w/ Reinforcement Learning
  9. Adding a Baseline
  10. Calculating Baselines
  11. Increasing Batch Size
  12. When to Use Reinforcement Learning?
  13. Policy-based vs. Value-based
  14. Action-Value Function: given a state s, we try to estimate the value of each action a
  15. Estimating Value Functions
  16. Exploration vs. Exploitation
  17. RL in Dialog
  18. RL for Information Retrieval
