Neural Nets for NLP - Minimum Risk Training and Reinforcement Learning

Graham Neubig via YouTube

Adding a Baseline

15 of 21

Classroom Contents

  1. Intro
  2. Problem 1: Exposure Bias
  3. Problem 2: Disregard to Evaluation Metrics
  4. Error
  5. Problem: Argmax is Non-differentiable
  6. Sampling for Risk
  7. Adding Temperature
  8. What is Reinforcement Learning?
  9. Why Reinforcement Learning in NLP?
  10. Supervised MLE
  11. Self Training
  12. Policy Gradient/REINFORCE
  13. Credit Assignment for Rewards
  14. Problems w/ Reinforcement Learning
  15. Adding a Baseline
  16. Calculating Baselines
  17. Increasing Batch Size
  18. Warm-start
  19. When to Use Reinforcement Learning?
  20. Action-Value Function
  21. Estimating Value Functions