Learning to Summarize from Human Feedback

Yannic Kilcher via YouTube Direct link

- Understanding the Reward Model

10

of 11

10 of 11

- Understanding the Reward Model

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Learning to Summarize from Human Feedback