Completed
Case study: GPT-2
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
When Machine Learning Isn't Private
Automatically move to the next video in the Classroom when playback concludes
- 1 THE ADVANCED COMPUTING SYSTEMS ASSOCIATION
- 2 Do models leak training data?
- 3 Act I: Extracting Training Data
- 4 A New Attack: : Training Data Extraction
- 5 1. Generate a lot of data 2. Predict membership
- 6 Evaluation
- 7 Up to 5% of the output of language models is verbatim copied from the training dataset
- 8 Case study: GPT-2
- 9 Act II: Ad-hoc privacy isn't
- 10 Act III: Whatever can we do?
- 11 3. Use differential privacy
- 12 Questions?