How to Fail Interpretability Research

How to Fail Interpretability Research

Simons Institute via YouTube Direct link

How I present the explanations doesn't

12 of 16

12 of 16

How I present the explanations doesn't

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

How to Fail Interpretability Research

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 Premeditation of evils
  3. 3 Interpretability hype.
  4. 4 well... we've been here before...
  5. 5 What this talk is about
  6. 6 What this talk is NOT about
  7. 7 We need interpretability to increase user trust.
  8. 8 We need to understand every single bit of the model.
  9. 9 Agenda Many opportunities to fail.
  10. 10 We first must define a universal mathematical definition of interpretability
  11. 11 The performance and interpretability trade-off is inevitable.
  12. 12 How I present the explanations doesn't
  13. 13 Since there is no good way to evaluate interpretability methods, I can only show you qualitative results
  14. 14 I am a computer scientist! Running human
  15. 15 The explanation is always true; It is what the model thinks.
  16. 16 I'm just a researcher who provide technical tools. The real world usage is something I cannot control.

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.