Towards Falsifiable Interpretability Research in Machine Learning - Lecture

Towards Falsifiable Interpretability Research in Machine Learning - Lecture

Bolei Zhou via YouTube Direct link

Key takeaways

23 of 23

23 of 23

Key takeaways

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Towards Falsifiable Interpretability Research in Machine Learning - Lecture

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Introduction
  2. 2 Outline
  3. 3 Obstacles
  4. 4 Misdirection of Saliency
  5. 5 What is Saliency
  6. 6 Saliency axioms
  7. 7 Input invariants
  8. 8 Model parameter randomization
  9. 9 Does silencing help humans
  10. 10 Takeaways
  11. 11 Case Study 2
  12. 12 Individual neurons
  13. 13 Activation maximization
  14. 14 Populations
  15. 15 Selective units
  16. 16 Ablating selective units
  17. 17 Posthoc studies
  18. 18 Regularizing selectivity
  19. 19 Ingenerative models
  20. 20 Summary
  21. 21 Building better hypothesis hypotheses
  22. 22 Building a stronger hypothesis
  23. 23 Key takeaways

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.