Overview
Syllabus
Intro
Premeditation of evils
Interpretability hype.
Well... we've been here before...
What this talk is about
What this talk is NOT about
We need interpretability to increase user trust.
We need to understand every single bit of the model.
Agenda: Many opportunities to fail.
We must first establish a universal mathematical definition of interpretability.
The performance and interpretability trade-off is inevitable.
How I present the explanations doesn't
Since there is no good way to evaluate interpretability methods, I can only show you qualitative results.
I am a computer scientist! Running human
The explanation is always true; it is what the model thinks.
I'm just a researcher who provides technical tools. The real-world usage is something I cannot control.
Taught by
Simons Institute