Overview
Syllabus
Intro
Applications of Deep Learning
Research Themes
Prior Work: Perturbation Approaches
Our Approach: Meaningful Perturbations
Our Approach: Extremal Perturbations
Interpretability
Foreground evidence is usually sufficient
Suppressing the background may overdrive the network
Adversarial Defense
Regularization to mitigate artifacts
Area Constraint
Smooth Masks
Comparison with Prior Work
Measure Performance on Weak Localization
Selectivity to Output Class
Sensitive to Model Parameters
Intermediate Activations
Spatial Attribution
Channel Attribution
Activation "Diffing"
# Concepts per Filter
# Filters per Concept
Self-Supervised Learning
Comparing Concept Embedding Spaces
Segmentation
Classification
Human-Guided Machine Learning
Future Work: Model Debugging
Taught by
Bolei Zhou