Overview
Syllabus
Intro
How do we generate adversarial examples?
Threat Models
A threat model is a formal statement defining when a system is intended to be secure.
This talk: non-certified defenses
For example: adversarial training
How complete are evaluations?
Case Study: ICLR 2018
Broken Defenses Correct Defenses
Lessons Learned from Evaluating the Robustness of Defenses to Adversarial Examples
Disentangling true robustness from apparent robustness is nontrivial
Lessons (2 of 2) performing better evaluations
To understand adversarial examples, repeatedly attack and defend, optimizing for lessons learned.
Taught by
Simons Institute