Completed
Intro
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Do Vision Transformers See Like Convolutional Neural Networks - Paper Explained
Automatically move to the next video in the Classroom when playback concludes
- 1 Intro
- 2 Contrasting features in ViTs vs CNNs
- 3 Global vs Local receptive fields
- 4 Data matters, mr. obvious
- 5 Contrasting receptive fields
- 6 Data flow through CLS vs spatial tokens
- 7 Skip connections matter a lot in ViTs
- 8 Spatial information is preserved in ViTs
- 9 Features evolution with the amount of data
- 10 Outro