Do Vision Transformers See Like Convolutional Neural Networks - Paper Explained

Do Vision Transformers See Like Convolutional Neural Networks - Paper Explained

Aleksa Gordić - The AI Epiphany via YouTube Direct link

Spatial information is preserved in ViTs

8 of 10

8 of 10

Spatial information is preserved in ViTs

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Do Vision Transformers See Like Convolutional Neural Networks - Paper Explained

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 Contrasting features in ViTs vs CNNs
  3. 3 Global vs Local receptive fields
  4. 4 Data matters, mr. obvious
  5. 5 Contrasting receptive fields
  6. 6 Data flow through CLS vs spatial tokens
  7. 7 Skip connections matter a lot in ViTs
  8. 8 Spatial information is preserved in ViTs
  9. 9 Features evolution with the amount of data
  10. 10 Outro

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.