Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Vision Transformer - Lecture 16

MIT HAN Lab via YouTube

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the fundamentals of Vision Transformers in this MIT graduate-level lecture delivered by Professor Song Han as part of the EfficientML.ai series (MIT 6.5940, Fall 2024). Delve into the architectural principles, mechanisms, and applications of Vision Transformers in computer vision tasks. Learn how these transformers adapt the successful natural language processing transformer architecture for visual data processing, understanding their key components, operational workflow, and performance characteristics. Gain insights into how Vision Transformers have revolutionized the field of computer vision by offering an alternative to traditional convolutional neural networks.

Syllabus

EfficientML.ai Lecture 16 - Vision Transformer (MIT 6.5940, Fall 2024)

Taught by

MIT HAN Lab

Reviews

Start your review of Vision Transformer - Lecture 16

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.