Understanding Diffusion Transformers and the Technology Behind Sora

Oxen via YouTube

Overview

This technical video lecture breaks down OpenAI's Sora model and the Diffusion Transformer technology that powers it. It first covers the foundational concepts: diffusion models, the U-Net architecture, autoencoders, and latent diffusion models. It then examines the Diffusion Transformer architecture and its variations, the relationship between patch size and model size when scaling, and the Fréchet Inception Distance (FID) metric, followed by examples. The explanations are supported by academic papers, with links to additional resources including the Generative Deep Learning book, relevant research publications, and the Road to Sora reading list. Discord and community channels are provided for connecting with the AI community.

Syllabus

Road to Sora
Intro to Diffusion Transformer
What is a Diffusion Model?
What is a U-Net?
Auto Encoder
Latent Diffusion Models
Diffusion Transformer Architecture
Variations on the Diffusion Transformer
Scaling Patch vs. Model Size
FID
Examples

Taught by

Oxen
