Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Stable Diffusion and Friends - High-Resolution Image Synthesis via Two-Stage Generative Models

HuggingFace via YouTube

Overview

Explore the evolution of generative image models in this insightful talk by Robin Rombach, co-creator of Stable Diffusion. Delve into the progression from GANs to Transformers and latent Diffusion models, gaining a comprehensive understanding of high-resolution image synthesis techniques. Learn about two-stage generative models, the QCVAE architecture, Vision Transformers, and the groundbreaking Stable Diffusion model. Discover applications in text-to-image generation, semantic synthesis, upscaling, and creative endeavors like text-to-color palette conversion and video stylization. Gain valuable insights from Rombach's extensive research experience and his pivotal role in developing widely-used projects such as VQGAN, Taming Transformers, and Latent Diffusion Models.

Syllabus

Introduction
Diffusion
TwoStage Generative Models
Leon Model
Why domain knowledge
QCVAE architecture
QCVAE reconstruction
VisionTransformers
VQan
HighResolution Image Synthesis
Text to Image Generation
Stable Diffusion
Classifier Free Diffusion Guidance
Stereo Fusion in Painting
Semantic Synthesis
Upscaling
SBEdit
Diffusion Model
Creative Applications
Text to Color Palette
Video stylization
Lexi Carlile
Credits
Questions
One Direction
Adding Numerology
Conclusion

Taught by

Hugging Face

Reviews

Start your review of Stable Diffusion and Friends - High-Resolution Image Synthesis via Two-Stage Generative Models

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.