VQ-GAN - Taming Transformers for High-Resolution Image Synthesis - Paper Explained
Aleksa Gordić - The AI Epiphany via YouTube
Overview
Syllabus
Intro
A high-level VQ-GAN overview
Perceptual loss
Patch-based adversarial loss
Sequence prediction via GPT
Generating high-res images
Loss explained in depth
Training the transformer
Conditioning transformer
Comparisons and results
Sampling strategies
Comparisons and results continued
Rejection sampling with ResNet or CLIP
Receptive field effects
Comparisons with DALL-E
Taught by
Aleksa Gordić - The AI Epiphany