Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

VQ-GAN - PyTorch Implementation

Outlier via YouTube

Overview

Dive into a comprehensive 38-minute video tutorial on implementing Vector Quantized Generative Adversarial Networks (VQGAN) using PyTorch. Explore the two-stage process of VQGAN, starting with an autoencoder-like approach for encoding images into a low-dimensional latent space and applying vector quantization using a codebook. Learn about the fully convolutional encoder and decoder, and discover how to train a transformer for the latent space to generate novel images. Follow along with detailed explanations of helper modules, encoder, decoder, codebook, discriminator, and LPIPS. Gain insights into the training process for both stages, examine results, and understand the implementation of GPT and VQGAN Transformer. Access additional resources for further reading on related topics such as VAE, VQVAE, CNNs, NonLocal NN, PatchGAN, and Hinge Loss.

Syllabus

Introduction
Helper modules
Encoder
Decoder
Codebook
VQGAN
Discriminator
LPIPS
Utils
Training: First Stage
Results: First Stage
Introducing Second Stage
GPT
VQGAN Transformer
Training: Second Stage
Results: Second Stage
Github Code & Outro

Taught by

Outlier

Reviews

Start your review of VQ-GAN - PyTorch Implementation

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.