TransGAN - Two Transformers Can Make One Strong GAN - Machine Learning Research Paper Explained

Overview

Explore a comprehensive video explanation of the machine learning research paper "TransGAN: Two Transformers Can Make One Strong GAN." The paper uses transformer-based architectures for both the generator and the discriminator of a Generative Adversarial Network (GAN). Learn about the three techniques it relies on: data augmentation with DiffAugment, super-resolution co-training, and locality-aware initialization of self-attention. See how TransGAN achieves performance competitive with convolutional GANs on several datasets, and what that suggests about the potential of transformer-based GANs in computer vision.

Syllabus

- Introduction & Overview
- Discriminator Architecture
- Generator Architecture
- Upsampling with PixelShuffle
- Architecture Recap
- Vanilla TransGAN Results
- Trick 1: Data Augmentation with DiffAugment
- Trick 2: Super-Resolution Co-Training
- Trick 3: Locality-Aware Initialization for Self-Attention
- Scaling Up & Experimental Results
- Recap & Conclusion

Taught by

Yannic Kilcher

Related Courses

Advanced Generative Adversarial Networks (GANs)

DCGAN Implementation From Scratch

XCiT - Cross-Covariance Image Transformers - Facebook AI Machine Learning Research Paper Explained

VQ-GAN - Taming Transformers for High-Resolution Image Synthesis - Paper Explained

Efficient Geometry-Aware 3D Generative Adversarial Networks - GAN Paper Explained
