Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a comprehensive video lecture on Autoregressive Diffusion Models (ARDMs), a novel class of generative models that combines autoregressive and diffusion approaches. Delve into the key concepts, including order-agnostic autoregressive models, discrete diffusion, and their applications in text and image generation. Learn about the efficient training objective, parallel generation capabilities, and the model's adaptability to various generation budgets. Discover how ARDMs outperform discrete diffusion models with fewer steps and their unique suitability for lossless compression tasks. Gain insights into the model's architecture, sampling techniques, and extensions for parallel sampling and depth upscaling. Understand the implications of this research for machine learning and its potential impact on generative modeling and data compression.
Syllabus
- Intro & Overview
- Decoding Order in Autoregressive Models
- Autoregressive Diffusion Models
- Dependent and Independent Sampling
- Application to Character-Level Language Models
- How Sampling & Training Works
- Extension 1: Parallel Sampling
- Extension 2: Depth Upscaling
- Conclusion & Comments
Taught by
Yannic Kilcher