Classroom Contents
OpenAI DALL·E - Creating Images from Text - Blog Post Explained
- 1 - Introduction
- 2 - Overview
- 3 - Dataset
- 4 - Comparison to GPT-3
- 5 - Model Architecture
- 6 - VQ-VAE
- 7 - Combining VQ-VAE with GPT-3
- 8 - Pre-Training with Relaxation
- 9 - Experimental Results
- 10 - My Hypothesis about DALL·E's inner workings
- 11 - Sparse Attention Patterns
- 12 - DALL·E can't count
- 13 - DALL·E can't global order
- 14 - DALL·E renders different views
- 15 - DALL·E is very good at texture
- 16 - DALL·E can complete a bust
- 17 - DALL·E can do some reflections, but not others
- 18 - DALL·E can do cross-sections of some objects
- 19 - DALL·E is amazing at style
- 20 - DALL·E can generate logos
- 21 - DALL·E can generate bedrooms
- 22 - DALL·E can combine unusual concepts
- 23 - DALL·E can generate illustrations
- 24 - DALL·E sometimes understands complicated prompts
- 25 - DALL·E can pass part of an IQ test
- 26 - DALL·E probably does not have geographical / temporal knowledge
- 27 - Reranking dramatically improves quality
- 28 - Conclusions & Comments