OpenAI DALL·E - Creating Images from Text - Blog Post Explained

OpenAI DALL·E - Creating Images from Text - Blog Post Explained

Yannic Kilcher via YouTube Direct link

- My Hypothesis about DALL·E's inner workings

10 of 28

10 of 28

- My Hypothesis about DALL·E's inner workings

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

OpenAI DALL·E - Creating Images from Text - Blog Post Explained

Automatically move to the next video in the Classroom when playback concludes

  1. 1 - Introduction
  2. 2 - Overview
  3. 3 - Dataset
  4. 4 - Comparison to GPT-3
  5. 5 - Model Architecture
  6. 6 - VQ-VAE
  7. 7 - Combining VQ-VAE with GPT-3
  8. 8 - Pre-Training with Relaxation
  9. 9 - Experimental Results
  10. 10 - My Hypothesis about DALL·E's inner workings
  11. 11 - Sparse Attention Patterns
  12. 12 - DALL·E can't count
  13. 13 - DALL·E can't global order
  14. 14 - DALL·E renders different views
  15. 15 - DALL·E is very good at texture
  16. 16 - DALL·E can complete a bust
  17. 17 - DALL·E can do some reflections, but not others
  18. 18 - DALL·E can do cross-sections of some objects
  19. 19 - DALL·E is amazing at style
  20. 20 - DALL·E can generate logos
  21. 21 - DALL·E can generate bedrooms
  22. 22 - DALL·E can combine unusual concepts
  23. 23 - DALL·E can generate illustrations
  24. 24 - DALL·E sometimes understands complicated prompts
  25. 25 - DALL·E can pass part of an IQ test
  26. 26 - DALL·E probably does not have geographical / temporal knowledge
  27. 27 - Reranking dramatically improves quality
  28. 28 - Conclusions & Comments

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.