Stable Video Diffusion: Model Architecture and Training Pipeline

Stable Video Diffusion: Model Architecture and Training Pipeline

AI Bites via YouTube Direct link

- Motivation for Image Pretraining

5 of 19

5 of 19

- Motivation for Image Pretraining

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Stable Video Diffusion: Model Architecture and Training Pipeline

Automatically move to the next video in the Classroom when playback concludes

  1. 1 - Intro
  2. 2 - Model Architecture
  3. 3 - Training Stages
  4. 4 - Image Pretraining Stage
  5. 5 - Motivation for Image Pretraining
  6. 6 - Video Curation Stage
  7. 7 - Video data curation pipeline
  8. 8 - LVD Dataset
  9. 9 - Filtering Mechanisms
  10. 10 - Optical Flow
  11. 11 - Synthetic Captions
  12. 12 - OCR Detection
  13. 13 - LVD dataset summarised
  14. 14 - Ablation studies
  15. 15 - High quality fine-tuning
  16. 16 - Base Model
  17. 17 - Tex-to-video example
  18. 18 - Image-to-video example
  19. 19 - Conclusion

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.