AudioGen: Textually Guided Audio Generation - Paper Explained

Aleksa Gordić - The AI Epiphany via YouTube

Classroom Contents

  1. Intro
  2. Why is text-to-audio hard?
  3. Comparison with VQ-GAN
  4. Comparison with SoundStream
  5. AudioGen overview
  6. Deep dive: audio representation, LSTM
  7. Losses explained
  8. Complex-valued STFTs
  9. Audio Language Modeling
  10. Multi-stream audio inputs
  11. Data and augmentations
  12. Results
  13. Outro
