Fastformer - Additive Attention Can Be All You Need

Yannic Kilcher via YouTube

YouTube videos curated by Class Central.

Classroom Contents

  1. Intro & Outline
  2. Fastformer description
  3. Baseline: Classic Attention
  4. Fastformer architecture
  5. Additive Attention
  6. Query-Key element-wise multiplication
  7. Redundant modules in Fastformer
  8. Problems with the architecture
  9. Is this even attention?
  10. Experimental Results
  11. Conclusion & Comments
