Completed
- Query-Key element-wise multiplication
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Fastformer - Additive Attention Can Be All You Need
Automatically move to the next video in the Classroom when playback concludes
- 1 - Intro & Outline
- 2 - Fastformer description
- 3 - Baseline: Classic Attention
- 4 - Fastformer architecture
- 5 - Additive Attention
- 6 - Query-Key element-wise multiplication
- 7 - Redundant modules in Fastformer
- 8 - Problems with the architecture
- 9 - Is this even attention?
- 10 - Experimental Results
- 11 - Conclusion & Comments