Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Big Bird: Transformers for Longer Sequences
- 1 - Intro & Overview
- 2 - Quadratic Memory in Full Attention
- 3 - Architecture Overview
- 4 - Random Attention
- 5 - Window Attention
- 6 - Global Attention
- 7 - Architecture Summary
- 8 - Theoretical Result
- 9 - Experimental Parameters
- 10 - Structured Block Computations
- 11 - Recap
- 12 - Experimental Results
- 13 - Conclusion