Scalable MatMul-free Language Modeling - Paper Explained
- 1 - Intro
- 2 - MatMul is everywhere
- 3 - Ternary accumulation as a substitute for matrix multiplication
- 4 - Replacing attention layers with recurrent layers
- 5 - Replacing dense layers with ternary channel mixing
- 6 - Language modelling results & scaling laws
- 7 - Other experimental results
- 8 - Conclusion