Scalable MatMul-free Language Modeling - Paper Explained

Yannic Kilcher via YouTube

YouTube videos curated by Class Central.

Classroom Contents

  1. Intro
  2. MatMul is everywhere
  3. Ternary accumulation as a substitute for matrix multiplication
  4. Replacing attention layers with recurrent layers
  5. Replacing dense layers with ternary channel mixing
  6. Language modelling results & scaling laws
  7. Other experimental results
  8. Conclusion
