SparQ Attention: Bandwidth-Efficient LLM Inference

Unify via YouTube Direct link

We're very excited to welcome both Ivan Chelombiev and Luka Ribar from GraphCore. They will be presenting their work on SparQ Attention presentation starts at

1

of 1

1 of 1

We're very excited to welcome both Ivan Chelombiev and Luka Ribar from GraphCore. They will be presenting their work on SparQ Attention presentation starts at

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

SparQ Attention: Bandwidth-Efficient LLM Inference