Understanding Medusa: A Framework for LLM Inference Acceleration with Multiple Decoding Heads

Oxen via YouTube Direct link

Verifying Candidates With Medusa

9

of 15

9 of 15

Verifying Candidates With Medusa

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Understanding Medusa: A Framework for LLM Inference Acceleration with Multiple Decoding Heads