Guest Lecture by Tianyi Zhang: Faster & Cheaper LLMs with Weight and Key-value Cache Quantization
YouTube videos curated by Class Central.
Classroom Contents
Faster and Cheaper LLMs with Weight and Key-value Cache Quantization