Completed
Intro
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Unlock Faster and More Efficient LLMs with SparseGPT - Neural Magic
Automatically move to the next video in the Classroom when playback concludes
- 1 Intro
- 2 Massive Deep Models are Great
- 3 The Neural Network Pruning Problem
- 4 The Mathematics of Compression
- 5 One-Shot Compression of GPT Models
- 6 The General Approach
- 7 Our Approach: Quantization Version
- 8 Experimental Validation
- 9 Combining Sparsity and Quantization
- 10 Exploiting with DeepSparse
- 11 Software Beats Hardware (continued)
- 12 Transforming the Pareto Frontier
- 13 Enabling Anyone to Run
- 14 Enabling Anyone to Sparsify
- 15 Questions