Inside TensorFlow - TF Model Optimization Toolkit - Quantization and Pruning

TensorFlow via YouTube

Overview

Dive into an in-depth technical session on the TensorFlow Model Optimization Toolkit, focusing on quantization and pruning. Explore why quantization is hard, compare quantization during training with quantization after training, and understand the benefits of integer and hybrid quantization. Discover the pruning tools, how they are implemented, and practical examples of their use. Gain insights into quantization kernels, the quantization spec, and the quantization API, as well as the accuracy and performance benefits of these optimization techniques. Revisit matrix multiplication in the context of model optimization. The session is presented by TensorFlow Software Engineer Suharsh Sivakumar and runs 43 minutes.

Syllabus

Introduction
Overview
Why does this matter
Quantization is hard
Quantization during training
Quantization after training
Pruning
Quantization kernels
Quantization spec
Symmetry
Per-channel quantization
Quantization tools
Integer Quantization
Hybrid Quantization
Post-training Integer Quantization
Quantization Accuracy
Quantization Benefits
Summary
Pruning Tools
Quantization API
Pruning Example
Pruning Summary
Matrix Multiplication

Taught by

TensorFlow
