Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Quantization in Neural Networks - Lecture 5

MIT HAN Lab via YouTube

Overview

Dive into the world of neural network quantization in this comprehensive lecture from MIT's TinyML and Efficient Deep Learning Computing course. Explore numeric data types in modern computing systems and gain insights into K-means-based quantization and linear quantization techniques. Learn how to optimize deep learning models for resource-constrained devices, enabling powerful AI applications on mobile and IoT platforms. Discover strategies for efficient inference, including model compression, pruning, and neural architecture search. Gain hands-on experience implementing deep learning applications on microcontrollers, mobile phones, and quantum machines through an open-ended design project focused on mobile AI.

Syllabus

Lecture 05 - Quantization (Part I) | MIT 6.S965

Taught by

MIT HAN Lab

Reviews

Start your review of Quantization in Neural Networks - Lecture 5

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.