Neural Architecture Search for Efficient Deep Learning - Lecture 9

Overview

Explore the third part of a lecture series on Neural Architecture Search in this comprehensive video from MIT's 6.S965 course on TinyML and Efficient Deep Learning Computing. Delve into advanced techniques for deploying neural networks on resource-constrained devices such as mobile phones and IoT devices. Learn about efficient inference methods, including model compression, pruning, quantization, and neural architecture search. Discover strategies for efficient training, like gradient compression and on-device transfer learning. Gain insights into application-specific model optimization for videos, point clouds, and NLP. Understand the principles of efficient quantum machine learning. Get hands-on experience implementing deep learning applications on microcontrollers, mobile devices, and quantum machines through an open-ended design project focused on mobile AI. Taught by Professor Song Han, this lecture is part of a series that equips students with the knowledge to overcome challenges in deploying and training neural networks on resource-limited devices.

Syllabus

Lecture 09 - Neural Architecture Search (Part III) | MIT 6.S965

Taught by

MIT HAN Lab

Reviews

Start your review of Neural Architecture Search for Efficient Deep Learning - Lecture 9

Taught by

TinyML and Efficient Deep Learning Computing - Lecture 24: Course Summary

Neural Architecture Search (Part II) - Lecture 8

Neural Architecture Search (Part II) - Lecture 8

TinyEngine - Efficient Training and Inference on Microcontrollers - Lecture 17

TinyEngine - Efficient Training and Inference on Microcontrollers - Lecture 17

Efficient Video Understanding and Generative Models - Lecture 19

Never Stop Learning.