
YouTube

SGD and Weight Decay Secretly Compress Your Neural Network

MITCBMM via YouTube

Overview

Explore how stochastic gradient descent (SGD) and weight decay implicitly compress neural networks in this 55-minute conference talk by Tomer Galanti of MIT. The talk examines the mechanisms behind this hidden compression effect, offering a deeper understanding of how these widely used optimization methods shape the efficiency and performance of deep learning models.
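The talk itself does not include code; as a rough illustration of the claim, the sketch below (PyTorch, with all hyperparameters and the toy task chosen for illustration only) trains a small network with SGD plus weight decay and then counts how many singular values of a learned weight matrix remain non-negligible, a crude proxy for the effective rank that the talk argues these methods drive down.

```python
# Minimal sketch, not from the talk: SGD + weight decay on a toy task,
# followed by a look at the singular-value spectrum of a weight matrix.
import torch
import torch.nn as nn

torch.manual_seed(0)

# Placeholder data: random inputs with random binary labels.
X = torch.randn(512, 64)
y = torch.randint(0, 2, (512,))

model = nn.Sequential(nn.Linear(64, 256), nn.ReLU(), nn.Linear(256, 2))

# weight_decay adds an L2 penalty on the weights; combined with SGD,
# the talk's claim is that this biases weight matrices toward low rank.
opt = torch.optim.SGD(model.parameters(), lr=0.05, weight_decay=5e-3)
loss_fn = nn.CrossEntropyLoss()

for step in range(2000):
    idx = torch.randint(0, 512, (64,))  # mini-batch sampling makes this SGD
    opt.zero_grad()
    loss = loss_fn(model(X[idx]), y[idx])
    loss.backward()
    opt.step()

# Effective-rank proxy: singular values above 1% of the largest one.
W = model[0].weight.detach()
s = torch.linalg.svdvals(W)  # returned in descending order
eff_rank = (s > 0.01 * s[0]).sum().item()
print(f"singular values kept: {eff_rank} / {min(W.shape)}")
```

With a large enough weight decay, the kept count typically falls well below the full dimension, which is the "secret compression" the title refers to; the threshold of 1% is an arbitrary cutoff used here only to summarize the spectrum.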

Syllabus

SGD and Weight Decay Secretly Compress Your Neural Network

Taught by

MITCBMM

