Distributed Training: Hybrid Parallelism and Gradient Optimization - Lecture 20

Overview

Learn advanced distributed training concepts in machine learning through a recorded MIT lecture that explores hybrid parallelism, auto-parallelization techniques, and strategies for overcoming bandwidth and latency bottlenecks. Dive deep into gradient compression methods including gradient pruning for sparse communication, deep gradient compression, and gradient quantization techniques like 1-Bit SGD and TernGrad. Master the implementation of delayed gradient updates while understanding their role in addressing latency challenges in distributed systems. Taught by Professor Song Han, this comprehensive lecture from MIT's 6.5940 course provides essential knowledge for optimizing large-scale machine learning training processes.

Syllabus

EfficientML.ai Lecture 20 - Distributed Training Part 2 (Zoom Recording) (MIT 6.5940, Fall 2024)

Taught by

MIT HAN Lab

Reviews

Start your review of Distributed Training: Hybrid Parallelism and Gradient Optimization - Lecture 20

Taught by

Distributed Training: Hybrid Parallelism and Gradient Optimization - Lecture 20

Distributed Training and Gradient Compression - Lecture 14

Distributed Training - Part I - Lecture 17

Distributed Training Methods and Parallelization Techniques - Lecture 19

Distributed Training and Gradient Compression - Lecture 14

Distributed Training Methods for Efficient Machine Learning - Part 1

10 Best Machine Learning Courses for 2024: Scikit-learn, TensorFlow, and more

Never Stop Learning.