Demystifying the Communication Characteristics for Distributed Transformer Models

HOTI - Hot Interconnects Symposium via YouTube

Overview

Learn about the communication patterns and characteristics of distributed transformer models in this 34-minute technical conference presentation from the HOTI Hot Interconnects Symposium. Researchers Quentin Anthony, Benjamin Michalowicz, Jacob Hatef, Lang Xu, Mustafa Abduljabbar, Aamir Shafi, Hari Subramoni, and Dhabaleswar Panda break down the communication requirements of running large-scale transformer architectures across distributed systems. Part of Technical Paper Session D, which focused on optimizing collective operations, the talk is chaired by Craig Stunkel of NVIDIA and covers the networking and communication challenges involved in training and deploying distributed transformer models.

Syllabus

Day 2 13:00: Demystifying the Communication Characteristics for Distributed Transformer Models

Taught by

HOTI - Hot Interconnects Symposium
