BigScience BLOOM - 3D Parallelism Explained - Large Language Models - ML Coding Series

Aleksa Gordić - The AI Epiphany via YouTube

Overview

Dive into the fourth video of the Large Language Model series, exploring the BigScience BLOOM model codebase with a focus on understanding 3D parallelism. Learn about pipeline parallelism, model parallelism, and data parallelism - the engineering concepts behind recent scaling efforts and machine learning successes. Follow along as the video walks through the eval script, model construction, sharding techniques, and the forward pass. Gain insights into embedding table sharding, transformer layer sharding, attention layer sharding, and the intricacies of ColumnParallel and RowParallel sharding. Understand how dataset building relates to data parallelism and explore pipeline parallelism communication. Conclude with a comprehensive recap of the 3D parallelism concepts covered in this in-depth, 72-minute tutorial.
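As a rough illustration of the ColumnParallel and RowParallel sharding mentioned above, the sketch below splits a two-layer MLP across simulated shards with NumPy and checks that the result matches the unsharded computation. It is not taken from the BLOOM codebase: the function names, the ReLU activation, the tensor shapes, and the use of plain Python lists in place of GPUs are all assumptions made for the example, and the final sum stands in for the all-reduce a real distributed setup would perform.

```python
import numpy as np


def column_parallel_linear(x, weight, n_shards):
    """Split the weight along its output (column) axis.

    Each simulated shard sees the full input and produces one slice of the output.
    """
    weight_shards = np.split(weight, n_shards, axis=1)
    return [x @ w for w in weight_shards]


def row_parallel_linear(x_parts, weight, n_shards):
    """Split the weight along its input (row) axis.

    Each simulated shard consumes the matching slice of the input and produces
    a partial result; summing the partials plays the role of an all-reduce.
    """
    weight_shards = np.split(weight, n_shards, axis=0)
    partials = [xp @ w for xp, w in zip(x_parts, weight_shards)]
    return sum(partials)


def mlp_reference(x, w1, w2):
    """Unsharded two-layer MLP with a ReLU in between (ReLU is an assumption)."""
    return np.maximum(x @ w1, 0.0) @ w2


def mlp_sharded(x, w1, w2, n_shards=2):
    """Column-parallel first layer followed by a row-parallel second layer."""
    hidden_parts = column_parallel_linear(x, w1, n_shards)
    # The activation is element-wise, so each shard can apply it locally
    # without communicating with the others.
    hidden_parts = [np.maximum(h, 0.0) for h in hidden_parts]
    return row_parallel_linear(hidden_parts, w2, n_shards)


# Hypothetical toy shapes: batch of 4, model width 8, hidden width 16.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))
w1 = rng.standard_normal((8, 16))
w2 = rng.standard_normal((16, 8))

matches = np.allclose(mlp_sharded(x, w1, w2, n_shards=2), mlp_reference(x, w1, w2))
print("sharded output matches reference:", matches)
```

Pairing a column split with a row split means the hidden activations never have to be gathered between the two layers; in this sketch the only cross-shard step is the final sum.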

Syllabus

Intro - focusing on the 3D parallelism!
Quick setup
Stepping through the eval script
3D parallelism - model construction
Sharding the embedding table - model parallelism
Sharding the transformer layer
LayerNorm fused kernels
Sharding the attention layer
ColumnParallel and RowParallel sharding
Synchronizing input and output embedding tables
Building the dataset - data parallelism
3D parallelism - forward pass
Pipeline parallelism communication
Pass through the sharded embedding table
Pass through the sharded transformer layer
Sharded logit and cross-entropy computation
Recap
Outro

Taught by

Aleksa Gordić - The AI Epiphany
