Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Congestion Management in Ethernet-Based Networks for AI Cluster Fabric

Open Compute Project via YouTube

Overview

Learn about managing network congestion in AI cluster fabrics through this 22-minute technical presentation from Edgecore Networks and Drivenets experts. Explore the unique challenges of low-entropy, high-density traffic patterns and elephant flows in AI infrastructures. Discover two key methods for congestion control and avoidance through endpoint and fabric scheduling. Examine how different parameters like DLB, PFC, and ECN impact job completion time and overall throughput in AI workloads. Master techniques for minimizing network drops, latency, and jitter while maximizing performance over lossless Ethernet fabrics.

Syllabus

Congestion Management in an Ethernet based network for AI Cluster Fabric

Taught by

Open Compute Project

Reviews

Start your review of Congestion Management in Ethernet-Based Networks for AI Cluster Fabric

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.