Make Kubernetes Networking Ready for World-Class AI and HPC Workloads
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Explore the cutting-edge developments in Kubernetes networking for high-performance computing (HPC) and artificial intelligence (AI) workloads in this informative conference talk. Delve into the challenges of adapting Kubernetes for advanced AI and HPC clusters, focusing on the need for multiple high-speed network interfaces. Learn about Multus and its role in enabling multi-networking features in Kubernetes, and discover the innovative Multi-NIC CNI project designed to democratize multiple interface capabilities. Gain insights into the architecture, use cases, and performance benefits of this new open-source solution, particularly for HPC and AI applications. Witness a demonstration of the CNI's capabilities on a large-scale GPU cluster with over 1400 GPUs and dual 100G network interfaces, showcasing its potential to revolutionize Kubernetes networking for world-class AI and HPC workloads.
Syllabus
Make Kubernetes Networking Ready for World Class AI and HPC ... Sunyanan Choochotkaew & Gaurav Singh
Taught by
CNCF [Cloud Native Computing Foundation]