Unlocking Heterogeneous AI Infrastructure K8s Cluster - Leveraging the Power of HAMi

CNCF [Cloud Native Computing Foundation] via YouTube

Overview

Explore the challenges of managing heterogeneous AI infrastructure in Kubernetes clusters, and the solutions available, in this conference talk. Dive into the HAMi project, designed to address the complexity of integrating AI devices from different vendors such as NVIDIA, Intel, and Huawei Ascend. Learn about unified scheduling, observability, and strategies for improving the utilization of expensive AI hardware. Discover techniques for sharing GPUs, ensuring QoS for high-priority tasks, and implementing flexible scheduling policies. Gain insights from real-world case studies and see how HAMi integrates with other projects such as Volcano and scheduler-plugin. Understand the current challenges and future roadmap for heterogeneous AI device management in Kubernetes environments.

Syllabus

Unlocking Heterogeneous AI Infrastructure K8s Cluster: Leveraging the... - Xiao Zhang & Mengxuan Li

Taught by

CNCF [Cloud Native Computing Foundation]
