
Linux Foundation

Co-Location of CPU and GPU Workloads for High Resource Efficiency in Kubernetes

Linux Foundation via YouTube

Overview

Explore strategies for raising resource utilization in Kubernetes clusters by co-locating CPU and GPU workloads. Learn how Ant Financial and Alibaba achieved a 10% increase in utilization. The talk covers the creation of a new QoS class, the use of node-level cgroups for batch jobs, and the use of a PodGroup CRD for gang scheduling. Gain insights into building and managing a co-location cluster with more than 100 GPU nodes and 500 CPU nodes that combines long-running services with AI batch jobs. This 37-minute conference talk from the Linux Foundation shares experience and practices for improving resource efficiency in Kubernetes environments.
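To make the gang scheduling idea mentioned above more concrete, here is a minimal sketch of all-or-nothing placement: a group of pods is admitted only if every member fits at once, otherwise none are placed. This is an illustration, not the speakers' implementation or the actual PodGroup CRD controller. The `Node` and `PodGroup` classes, the `try_gang_schedule` function, and the first-fit placement rule are all assumptions made for this example; `min_member` is only loosely modelled on the idea of a minimum group size.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    # Hypothetical, simplified view of a cluster node.
    name: str
    free_gpus: int


@dataclass
class PodGroup:
    # Hypothetical stand-in for a gang of batch pods.
    name: str
    min_member: int    # number of pods that must start together
    gpus_per_pod: int  # GPUs requested by each pod


def try_gang_schedule(group: PodGroup, nodes: List[Node]) -> Optional[List[str]]:
    """Place every pod of the group, or none of them.

    Returns one node name per pod on success, or None if the whole
    group cannot fit. Node capacity is only updated on success.
    """
    # Work on a tentative copy so a failed attempt leaves nodes untouched.
    remaining = {n.name: n.free_gpus for n in nodes}
    placement: List[str] = []

    for _ in range(group.min_member):
        # First-fit: pick the first node with enough free GPUs (an assumption).
        target = next(
            (name for name, free in remaining.items()
             if free >= group.gpus_per_pod),
            None,
        )
        if target is None:
            return None  # one pod does not fit, so reject the whole gang
        remaining[target] -= group.gpus_per_pod
        placement.append(target)

    # Every pod fits: commit the tentative capacities.
    for n in nodes:
        n.free_gpus = remaining[n.name]
    return placement
```

For example, with two nodes holding 2 and 1 free GPUs, a group of three single-GPU pods is placed in full, while a group of four is rejected without consuming any capacity. The point of the all-or-nothing check is to avoid a partially started job holding GPUs while it waits for peers that can never be scheduled.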

Syllabus

Co-Location of CPU and GPU Workloads with High Resource Efficiency - Penghao Cen & Jian He

Taught by

Linux Foundation
