Learn about dynamic GPU scaling for container applications in this conference talk on Composable Disaggregated Infrastructure (CDI) for the AI era. Discover how to meet the growing computational demands of AI and ML workloads in Kubernetes environments while maintaining energy efficiency. Explore the CDI server architecture, which enables on-demand resource allocation by composing devices such as compute, memory, storage, and GPUs over a PCIe or CXL switch fabric. Understand the implementation of CDI operators, Custom Resource Definitions, and advanced vertical and horizontal cluster auto-scaling. See demonstrations of dynamically attaching and detaching devices on nodes using Dynamic Resource Allocation (DRA), a practical approach to balancing high performance with sustainable power consumption in modern container environments.
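For readers unfamiliar with DRA, the on-demand GPU attachment described above can be sketched as a pair of Kubernetes manifests. This is an illustrative sketch only, not material from the talk: the DRA API has changed across Kubernetes releases (the shape below follows `resource.k8s.io/v1beta1`), and the device class name `gpu.example.com`, pod name, and image are assumed placeholders.

```yaml
# Hypothetical sketch: a ResourceClaimTemplate requests a GPU from a DRA driver,
# and a pod references that claim. With a CDI-backed driver, the composed GPU
# would be attached to the node when the claim is allocated and detached when
# the claim is released. All names below are illustrative.
apiVersion: resource.k8s.io/v1beta1
kind: ResourceClaimTemplate
metadata:
  name: gpu-claim-template
spec:
  spec:
    devices:
      requests:
      - name: gpu
        deviceClassName: gpu.example.com   # device class published by the DRA driver (assumed name)
---
apiVersion: v1
kind: Pod
metadata:
  name: ai-worker
spec:
  containers:
  - name: trainer
    image: example/trainer:latest          # placeholder image
    resources:
      claims:
      - name: gpu                          # consume the claim declared below
  resourceClaims:
  - name: gpu
    resourceClaimTemplateName: gpu-claim-template
```

The key contrast with classic device plugins is that the claim is a first-class API object: the scheduler and driver negotiate allocation per claim, which is what makes dynamic attach/detach of composed devices possible.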
Overview
Syllabus
Dynamic Scaling of GPUs for Container Apps with Composable Disaggregated... - Jin Hase & Lei Zhang
Taught by
Linux Foundation