Leveraging DRA for Conquering ML Workload Challenges on Kubernetes
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Explore how to leverage Dynamic Resource Allocation (DRA) to overcome machine learning workload challenges on Kubernetes in this informative conference talk. Discover the limitations of traditional GPU resource management and learn how DRA addresses these issues by introducing controlled GPU sharing, support for multiple GPU models, and custom constraints. Gain insights into optimizing resource utilization and meeting the dynamic needs of modern ML workload deployment. Dive deep into practical examples of ML workload deployment on Kubernetes using GPU resource drivers for DRA, enabling dynamic allocation and scheduling of GPUs.
Syllabus
Leveraging DRA for Conquering ML Workload Challenges on... Shivay Lamba & Priyanshu Raj Shrivastava
Taught by
CNCF [Cloud Native Computing Foundation]