GPU Configuration on the Fly Using Dynamic Resource Allocation - A Tale of Two Drivers
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn how NVIDIA's GeForceNow cloud gaming service leverages Kubernetes and Dynamic Resource Allocation (DRA) in this technical conference talk. Discover the intricate details of dynamically switching GPU drivers to enable both full GPU passthrough and GPU virtualization while maintaining optimal datacenter utilization. Explore the migration journey from traditional Kubernetes device plugin API to DRA, understanding the challenges faced and solutions implemented. Gain practical insights into optimizing GPU-accelerated workloads in cloud environments, with specific focus on managing Kubevirt VMs and GPU configurations. Master the best practices for implementing similar migrations in your own infrastructure while maintaining high performance and seamless user experience.
Syllabus
A Tale of 2 Drivers: GPU Configuration on the Fly Using DRA- Alay Patel & Varun Ramachandra Sekar US
Taught by
CNCF [Cloud Native Computing Foundation]