A Deep Dive on Supporting Multi-Instance GPUs in Containers and Kubernetes
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Syllabus
Intro
GPUS AND KUBERNETES Seamlessly scale up training and inference to a cluster of GPU machines
WHAT ARE MULTI-INSTANCE GPUs? Slices of a full GPU with dedicated memory and compute resources
OUTLINE
MULTI-INSTANCE GPUs (MIG)
GPUS AND CONTAINERS The NVIDIA Container Toolkit
GPUS AND KUBERNETES Allocate GPUs to pods in a Kubernetes Cluster
MIG IN CONTAINERS AND KUBERNETES
SYSTEM LEVEL INTERFACE FOR MIG
CHALLENGES WITH MIG PARTITIONING How do I create a MG Device in the first place?
MIG PARTITION EDITOR
SUMMARY AND CONCLUSION
Taught by
CNCF [Cloud Native Computing Foundation]