Overview
Explore how Kubernetes Operators can simplify and automate AI infrastructure management in this 40-minute conference talk. Learn about the challenges of running ML applications on Kubernetes and discover how operators streamline cluster lifecycle management, hardware configuration, and deep learning model deployment. Watch a demonstration of an LLM fine-tuning workload that uses existing operators such as the GPU Operator and the Kubernetes AI Toolchain Operator. Gain insight into best practices for running operators in production environments and the challenges involved. Suitable for Kubernetes users looking to optimize how they set up and manage their AI infrastructure.
Syllabus
Simplify AI Infrastructure with Kubernetes Operators - Ganeshkumar Ashokavardhanan & Tariq Ibrahim
Taught by
Linux Foundation