Explore best practices and lessons learned for thriving with Kubernetes on-call in this panel discussion featuring engineers from Airbnb, Lyft, Netflix, and Robinhood. Gain insights into managing sustainable on-call rotations for critical Kubernetes infrastructure in large, public companies. Discover strategies for balancing rapid response with alert fatigue, proactively addressing production issues, and preparing engineers for on-call duties. Learn how to keep on-call engineers happy while maintaining high uptime for business-critical workloads in complex environments with constant changes and traffic fluctuations.
Thriving With Kubernetes On-Call - Best Practices and Lessons Learned
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Syllabus
Thriving With Kubernetes On... - Sunil Shah & Ramya Krishnan, Ashley Cutalo, Madhu C.S., Fabio Kung
Taught by
CNCF [Cloud Native Computing Foundation]