Explore best practices for building resilient services on Kubernetes in this 26-minute conference talk by Todd Ekenstam and Anusha Ragunathan from Intuit Inc. Learn how to eliminate HTTP 5xx errors, improve application performance during rollouts, and avoid scheduled downtime for upgrades. Discover fault-tolerant strategies implemented by Intuit, which runs 7,000 applications across 315 Kubernetes clusters with strict SLAs and SLOs. Gain insights into handling Pod terminations due to deployments, scaling events, node rotations, and scheduler bin-packing. Apply these proven patterns and practices to enhance the resilience of your Kubernetes-based applications at scale.
Overview
Syllabus
Building Resilient Services on Kubernetes - Todd Ekenstam, Intuit & Anusha Ragunathan, Intuit Inc
Taught by
CNCF [Cloud Native Computing Foundation]