Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore methods for achieving high Service Level Objectives (SLOs) in large-scale Kubernetes clusters in this conference talk by Ant Group engineers. Learn about designing SLO architecture, implementing effective strategies, and maintaining high-quality pod delivery with improved success rates and reduced latency. Discover proper indicators for measuring Kubernetes cluster health, and gain insights into creating a tracing and analysis platform for collecting metrics and computing indicators. Understand how to diagnose problems in pod delivery processes and implement a self-healing system that leverages artificial experience to automatically fix known issues. This presentation offers valuable knowledge for managing complex, large-scale Kubernetes environments and optimizing their performance.