Explore best practices for interference detection and resource isolation enhancement on Kubernetes in this informative conference talk. Learn how Kuaishou addressed the challenges of deploying a hybrid mix of latency-sensitive workloads and batch jobs on a container cloud platform. Discover how they implemented an interference observation and diagnosis system to quickly identify and troubleshoot issues. Gain insights into their approach for fine-grained control over CPU and memory resources on a per-service basis, effectively mitigating the impact of batch jobs on latency-sensitive workloads. Understand how these strategies enabled the platform to deploy more batch jobs while maintaining stability, ultimately improving overall resource utilization and reducing costs associated with adding extra servers.
Best Practice for Interference Detection and Resource Isolation Enhancement on Kubernetes
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Syllabus
Best Practice for Interference Detection and Resource Isolation Enhancement on... - Haogang Wang
Taught by
CNCF [Cloud Native Computing Foundation]