How We Scale Up to 2,000 Nodes for Batch Jobs Using Cluster Autoscaler
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Explore how ByteDance scales up to 2,000 nodes for batch jobs using cluster autoscaler in this informative conference talk. Learn about the challenges of managing batch jobs in a cloud-native environment and discover solutions for efficiently scaling Kubernetes clusters. Gain insights into optimizing pod creation, addressing scaling issues, and improving node deletion processes. Understand the unique requirements of batch jobs compared to microservices and how to leverage cloud elasticity effectively. Discover practical strategies for handling large-scale pod creation and deletion, managing cluster resources, and reducing costs in production environments.
Syllabus
How We Scale up to 2k Nodes for Batch Jobs Using Cluster Autoscaler - Lei Qian, ByteDance
Taught by
CNCF [Cloud Native Computing Foundation]