Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Cloud-Native Apache Spark Scheduling with YuniKorn on Kubernetes

Databricks via YouTube

Overview

Explore cloud-native Apache Spark scheduling using YuniKorn Scheduler in this 36-minute conference talk from Databricks. Dive into the architecture of cloud-native infrastructure and learn how YuniKorn, an open-source resource scheduler, redefines resource scheduling in the cloud. Discover how to manage quotas, resource sharing, and auto-scaling for efficient scheduling of large-scale Spark jobs on Kubernetes. Gain insights into Lyft and Cloudera's experiences with next-generation cloud-native infrastructure, and understand the challenges and solutions for running Spark on Kubernetes. Learn about YuniKorn's advantages over default schedulers, including job ordering, resource quota management, and fairness in queue allocation. Compare YuniKorn with other Kubernetes schedulers and explore its management console. Get an overview of YuniKorn's current status, community involvement, roadmap, and vision for resource management in big data and machine learning environments.

Syllabus

Intro
Role of Kos in Lyft's Data Landscape
Multi-step creation for a Spark KBs job
Problems of existing Spark K8s infrastructure Complexity of layers of custom KÃ¥s controllers to handle the scale of the
Why we need a customized K8s Scheduler
Flavors of Running Spark on KBS
Resource Scheduling in K8s
Spark on K8s: the scheduling challenges
Apache Yunikorn (Incubating)
Resource Scheduling in Yunikorn land compare w/default scheduler
Main difference (Yunikorn v.s Default Scheduler)
Run Spark with Yunikorn
Job Ordering
Resource Quota Management: K8s Namespace ResourceQuota
Resource Quota Management: Yunikorn Queue Capacity
Resource Fairness in Yunikorn Queues
Scheduler Throughput Benchmark
Fully K8s Compatible
Yunikorn Management Console
Compare Yunikorn with other K8s schedulers
Current Status
The Community
Roadmap
Our Vision - Resource Mgmt for Big Data & ML

Taught by

Databricks

Reviews

Start your review of Cloud-Native Apache Spark Scheduling with YuniKorn on Kubernetes

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.