Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Apache Spark vs Databricks - On-Premise Performance Comparison

Databricks via YouTube

Overview

Explore an on-premise comparison of Databricks and Open-Source Apache Spark in this 21-minute conference talk by Booz Allen's Cyber AI team. Discover how they achieved 10x performance gains on real-world cyber workloads using Databricks Runtime Environment in an air-gapped environment. Learn about the challenges and solutions for implementing data science in sensitive operations, including the service-oriented architecture for capability deployment and high-performance computing project architecture. Gain insights into the results of Spark Open Source vs Spark DBR and valuable lessons for future on-premise installations in data-sensitive environments.

Syllabus

Intro
The Challenge: Go Fast.... On-Premise?
Solution: A Service-Oriented Architecture for Capability Deployment
Project Architecture: Focused on High Performance Computing
Results: Spark Open Source vs Spark DBR
Lessons Learned for Future On-Premise Installs

Taught by

Databricks

Reviews

Start your review of Apache Spark vs Databricks - On-Premise Performance Comparison

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.