Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Spark at Scale: Engineering Strategies for Data Science Workflows

Data Science Festival via YouTube

Overview

Explore engineering strategies for optimizing data science workflows using Spark in this 40-minute conference talk from the Data Science Festival Summer School 2023. Dive into a case study presented by Neil McCulloch, Data Science Engineer at dunnhumby, focusing on improving the performance of problematic PySpark applications. Learn how to slash runtimes in half for in-store availability reporting science. Gain insights into tackling large-scale data processing challenges and enhancing the efficiency of Spark-based data science projects. Discover practical approaches to optimize PySpark applications and streamline big data analytics workflows.

Syllabus

Spark at Scale: Engineering Strategies for Data Science Workflows

Taught by

Data Science Festival

Reviews

Start your review of Spark at Scale: Engineering Strategies for Data Science Workflows

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.