Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Supporting Machine Learning Workloads in Presto - Optimizing Data Preparation and Processing

Presto Foundation via YouTube

Overview

Learn how machine learning users can better leverage Presto for preparing large-scale training datasets in this 24-minute technical talk from the Presto Foundation. Explore key challenges faced when using Presto for ML workloads, drawing from real implementation experiences at Meta. Dive into three critical dimensions: efficient storage and in-memory data layout optimization, compressed execution's impact on operator design, and the concept of extreme late materialization. Discover recent advancements made by Meta's team in supporting ML workloads, examine initial performance results, and understand the ecosystem of open source projects supporting this technology stack. Gain insights into areas requiring further research, development and community collaboration to enhance Presto's capabilities for machine learning applications.

Syllabus

How can Presto better support ML users?

Taught by

Presto Foundation

Reviews

Start your review of Supporting Machine Learning Workloads in Presto - Optimizing Data Preparation and Processing

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.