Delta Lake: Optimizing Merge Operations

Databricks via YouTube

Overview

Dive into the intricacies of Delta Lake's merge operation in this 24-minute talk from Databricks. Explore the underlying mechanics of merge, learn optimization techniques, and gain insights through code snippets and sample configurations. Understand the basics of merge, including inner and full outer joins, and discover practical tips for handling large merges. Examine partition and file pruning examples, operation metrics, and best practices for S3 bucket usage. Enhance your knowledge of Delta Lake and improve your data management skills with this informative session from the Databricks Summit Europe.
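The "two joins" idea mentioned above can be illustrated with a toy model. The sketch below is plain Python, not Delta Lake code: the table layout (a dict of file name to rows), the function names, and the output file name `merged-0` are all hypothetical and chosen only for illustration. It assumes an upsert-style merge in which an inner join first finds the files that contain matching keys, and a full outer join then rewrites only those files.

```python
# Toy model: a "table" is a dict mapping file name -> {key: value} rows.
# This only illustrates the two-join idea; it is not Delta Lake's implementation.

def find_touched_files(target_files, source_rows):
    """Phase 1 (inner join): files holding at least one key present in the source."""
    source_keys = set(source_rows)
    return {name for name, rows in target_files.items() if rows.keys() & source_keys}


def merge(target_files, source_rows):
    """Upsert source_rows into target_files, rewriting only the touched files."""
    touched = find_touched_files(target_files, source_rows)

    # Rows from touched files are the only target rows that take part in phase 2.
    old_rows = {}
    for name in touched:
        old_rows.update(target_files[name])

    # Phase 2 (full outer join on key):
    #   key in both  -> matched, take the source value (update)
    #   source only  -> not matched, insert
    #   target only  -> copy the old row through unchanged
    rewritten = {}
    for key in old_rows.keys() | source_rows.keys():
        rewritten[key] = source_rows.get(key, old_rows.get(key))

    # Untouched files are carried over as-is; touched files are replaced by one new file.
    result = {name: rows for name, rows in target_files.items() if name not in touched}
    result["merged-0"] = rewritten  # hypothetical output file name
    return result, touched


target = {
    "file-a": {1: "a", 2: "b"},
    "file-b": {3: "c"},
}
source = {2: "B", 4: "d"}
new_target, touched = merge(target, source)
```

In this model only `file-a` is rewritten, because it is the only file containing a key from the source. That is the intuition behind partition and file pruning: the fewer files the first join has to flag, the less data the second join has to rewrite.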

Syllabus

SUMMIT EUROPE
Merge overview
Merge basics • Tale of two joins: inner join and full outer join
Partition Prune Example
File Prune Example
Operation Metrics continued
Large merge tips • S3 bucket: write at the root; S3 parallelism is defined by the … Each large table should have its own S3 bucket and anoth…
Final recap
Feedback

Taught by

Databricks
