Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Dive into the intricacies of Delta Lake's merge operation in this 24-minute talk from Databricks. Explore the underlying mechanics of merge, learn optimization techniques, and gain insights through code snippets and sample configurations. Understand the basics of merge, including inner and full outer joins, and discover practical tips for handling large merges. Examine partition and file pruning examples, operation metrics, and best practices for S3 bucket usage. Enhance your knowledge of Delta Lake and improve your data management skills with this informative session from the Databricks Summit Europe.
Syllabus
SUMMIT EUROPE
Merge overview
Merge basics • Tale of two joins: inner join and full outer join
Partition Prune Example
File Prune Example
Operation Metrics continued
Large merge tips s3 bucket: write at the root-53 parallelism is defined by the Each large table should have its own s3 bucket and anoth
Final recap
Feedback
Taught by
Databricks