Best Practices for Building Robust Data Platforms with Apache Spark and Delta

Databricks via YouTube

Classroom Contents

  1. Intro
  2. Data Challenges
  3. Usual Data Lake
  4. Getting the Data Right
  5. Best Practices for Cluster Sizing & Selection
  6. Selection of Instance Types
  7. Selection of Node Size: Rule of Thumb
  8. Observe Spark UI & Tweak the Workloads
  9. Observe Ganglia Metrics & Tweak the Workloads
  10. Performance Symptoms
  11. Adaptive Query Execution
  12. Data Governance with Delta Lake
  13. Audit & Monitoring
