Migrating ETL Workflows to Apache Spark at Scale - Pinterest's Experience

Migrating ETL Workflows to Apache Spark at Scale - Pinterest's Experience

Databricks via YouTube Direct link

Balancing Performance

21 of 23

21 of 23

Balancing Performance

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Migrating ETL Workflows to Apache Spark at Scale - Pinterest's Experience

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 We Are on Cloud
  3. 3 Spark Clusters
  4. 4 Spark Versions and Use Cases
  5. 5 Migration Plan
  6. 6 Migration Path
  7. 7 Spark API
  8. 8 Approach
  9. 9 Translate Cascading
  10. 10 UDF Translation
  11. 11 Translate Scalding
  12. 12 Secondary Sort
  13. 13 Accumulators
  14. 14 Accumulator Continue
  15. 15 Accumulator Tab in Spark UI
  16. 16 Profiling
  17. 17 Automatic Migration Service (AMS)
  18. 18 Data Validation
  19. 19 Source of Uncertainty
  20. 20 Performance Tuning
  21. 21 Balancing Performance
  22. 22 Automatic Migration & Failure Handling
  23. 23 Future Plan

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.