Beyond Shuffling - Scaling Apache Spark

Beyond Shuffling - Scaling Apache Spark

Scala Days Conferences via YouTube Direct link

Intro

1 of 17

1 of 17

Intro

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Beyond Shuffling - Scaling Apache Spark

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 What is going to be covered
  3. 3 The different pieces of Spark
  4. 4 What is key skew and why do we care?
  5. 5 Well there is a bit of magic in the shume....
  6. 6 Iterator tortor transformations
  7. 7 Why is Spark SQL good for those things?
  8. 8 How much faster can it be?
  9. 9 How to avoid lineage explosions
  10. 10 Introducing Datasets
  11. 11 And functional style maps
  12. 12 Switching gears: Valdating Spark jobs
  13. 13 Using an accumulator for validation
  14. 14 Validating records read matches our expectations
  15. 15 Additional Spark Testing Resources
  16. 16 Additional Spark Resources
  17. 17 Spark Videos

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.