Completed
Introducing Datasets
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Beyond Shuffling - Scaling Apache Spark
Automatically move to the next video in the Classroom when playback concludes
- 1 Intro
- 2 What is going to be covered
- 3 The different pieces of Spark
- 4 What is key skew and why do we care?
- 5 Well there is a bit of magic in the shume....
- 6 Iterator tortor transformations
- 7 Why is Spark SQL good for those things?
- 8 How much faster can it be?
- 9 How to avoid lineage explosions
- 10 Introducing Datasets
- 11 And functional style maps
- 12 Switching gears: Valdating Spark jobs
- 13 Using an accumulator for validation
- 14 Validating records read matches our expectations
- 15 Additional Spark Testing Resources
- 16 Additional Spark Resources
- 17 Spark Videos