Completed
Use data sets
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Apache Spark Beyond Shuffling - Why it isn't Magic - but also where there is some really cool Magic
Automatically move to the next video in the Classroom when playback concludes
- 1 Intro
- 2 Holdens background
- 3 Trans and clear
- 4 IBM
- 5 People
- 6 What is Spark
- 7 Why people come to Spark
- 8 The magic of Spark
- 9 What is RDD
- 10 RDD Example
- 11 Word Count Example
- 12 Important Note
- 13 Example
- 14 Problems withSPARC
- 15 Tokenizing the data
- 16 Magic
- 17 Key Skew
- 18 Data with Key Skew
- 19 Explosion
- 20 San Francisco
- 21 Hack
- 22 Grouping by Key
- 23 Bad Word Count
- 24 Data Size
- 25 Input
- 26 Reduce by Key
- 27 Data Frames
- 28 Fuzzy Pandas
- 29 Python
- 30 Driver
- 31 How does this break
- 32 Use data sets
- 33 Spark videos
- 34 Testing libraries
- 35 Spark books
- 36 Corporate compliance
- 37 Office hours
- 38 Questions