Apache Spark Beyond Shuffling - Why it isn't Magic - but also where there is some really cool Magic

Apache Spark Beyond Shuffling - Why it isn't Magic - but also where there is some really cool Magic

GOTO Conferences via YouTube Direct link

IBM

4 of 38

4 of 38

IBM

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Apache Spark Beyond Shuffling - Why it isn't Magic - but also where there is some really cool Magic

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 Holdens background
  3. 3 Trans and clear
  4. 4 IBM
  5. 5 People
  6. 6 What is Spark
  7. 7 Why people come to Spark
  8. 8 The magic of Spark
  9. 9 What is RDD
  10. 10 RDD Example
  11. 11 Word Count Example
  12. 12 Important Note
  13. 13 Example
  14. 14 Problems withSPARC
  15. 15 Tokenizing the data
  16. 16 Magic
  17. 17 Key Skew
  18. 18 Data with Key Skew
  19. 19 Explosion
  20. 20 San Francisco
  21. 21 Hack
  22. 22 Grouping by Key
  23. 23 Bad Word Count
  24. 24 Data Size
  25. 25 Input
  26. 26 Reduce by Key
  27. 27 Data Frames
  28. 28 Fuzzy Pandas
  29. 29 Python
  30. 30 Driver
  31. 31 How does this break
  32. 32 Use data sets
  33. 33 Spark videos
  34. 34 Testing libraries
  35. 35 Spark books
  36. 36 Corporate compliance
  37. 37 Office hours
  38. 38 Questions

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.