How Adobe Processes 2 Million Records Per Second Using Apache Spark

How Adobe Processes 2 Million Records Per Second Using Apache Spark

Databricks via YouTube Direct link

Intro

1 of 11

1 of 11

Intro

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

How Adobe Processes 2 Million Records Per Second Using Apache Spark

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 What do you mean by Processing? Agenda!
  3. 3 Unified Profile Data Ingestion
  4. 4 Generic Flow
  5. 5 Flow with MinPartitions partitions on Kafka
  6. 6 MicroBatch Hard! Logic Best Practices
  7. 7 An Example
  8. 8 For Repeated Queries Over Same DF
  9. 9 Join Optimization For Interactive Queries (Opinionated)
  10. 10 How to get the magic targetPartitionCount?
  11. 11 Digging into Redis Pipelining + Spark

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.