Building an Open Source Streaming Analytics Stack with Kafka and Druid

Linux Foundation via YouTube

Overview

Explore how to construct a streaming analytics stack using Kafka and Druid in this 41-minute conference talk from the Linux Foundation. Learn about the challenges of batch processing systems and discover how combining Kafka and Druid can create a robust data pipeline supporting real-time and batch ingestion with flexible, low-latency queries. Delve into topics such as event handling, data delivery problems, stream processing challenges, and approximation algorithms. Gain insights into Druid's architecture and understand how this open-source technology combination can guarantee system availability, maintain data integrity, and support fast, flexible queries for deriving insights from vast quantities of data.

Syllabus

Introduction
Overview
The Problem
Events
Example
Problems
Models
Data Delivery
Data Delivery Problems
Kafka Summary
Stream Processing
Stream Processing Challenges
Stream Processing System
Challenges
Subheading Queries
Technical Overview
Approximation Algorithms
Druid Architecture
Rules of Example
Raw Data
Shuffle
Join
Joint
Conclusions

Taught by

Linux Foundation
