Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Building a High-Performance Real-Time Analytics Database with Apache Kafka and Druid

CodeWithYu via YouTube

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn to build a high-performance real-time analytics database in this comprehensive video tutorial that demonstrates end-to-end data engineering using Apache ecosystem tools. Master essential concepts including streaming data with Apache Kafka, implementing distributed synchronization through Zookeeper, processing and storing data with Apache Druid, and containerizing the entire environment using Docker and Orbstack. Follow along with hands-on demonstrations covering system architecture design, project initialization, container setup, data streaming implementation, Apache Druid configuration, and execution of real-time queries and time-based aggregations. Gain practical experience in connecting Apache Druid to Kafka streams and performing advanced data operations while building a production-ready analytics infrastructure.

Syllabus

Introduction
List of Apache Frameworks for Data Engineering
System Architecture
Starting up a project from scratch
Setting up the containers and services on Docker
Streaming data into Apache Kafka
Apache Druid Walkthrough
Connecting Apache Druid to Apache Kafka
Realtime Queries and Aggregations on Apache Druid
Time Aggregations on Apache Druid
Outro

Taught by

CodeWithYu

Reviews

Start your review of Building a High-Performance Real-Time Analytics Database with Apache Kafka and Druid

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.