Overview
Explore the world of streaming ETL in this 46-minute Devoxx conference talk. Dive into the importance of bridging the gap between data in motion and data at rest, with a focus on Apache Kafka as a central nervous system for company-wide data architectures. Learn about building robust data integration pipelines between MongoDB and Apache Kafka using the Kafka Connect framework. Discover configuration-based data in motion scenarios and streaming ETL pipeline examples that can be implemented without coding. Gain insights into topics such as the diminishing value of data, Data Fabric, Kafka APIs, Source Connectors, and achieving a Single Source of Truth. Understand how to synchronize data across services and see practical examples of Source Connector implementation.
Syllabus
Introduction
The diminishing value of data
Data Fabric
Kafka
Kafka API
Kafka Connect
Source Connectors
Single Source of Truth
Synchronize Data Across Services
Source Connector
Example
Taught by
Devoxx