Explore a conference talk that delves into a database native solution for managing Change Data Capture (CDC) streaming pipelines in Cassandra. Learn about the common challenges faced by consumers of Cassandra's CDC logs, including transforming log formats, de-duplicating entries, and integrating with streaming applications like Kafka or Amazon Kinesis. Discover a proposed re-imagined CDC architecture for Cassandra, drawing insights from recurring data pipeline issues and other managed CDC solutions such as ScyllaDB, Amazon DynamoDB, and Microsoft Azure. Gain valuable knowledge on streamlining CDC processes and improving data pipeline efficiency in Cassandra-based systems.
Overview
Syllabus
Database Native Support for CDC Streaming Pipeline - Neha Maheshwari & Vinit Gupta, Amazon
Taught by
Linux Foundation