Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore real-time analytics using Apache Cassandra in this conference talk from GOTO 2012. Delve into the concept of real-time data processing, learn about query optimization, and discover how to design effective solutions for maintaining counters and precomputing results. Examine the Twosies approach and various indexing techniques, including Catalyst. Gain insights into data structures, commit logs, and merge costs. Address additional requirements such as multi-date support and low latency. Learn how to write efficient code using Kinetics and implement incremental analytics. Understand the importance of replay, balancing counter replicas, and leveraging solid-state disks for optimal performance in real-time analytics systems.
Syllabus
Introduction
What is realtime
Queries
Realtime
Example
Design a solution
Maintain a counter
Precompute
Twosies
Solutions
Indexing
Catalyst
Features
Sequential Rights
Data Structures
Commit Log
Merge
Cost
Other requirements
Multidate support
Low Latency
Writing Code
Kinetics
Incremental Analytics
Replay
Balance
Counter replicas
Solid state disks
Taught by
GOTO Conferences