Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Dive into the world of high-performance analytics with this 27-minute conference talk presented by Gian Merlino from Imply. Explore the challenges and solutions for handling analytics at an impressive 1000 queries per second (QPS) and beyond. Learn about Apache Druid and its role in large-scale data processing. Discover key concepts such as CPU optimization, I/O management, data locality, and deferred computations. Gain insights into scaling strategies and techniques for managing heterogeneous workloads in high-volume analytics environments. This English-language presentation offers valuable knowledge for data engineers and analysts working with big data and real-time analytics systems.
Syllabus
Introduction
Apache Druid
Overview
CPU IO
Locality
Deferring computations
Scaling
Heterogeneous workloads
Taught by
Linux Foundation