Overview
Learn how to architect modern data lakes using object storage in this 12-minute conference talk from MinIO's Satish Ramakrishnan. Discover the synergistic relationship between distributed object stores and MPP query engines like Presto, particularly for managing massive datasets of tens to hundreds of petabytes with concurrent query processing. Explore sophisticated object storage features including throughput optimizations, multi-cloud capabilities, cross-cloud active-active replication, and lifecycle management. Gain insights into a reference architecture designed for efficient query processing at object scale, addressing the demands of distributed, unstructured data lakes that require both performance and scalability.
Syllabus
Building Modern Data Lakes for Analytics Using Object Storage - Satish Ramakrishnan, MinIO
Taught by
Presto Foundation