Overview
Learn how to significantly improve Lakehouse query performance through a technical conference talk exploring the integration of Apache Hudi and Presto's multi-modal index subsystem. Discover how Apache Hudi enhances data lakes with transactions and incremental processing capabilities, establishing core components for Lakehouse architecture. Explore the native Hudi connector in Presto, examining key optimizations and features including metadata table utilization for streamlined file listing operations in cloud storage environments. Gain insights into advanced data skipping methodologies and learn how to leverage the multi-modal subsystem to achieve substantial improvements in query latency. Master the unification of batch and stream processing through Hudi's incremental processing model, addressing critical technology gaps in modern data architecture.
Syllabus
How to Speed up your Lakehouse Queries by an Order of Magnitude with Multi-mo... Sivabalan Narayanan
Taught by
Presto Foundation