Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Discover how Apache Impala has evolved to meet modern data warehouse requirements in this 26-minute conference talk from The Apache Software Foundation. Learn about Impala's new capabilities for reading, modifying, and optimizing Apache Iceberg tables, including row-level modifications and table maintenance features. Explore how Impala now supports RDBMS-like functionalities, such as compliance with GDPR and CCPA regulations through record removal and updates. Understand the benefits of the OPTIMIZE statement for merging small data files and eliminating delete files to maintain table health. Gain insights into the DROP PARTITION statement for selective partition removal based on predicates. Presented by Cloudera engineers Zoltán Borók-Nagy, Péter Rózsa, and Noémi Pap-Takács, this talk demonstrates how Impala has adapted to emerging requirements while maintaining its focus on performance in distributed, massively parallel query execution for big data.