Overview
Explore the fundamentals of Delta Lake in this 38-minute sponsored tutorial presented by Guenia Izquierdo Delgado and Sajith Appukuttan from Databricks. Learn about the open-source storage framework that lets you build a Lakehouse architecture with compute engines such as Spark, PrestoDB, Flink, Trino, and Hive. Discover how Delta Lake addresses modern data engineering requirements and challenges, with a focus on data reliability and optimized query performance for big data use cases. Through presentations, hands-on code examples, and notebooks, gain insight into batch and streaming data ingestion, fast interactive queries, and machine learning applications. Understand the key data reliability challenges, learn how Delta Lake improves data lakes at scale, and explore its role in the wider open-source ecosystem for framework and tool developers. By the end of the tutorial, you will know how to create a Lakehouse architecture using Delta Lake and what benefits it may offer your organization. To participate fully, make sure Docker Engine is installed on your computer.
Syllabus
Sponsored Session: Getting Started with Delta Lake - Guenia Izquierdo Delgado & Sajith Appukuttan
Taught by
Linux Foundation