Overview
Watch a 29-minute conference talk from OSA Con 2023 where Nadine Farah explores how to build an efficient medallion architecture using Apache Hudi and DBT. Learn how raw operational data is transformed into refined, analytics-ready tables through a series of processing stages. Discover how Apache Hudi's transactional data lake platform enables streaming upserts and incremental processing, while DBT's incremental loading feature reduces resource usage by processing only new or updated records. Understand the challenges of building low-latency medallion architectures, explore Hudi's Change Data Capture (CDC) feature for processing change records, and learn practical techniques for using DBT to propagate data changes from bronze to gold tables. Gain insights into implementing these technologies, which power some of the largest transactional data lakes in the industry, to create efficient, scalable data processing pipelines.
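As a rough illustration of the incremental idea described above (process only new or updated records, then upsert them by key), here is a minimal sketch in plain Python. It does not use Hudi or DBT and is not taken from the talk; the names `incremental_upsert`, `bronze`, `silver`, and the `updated_at` watermark column are assumptions made for this example.

```python
from typing import Dict, List, Tuple

Record = Dict[str, int]


def incremental_upsert(
    target: Dict[int, Record], source: List[Record], watermark: int
) -> Tuple[int, int]:
    """Upsert into `target` only the source records newer than `watermark`.

    Returns the advanced watermark and the number of records processed.
    Hypothetical helper for illustration, not a Hudi or DBT API.
    """
    # Select only records that changed since the previous run.
    changed = [r for r in source if r["updated_at"] > watermark]
    # Upsert by primary key: insert new keys, overwrite existing ones.
    for record in changed:
        target[record["id"]] = record
    # Advance the watermark to the newest record seen, if any.
    new_watermark = max((r["updated_at"] for r in changed), default=watermark)
    return new_watermark, len(changed)


# Assumed sample data: a "bronze" list of raw records and an empty "silver" table.
bronze: List[Record] = [
    {"id": 1, "amount": 10, "updated_at": 1},
    {"id": 2, "amount": 20, "updated_at": 2},
]
silver: Dict[int, Record] = {}

# First run: every record is new, so both are processed.
watermark, first_run_count = incremental_upsert(silver, bronze, 0)

# A later update to id 1 arrives; the second run touches only that record.
bronze.append({"id": 1, "amount": 15, "updated_at": 3})
watermark, second_run_count = incremental_upsert(silver, bronze, watermark)
```

In this sketch the second run processes one record instead of rescanning all three, which is the resource saving that incremental loading aims for.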
Syllabus
Data Alchemy: Transforming Raw Data to Gold with Apache Hudi and DBT
Taught by
OSACon
Reviews
4.0 rating, based on 1 Class Central review
The "Data Alchemy: Transforming Raw Data to Gold with Apache Hudi and DBT" course is incredibly valuable. It provides practical insights into managing and transforming data effectively, making it ideal for anyone looking to enhance their data engineering skills. The combination of Apache Hudi's capabilities and DBT's modeling features offers a powerful toolkit for handling complex data workflows. Highly recommend for data professionals!