Dumb-Proofing Data Pipelines: Techniques for Configurable and Maintainable ETL - Databricks
Databricks via YouTube
Overview
Syllabus
Intro
Why make your data pipelines dumb-proof?
How to make your data pipelines dumb-proof?
Fixing Hard coded Data Pipelines
Parameters & Input Validation
Externalizing Configuration
Configuration in JSON Format
Optimized Configuration in HOCON format
Readable and maintainable Configuration
Configuration Library
Refactor Code - Loading and Parsing Configuration
Boilerplate free configuration code
Sample Code
Summary
Taught by
Databricks