Overview
Syllabus
Intro
Nielsen's Architecture (AT THE TIME)
Data Lake
Data Arrival Pain Points
Recovering from failures
Is it the end of the day yet? When do we process data?
Is it the end of day yet? Legacy answers to a legacy problem
Little Fires Everywhere
Auditing window? Let's design our metadata
Auditing Header Injection
Shipping Audit Window to Collection Point
Consuming Audit Data
In Context
Storing Data and Querying to Optimum
Designing Out Output Table
Shout out to my dad....
Optimizing PostgreSQL for Audit Queries
Managing Partitions with Apache Airflow
Offloading Data to History
Scheduling your spark job
It is not the end of the day
Alerts and add-ons
Alerting system
Detecting duplications
Taught by
NDC Conferences