Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore effective strategies for testing data pipelines in this informative PyCon US talk. Learn how to ensure smooth data flow and quickly identify and resolve issues in your pipelines. Discover toolkit-agnostic techniques applicable beyond Airflow, including unit testing for individual components, integration testing for the entire pipeline, and end-to-end testing for accurate data output. Gain insights into unique methods such as data snapshot testing and online and offline data quality checks. Apply software application testing principles to data pipeline development and maintenance. Access the presentation slides for a comprehensive overview of the concepts discussed in this 25-minute talk.
Syllabus
Talks - Amitosh Swain: Testing Data Pipelines
Taught by
PyCon US