Learn to monitor and optimize data storage and data processing on Azure, and prepare for that domain of the Microsoft Azure Data Engineering (DP-203) exam.
Syllabus
Introduction
- Course introduction
- Implement logging used by Azure Monitor
- Configure monitoring services
- Measure performance of data movement
- Monitor data system/pipeline/cluster performance
- Measure query performance
- Schedule and monitor pipeline tests
- Interpret a Spark directed acyclic graph (DAG)
- Rewrite user-defined functions (UDFs)
- Handle skew in data and data spill
- Tune shuffle partitions/pipelines
- Optimize resource management
- Tune queries by using indexers and cache
- Troubleshoot a failed Spark job and pipeline run
- Summary and next steps
Taught by
Noah Gift