Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the concepts and skills required to monitor and optimize data storage and data processing to pass the Microsoft Azure Data Engineer Associate (DP-203) certification exam.
Syllabus
1. Monitor Data Storage
- Learning objectives
- Implement logging used by Azure Monitor
- Configure monitoring services
- Measure performance of data movement
- Monitor and update statistics about data across a system
- Monitor data pipeline performance
- Measure query performance
- Learning objectives
- Monitor cluster performance
- Understand custom logging options
- Schedule and monitor pipeline tests
- Interpret Azure Monitor metrics and logs
- Interpret a Spark Directed Acyclic Graph (DAG)
- Learning objectives
- Compact small files
- Rewrite user-defined functions (UDFs)
- Handle skew in data
- Handle data spill
- Tune shuffle partitions
- Find shuffling in a pipeline
- Optimize resource management
- Learning objectives
- Tune queries by using indexers
- Tune queries by using cache
- Optimize pipelines for analytical or transactional purposes
- Optimize pipeline for descriptive versus analytical workloads
- Troubleshoot failed Spark jobs
- Troubleshoot failed pipeline runs
- Summary
Taught by
Microsoft Press and Tim Warner