Overview
Discover best practices for scaling incident response and automating remediation in this 38-minute Observability Clinic video. Learn how modern organizations reduce Mean Time to Repair (MTTR) by 99% through automated problem detection and remediation with Ansible and Dynatrace. Explore key incident response metrics, identify high-impact problem areas, and follow the 5-step journey to auto-remediation. Watch a live demonstration of the Ansible and Dynatrace integration in action. Gain insights into managing app diagnostics, handling incident overload, and implementing effective auto-remediation strategies for improved system reliability and efficiency.
Syllabus
– Introduction
– What you are going to learn today
– Timeline of an Incident
– How many apps can you manage?
– The Diagnostics of the Unknowns
– Incident Overload
– Identify Problem Areas
– Auto Remediation with Dynatrace
– 5 Steps to start
– LIVE Demo with Ansible & Dynatrace
– Some final thoughts
Taught by
Dynatrace