Overview
Explore strategies for managing metrics growth and cardinality in cloud-native environments during this 27-minute SREcon21 talk by Rob Skillington from Chronosphere. Learn best practices for efficiently controlling metrics data expansion, including proven KPIs and techniques for maintaining a high-performance observability function. Gain insights from real-world examples across the observability space and discover how to implement these approaches using existing SRE resources. Understand methods for tracking and measuring observability efforts, and leave with practical knowledge on maximizing your organization's observability capabilities in the face of exponential data growth.
Syllabus
Intro
Our mission help customers get to remediation as quickly as possible
Growth in monitoring data at Uber
Scenarios for taming data growth and cardinality
Tips for how to reduce these tensions
Tips for managing metrics at a more macro level
Tips for how to make observability a team effort
Internal KPIs and metrics - meta metrics
Taught by
USENIX