Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the world of observability in this 36-minute conference talk that delves into leveraging cloud-native technologies beyond traditional cloud environments. Discover the core principles of Prometheus and its role in monitoring systems. Learn about the main selling points of time series data, the ease of emitting, parsing, and reading metrics, and the scalability of Prometheus. Gain insights into OpenMetrics and the three pillars of observability. Examine Loki's capabilities in log aggregation and its connection to distributed tracing. Understand the complexities of modern systems, the importance of Site Reliability Engineering (SRE) as an implementation of DevOps practices, and the significance of shared understanding in maintaining services. Dive into the essentials of alerting and how it fits into the observability landscape.
Syllabus
Intro
Dirty secret
Buzzword alert!
IDENTIFYING WOOD
Complexity
SRE, an instantiation of DevOps
Shared understanding
Services
Alerting
Prometheus 101
Main selling points
Time series
Super easy to emit, parse & read
Prometheus scale
OpenMetrics
Three pillars
Loki 101
From logs to traces
Taught by
Linux Foundation