Better Reliability Through Observability and Experimentation
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Explore the intersection of Site Reliability Engineering (SRE), observability, and experimentation in this 37-minute conference talk from KubeCon + CloudNativeCon. Discover how to treat reliability as an organizational challenge rather than just a software problem. Learn practical approaches to improve service reliability through simulated outages, observability techniques, and analysis. Gain insights into determining workload misbehavior and preparing for service disruptions. Understand how to leverage OpenTelemetry and OpenTracing to enhance system reliability beyond deployments. Join Julie Gunderson from Gremlin and Kerim Satirli from HashiCorp as they guide you through a journey of better reliability practices in cloud-native environments.
Syllabus
Better Reliability Through Observability and Experimentation - Julie Gunderson & Kerim Satirli
Taught by
CNCF [Cloud Native Computing Foundation]