Confidence with Chaos for Kubernetes Observability
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Explore advanced Kubernetes observability techniques in this 36-minute conference talk by Michael Friedrich from GitLab. Dive into practical insights on taking your observability stack beyond a basic Prometheus deployment and a handful of dashboards. Learn how to deal with an overwhelming volume of alerts, fine-tune Service Level Objectives (SLOs), and make dashboards more granular. Discover strategies for simulating production incidents to test whether your SLOs and alerts actually work. Gain knowledge of Kubernetes metrics, Prometheus alerting, chaos engineering with Chaos Mesh, and application instrumentation with OpenTelemetry. Draw on examples of real-world production incidents and failed SLOs to build confidence in chaos engineering, whether you work as an SRE or as a developer. Embrace day-2 DevOps practices and sharpen your Kubernetes observability skills to build more robust and reliable systems.
Syllabus
Confidence with Chaos for Your Kubernetes Observability - Michael Friedrich, GitLab
Taught by
CNCF [Cloud Native Computing Foundation]