Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Gray Failure: The Achilles' Heel of Cloud-Scale Systems

USENIX via YouTube

Overview

Explore the concept of gray failure in cloud-scale systems through this 44-minute conference talk from SREcon24 Americas. Delve into case studies from Microsoft Azure to understand the broad scope and consequences of subtle underlying faults that can cause major availability breakdowns and performance anomalies. Learn about differential observability and how it impacts failure detection in cloud environments. Discover practical applications of differential observability in Microsoft Azure and strategies for bridging the gap between different components' perceptions of failures. Gain valuable insights for improving reliability and performance in large-scale cloud systems.

Syllabus

SREcon24 Americas - Gray Failure: The Achilles’ Heel of Cloud-Scale Systems

Taught by

USENIX

Reviews

Start your review of Gray Failure: The Achilles' Heel of Cloud-Scale Systems

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.