Overview
Join AWS operational leaders in this 49-minute conference talk from re:Invent 2024 to explore critical lessons learned about building resilient services beyond basic high availability design. Discover practical insights from 18 years of operational excellence at AWS, featuring real-world stories that demonstrate how to prepare for and effectively manage unexpected failures. Learn valuable strategies for maintaining service stability and mitigating impact during inevitable system disruptions, drawing from AWS's extensive experience in cloud computing operations.
Syllabus
AWS re:Invent 2024 - Failing without flailing: Lessons we learned at AWS the hard way (ARC333)
Taught by
AWS Events