Explore the critical issue of cross-system interaction failures in modern cloud environments through this 40-minute conference talk from SREcon24 Americas. Delve into the complexities of orchestrating multiple independent subsystems in cloud architectures and understand how their interactions impact overall system reliability. Examine the challenges posed by microservice and serverless architectures, where individual components become simpler but interactions grow more complex. Learn about recent production incidents in large-scale cloud systems caused by failures across system boundaries. Discover new techniques and practices developed at the University of Illinois at Urbana-Champaign to address these cross-system interaction failures. Gain insights into characterizing various forms of cross-system interactions, their failure modes in cloud-native stacks, and the limitations of current software testing and verification methods in this context.
Overview
Syllabus
SREcon24 Americas - Cross-System Interaction Failures: Don't Fail through the Cracks
Taught by
USENIX