Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore chaos architecture principles and mindset in this 45-minute conference talk from GOTO Chicago 2018. Dive into cloud usage patterns, from faster data centers to complete data center replacement strategies. Learn how to handle failures, understand availability theater, and develop observability in cloud-native environments. Discover the importance of chaos engineering teams, tools, and security red teams in building resilient systems. Examine interconnected applications, operators, and users while considering safety margins and hypothesis testing. Gain insights from Adrian Cockcroft, Vice President of Cloud Architecture Strategy at Amazon Web Services, on developing a robust approach to system failures and creating more durable cloud-based workloads.
Syllabus
Introduction
What to do when something fails
Permissions lookup fails
Availability theater
The fairy tale
What goes wrong
Renew DNS entries
Flooded data center
You cant legislate against failure
Observability
Network is reliable
Book Chapter 2
Chaos Architecture
Interconnecting
Applications
Operators Users
People Testing
Who Runs the Fire
Chaos Engineering Team
Chaos Tools
Security Red Team
Security Tools
Safety
Safety anarchist
Failures
Safety Margins
Hypothesis Testing
WrapUp
Taught by
GOTO Conferences