Overview
Explore chaos engineering principles and practices in this 19-minute conference talk from Conf42 SRE 2024. Discover why chaos engineering is crucial for building fault-tolerant systems and learn about its business impact. Understand the differences between traditional testing and chaos experiments. Follow a step-by-step guide on implementing chaos engineering, including observing steady state, planning hypotheses, running experiments, and verifying results. Gain insights into the October 4 outage and its implications. Explore various chaos engineering tools to enhance system resilience and reliability.
Syllabus
intro
preamble
agenda
why chaos engineering?
update about the october 4 outage
business impact of resilience is bigger than ever
why are these issues not surfaced during testing?
testing
what is chaos engineering?
testing vs experiments
chaos engineering: how to
#1: observe steady state
#2: plan hypothesis around the steady state
#3: run experiments
#4: verify and act
chaos engineering tools
thank you!
Taught by
Conf42