Explore fault tree analysis applied to Apache Kafka deployments in this 31-minute conference talk from SREcon19 Americas. Learn how Lyft's Andrey Falko implemented this technique to enhance the resilience of Kafka clusters. Gain insights into the key focus areas for bulletproofing your own Apache Kafka infrastructure and discover practical strategies for improving system reliability. Understand the application of fault tree analysis in real-world scenarios and its potential impact on SRE practices.
Overview
Syllabus
SREcon19 Americas - Fault Tree Analysis Applied to Apache Kafka
Taught by
USENIX