Overview
Syllabus
Introduction
Reliability
Who am I
Who Cares About Reliability
How Do We Measure Reliability
Own Your Own Failures
Background Noise Failures
probabilistic failure
impact reach high
bad day
failure detection
ride around
load balancing
error codes
microservice architecture
overall success rate
failures on edges
the caller
the service
latency
observability
rollback
rollback example
rolling failures
more problems
planned events
retry storms
retry budget
cluster overload
request not getting response
bad neighbors
toggles
staging
other teams
failing collaboratively
finagle
Is Fif extensible
How would you participate in an ecosystem
Taught by
Devoxx