Overview
Syllabus
intro
preface
have you used this in your career? traffic for total.rrd
hi, i'm fred
how do you implement slos for 1000 engineers?
books
sli: good vs bad requests
slo: good/bad time_range
eb: 1-slo, 1-0.9995 = 0.05%
keys to slo / error budget democratization
latency and availability
measuring availability is easy, measuring latency is not easy
quantifying latency at scale
a common mistake
"dr. histogram - how i learned to stop worrying and love latency bands"
use raw histograms, avoid sketches & approximations
decomposing histogram modes
multi service slos / error budgets
thank you, questions?
Taught by
Conf42