Syllabus
Intro
Reliability is easy to take for granted
What is Site Reliability Engineering (SRE)?
Part I: Dev and Ops
Is conflict inevitable?
Service Level Agreement (SLA)
What do you spend your budget on?
The rule
Two nice features of Error Budgets
Part II: Staffing, Work, Ops Overload
SRE hires only coders
50% cap on Ops work
Keep Dev in the rotation
Speaking of Dev and Ops work...
SRE Portability
Part III: Death, taxes, and outages...
Minimize Damage
A word on practice...
Wheel of Misfortune
Prevent recurrence
Post-mortem philosophy
Summary
O'Reilly Book
Taught by
GOTO Conferences