Overview
Syllabus
Intro
Launch Status Check
Service Outages
Host Alerts
What Makes a Good Alert
Noise Floor
SRE Burnout
War Rooms
Sharing
SpaceX
Reliability Theater
Incident Response
Monitoring
Virtualized Servers as Cattle
Containers vs Cattle
Configuration Management
Immutable Infrastructure
Configuration Management doesn't scale
Automation doesn't scale
Centralized tools
Automation
Design Systems
Automating
Burnout Team
Feature Releases
Embedded SME
Production Ready Checklist
Periodic Revisiting
Integrations
Uptime
Risk vs Reward
Dad Jokes
The Linkage
Chaos Monkey
Complex Systems
Real Stories
Interview
Perception
Taught by
USENIX