Overview
Syllabus
Intro
And the problem space is complex.
Write workload, trailing year
Read workload, trailing year
Service Level Objectives (SLO)
Data storage engine and analytics flow
SLOs are user flows
Service-Level Objectives
Functional and visual testing.
Design for feature flag deployment.
Automated integration & human review.
Green button merge.
Auto-updates, rollbacks, & pins.
Observe behavior in prod.
Non-trivial savings.
Three case studies of failure
1 Shepherd: ingest API service
Honeycomb Ingest Outage
Now what?
Kafka: data bus
Our month of Kafka pain
Unexpected constraints
Take care of your people
Optimize for safety
Retriever: query service
Making progress carefully
Takeaways
Acknowledge hidden risks
Make experimentation routine!
Understand & control production.
Taught by
ChariotSolutions