Overview
Explore how Zendesk implemented Site Reliability Engineering (SRE) concepts, specifically Error Budgets and SLOs/SLIs, across their global engineering organization of 1000 people. Learn about the challenges faced in addressing major outages, the impact of company-wide change freezes, and the journey towards improving reliability. Gain practical insights into tooling and practices for implementing Error Budgets, as well as strategies for scoping freezes to systems with more reliability issues. Discover the wins and ongoing challenges in this 32-minute conference talk from YOW! 2019, presented by John Viner, Senior Director of Engineering at Zendesk.
Syllabus
Rolling out Error Budgets Across a 1000 Person Global Engineering Org. • John Viner • YOW! 2019
Taught by
GOTO Conferences