Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a 35-minute conference talk from SREcon18 Europe that delves into the lessons learned from deploying Site Reliability Engineering (SRE) training best practices to production. Learn about Google Ireland's journey in implementing SRE training, including the timeline, importance of learning, and key insights gained. Discover strategies for building sequential learning experiences, breaking real systems safely, continuous education, and fostering a culture of observability. Gain valuable insights from survey data and open-ended comments, and understand the importance of avoiding hero culture in SRE. This USENIX presentation offers practical takeaways for organizations looking to enhance their SRE training programs and improve overall reliability practices.
Syllabus
Intro
Agenda
Introduction
How did we get here
Timeline
Why Learning Matters
What Did We Learn
Building sequential learning experiences
Breaking real things
Ride shotgun
Continuous education
New to new territory
SLO Czar
Culture
Dont be a Hero
Observability
Survey Data
Survey Results
Openended Comments
Summary
Shoutouts
Taught by
USENIX