Overview
Discover how to run an effective Site Reliability Engineering (SRE) program with limited resources in this 38-minute conference talk from SREcon20 Americas. Learn strategies to extend your reach and maximize impact when operating with a small team or even as a solo practitioner. Explore how data, tools, and communication techniques can help you influence your organization. Gain insights into using architecture reviews, game days, and chaos engineering to improve system reliability. See how key tools like Erebus and Oath Keeper are put to use, and learn how to structure your approach for the best results. The talk is well suited to SREs and DevOps professionals who want to make a substantial impact despite resource constraints.
Syllabus
Intro
What is this talk about?
Two themes
Structure
Google's Model
Team Topologies
Smallest Possible Team
Architecture Review
Game Days
The Sequence
Data
Tools
Erebus
Oath Keeper
How Your Systems Keep Running
Chaos Engineering
Writing
Taught by
USENIX