Overview
Explore the challenges and strategies of establishing a Site Reliability Engineering (SRE) team within a large enterprise in this 33-minute conference talk from SREcon20 Americas. Discover how BT approached implementing SRE principles by first addressing critical business concerns such as security, cloud sprawl, and cost control. Learn about the complexities of creating an SRE team beyond simply renaming an existing operations team or copying Google's model. Gain insights into the journey of building an SRE team, including the main obstacles faced in a corporate environment, key learnings, and valuable advice for newly formed SRE teams. Understand the importance of tailoring SRE practices to specific business needs and establishing unique standards. Follow the structured presentation as it covers topics like cloud computing standards, chaos engineering, and integrating SRE into engineering processes.
Syllabus
Introduction
Presentation Overview
BT Background
Waynes Background
Waynes Story
Where to Start
Challenges
Management
The Solution
Goals
Planning and Organizing
Cloud Sprawl
Cloud Computing Standards
Chaos Engineering
SRE at BT
SRE in Engineering
Summary
Conclusion
Taught by
USENIX