Site Reliability Engineering (SRE): The Big Picture

via Pluralsight

Overview

SRE is how Google runs production systems, promoting high availability
with high velocity and removing operational toil. It achieves the same
goals as DevOps without the culture shift, so it's a better option for many
digital transformations.

Site Reliability Engineering (SRE) is a set of principles and practices that supports software delivery by keeping production systems stable while still delivering new features at speed. In this course, Site Reliability Engineering (SRE): The Big Picture, you'll get a thorough overview of how SRE works and why it's a good choice for many organizations. First, you'll learn the differences between SRE, DevOps, and traditional operations. Next, you'll discover how engineering practices help to reduce toil and free up time for high-value tasks. Finally, you'll learn how SRE approaches monitoring, alerting, and incident management. When you're finished with this course, you'll be able to evaluate SRE and decide whether it's a good fit for your organization.

Syllabus

  • Course Overview (2 mins)
  • Introducing Site Reliability Engineering (27 mins)
  • Automation and Eliminating Toil (30 mins)
  • Service Levels, Monitoring, and Alerting (28 mins)
  • Incident Management: On-call and Postmortems (22 mins)

Taught by

Elton Stoneman

Reviews

4.8 rating at Pluralsight based on 39 ratings
