Overview
Explore a conference talk on debugging cluster issues as an on-call Site Reliability Engineer (SRE). The talk introduces SRE and the on-call process, surveys common cluster problems, and presents an approach to debugging them. It then examines how far automation can help in resolving issues and the different levels at which it can be implemented, and closes with advice for beginners entering the field.
Syllabus
intro
preamble
agenda
whoami
introduction to SRE
understanding on-call process
some common cluster issues
approach to debugging
automation to the rescue?
shades of automation
advice for beginners
thank you!
Taught by
Conf42