Incident Response: A Scientific Approach to Improving System Reliability

Conf42 via YouTube

Classroom Contents

  1. intro
  2. preamble
  3. incident response can learn from safety engineers in other domains
  4. a definition...
  5. catastrophe is always around the corner
  6. incident response isn't easy
  7. an overreliance on dashboards and runbooks
  8. guesswork
  9. spending a long time on the wrong hypothesis
  10. fear of failure
  11. 'history doesn't repeat itself but it often rhymes'
  12. 'it seems easy to look back at an incident and determine what went wrong ...'
  13. normative language
  14. mechanistic reasoning
  15. above the line, below the line
  16. change introduces new forms of failure
  17. experienced troubleshooters rely more on case-based strategies
  18. science - definition
  19. the theory of falsifiability
  20. 'a more scientific, hypothesis-driven, approach to how humans perform ... can improve reliability'
  21. why bother?
  22. 3 steps
  23. all practitioner acts are a gamble
  24. thank you