Planning for and Handling Failures - From Open Hardware and Aviation to Production at Google

linux.conf.au via YouTube

Overview

This 46-minute conference talk from linux.conf.au examines how failures are planned for and handled in three fields: open hardware, aviation, and Google's production environment. It draws on real-world examples to show how to anticipate failures, prevent them, and learn from them. Topics include developing a sense for potential issues, putting effective procedures in place, and conducting thorough root cause analyses. The aviation material covers critical incidents such as AF447 and QF32 and the consequences of automation gone wrong. The talk also covers avoiding hardware mishaps, improving software development practices, and running proper postmortems.

Syllabus

Intro
Managing failures
Eusebio
Be mindful
Hardware
Phone
Spare to spare
Software
Code Reviews
Change Requests
Unit tests
Continuous integration
File updates
Postmortems
Practicing emergencies
Have backups, be careful
Disk Erase
Rate Limits
Postmortem
Personal Lessons
Aviation Lessons
Risk Management
Postmortem
Automation
Self-driving cars
Air France 447
Qantas QF32 (Airbus A380)
Indonesia
Aircraft accident
Boeing
Certification
Make a difference
Conclusions
Q&A

Taught by

linux.conf.au
