Overview
Explore strategies for planning and handling failures across diverse industries in this 44-minute talk by Marc Merlin from Google. Delve into real-world examples from open hardware, aviation, and Google's production environment to gain insights on effective failure management. Learn about the importance of spares, change requests, unit tests, continuous integration, and rollouts. Examine the critical role of postmortems, emergency planning, and automation in mitigating risks. Analyze case studies from aviation, including Air France 447 and the Boeing 737 Max, to understand the complexities of automation and human factors. Discover the impact of management pressure and regulatory oversight on safety practices. Gain valuable knowledge to enhance your approach to failure prevention and response in various technological contexts.
Syllabus
Intro
Learning by Example
Samsung Note 7s
Spares
Other failures
Change requests
TBRS
Unit Tests
Continuous Integration
Flakes
Rollouts
Postmortem
Emergency Planning
Online Commands
Automation
Postmoderns
Have a Plan
Aviation
Automation in Aviation
Automation in Cars
Air France 447
Automation is bulletproof
Boeing 737 Max
FAA
Management Pressure
The FAA
Outro
Taught by
Linux Foundation