Completed
Insights from Cook's model Engineering resilience requires a model of safety based on: mentoring, responding, adapting, and learning System safety is about what can happen, where the operating point …
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Architectural Patterns of Resilient Distributed Systems
Automatically move to the next video in the Classroom when playback concludes
- 1 Architectural Patterns of Resilient Distributed Systems
- 2 Resilience is the ability of a system to adapt or keep working when challenges occur
- 3 Decomposing into subsystems independently intolerant to harvest degradation but the application can continue if they fail You can only provide strong consistency for the subsystems that need it Ortho…
- 4 Insights from Cook's model Engineering resilience requires a model of safety based on: mentoring, responding, adapting, and learning System safety is about what can happen, where the operating point …
- 5 Thinking about building system resilience using a single discipline is insufficient. We need different strategies.
- 6 Evolution of our purging system from vi to v3 Used Bimodal Multicast (Gossip protocol) to provide extremely fast purging speed Desian concerns & system evolution
- 7 Faild allows us to fail & recover hosts via MAC- swapping and ECMP on switches Do immediate or gradual host failure & recovery Watch Joao's talk SREcon15
- 8 Resiliency and testing are correlated. TEST! Versioning from the start - provide an upgrade path from day 1 Upgrades & evolvability of systems is still tricky. Mixed-mode operations need to be common…