Overview
Syllabus
Intro
Complex systems
Partial failure mode
The Famous 95 of availability
Availability in parallel
AWS Region and availability zones
Multi-AZ architecture
Theoretical blast radius
Typical service application
Partial availability zone failure
System properties . Workload isolation
Cascading Failures
Cell-based architecture
Shuffle sharding
Auto-scaling for self-healing
Decoupling with async pattern
Degrade & prioritize traffic with queues
Set the timeouts!
Backing off between retries
Idempotent operation
Shallow health check
Deep health check
Service Degradation & Fallbacks
Pattern B: Circuit Breaker
Database Federation
Database Sharding
Read/Write separation
Pattem 9+
Chaos engineering
Taught by
NDC Conferences