Explore a critical incident in Netflix's distributed caching system in this 15-minute conference talk from SREcon24 Americas. Learn about a high-performance replication engine that processes 30 million requests per second, and how a potential crisis that threatened a global business launch was averted. Follow the speakers' debugging journey and pick up insights applicable to any organization running distributed systems. See how simple assumptions can disrupt an entire technology stack, and discover techniques for detecting and resolving problems early in complex distributed environments.