YouTube

Real-Time Adaptive Controls for Resilient Distributed Systems

USENIX via YouTube

Overview

Explore a conference talk on implementing real-time adaptive controls that make distributed systems more resilient. See how CrowdStrike dynamically tunes service parameters using techniques inspired by TCP congestion control. The method samples errors and latencies in real time and adjusts parameters automatically, eliminating the need for periodic manual tuning. Discover the challenges and lessons learned from deploying this feature in CrowdStrike's production environment, which handles trillions of events daily. Gain insights into minimizing the configuration surface, reducing operational toil, and preventing overload and cascading failures in modern services with hundreds of tunables.
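
As a rough illustration of what "inspired by TCP congestion control" can mean, the sketch below applies additive increase / multiplicative decrease (AIMD), the classic TCP scheme, to a concurrency limit driven by sampled latencies and errors. This listing does not describe the talk's actual algorithm, so the class name `AIMDLimit`, its parameters, and every default value are hypothetical assumptions chosen for illustration only.

```python
from dataclasses import dataclass


@dataclass
class AIMDLimit:
    """Hypothetical adaptive concurrency limit (not CrowdStrike's implementation)."""

    limit: float = 10.0               # allowed in-flight requests (assumed start value)
    min_limit: float = 1.0            # never throttle below this
    max_limit: float = 1000.0         # never grow above this
    increase: float = 1.0             # additive step after a healthy sample
    backoff: float = 0.5              # multiplicative factor after an unhealthy sample
    latency_threshold_s: float = 0.2  # assumed latency that signals overload

    def observe(self, latency_s: float, error: bool) -> float:
        """Update the limit from one sampled request and return the new limit."""
        if error or latency_s > self.latency_threshold_s:
            # Overload signal: cut the limit sharply, as TCP does on packet loss.
            self.limit = max(self.min_limit, self.limit * self.backoff)
        else:
            # Healthy sample: probe for more capacity one step at a time.
            self.limit = min(self.max_limit, self.limit + self.increase)
        return self.limit


if __name__ == "__main__":
    ctl = AIMDLimit()
    samples = [(0.05, False), (0.06, False), (0.50, False), (0.04, True), (0.05, False)]
    for latency, err in samples:
        new_limit = ctl.observe(latency, err)
        print(f"latency={latency:.2f}s error={err} -> limit={new_limit:.1f}")
```

The appeal of this shape is that the limit follows observed behavior rather than a hand-set constant: it grows slowly while the service is healthy and backs off quickly at the first sign of overload, which is the kind of tunable that would otherwise need periodic manual adjustment.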

Syllabus

SREcon22 APAC - Real-Time Adaptive Controls for Resilient Distributed Systems

Taught by

USENIX
