Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Scalable Anomaly Detection - With Zero Machine Learning

Strange Loop Conference via YouTube

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore a comprehensive talk from the Strange Loop Conference on building a scalable anomaly detection system without using machine learning. Dive into Netflix's approach to detecting and pinpointing failures in their complex cloud architecture, composed of thousands of services and hundreds of thousands of VMs and containers. Learn how Zuul, Netflix's front-door for all cloud traffic, is leveraged to stream real-time events and identify broken paths in their microservices maze. Discover the innovative use of stream processing, anomaly detection algorithms, and a rules engine to create an efficient system capable of handling millions of requests across thousands of nodes. Gain insights into the benefits of using "old-fashioned math" over machine learning in certain scenarios, and understand the implementation of dynamic and adaptive thresholds. Examine the anomaly detection algorithm in-depth, including median estimation, MAD, and recovery detection. Explore the impact assessment process, data visualization techniques, and the use of Spinnaker events for more accurate problem identification. Understand how this system provides real-time alerting, reduces operational burden, and improves accuracy in detecting service issues within Netflix's complex microservices architecture.

Syllabus

Introduction
Netflixs Microservices
Time Series Database
Alerts Only for Zul
Dynamic Thresholds
Adaptive Thresholds
Anomaly Detector Raju
Results
No Machine Learning
How We Built It
Impact Graph
Context
Accuracy
Operational Burden
Realtime alerting
Realtime events
Mantis
How it works
Querying
Stream Processing
Aggregate
Job Chain
Requirements
Median estimation
Mad
Raju
Simple
Recovery Detection
Recovery Algorithm
What Raju Looks Like
Permutations
Data Visualization
Impact Assessment
Timeline of Events
What is API
Another needle in a haystack
Example
Gold Standard KPI
Spinnaker Events
Emailing the culprits
Benefits
Conclusion

Taught by

Strange Loop Conference

Reviews

Start your review of Scalable Anomaly Detection - With Zero Machine Learning

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.