Explore a 20-minute conference talk from USENIX ATC '24 that introduces MSFRD (Mutation Similarity based SSD Failure Rating and Diagnosis), a novel approach for predicting and diagnosing SSD failures in complex data center environments. Learn how this scheme utilizes Telemetry data to dynamically detect internal SSD mutations and compare them to historical failure patterns, enabling more accurate failure prediction and early rating. Discover how MSFRD improves upon existing methods, offering a 23.8% increase in precision and a 38.9% boost in recall for failure prediction. Gain insights into the scheme's effectiveness in failure rating and progressive diagnosis, and understand its potential impact on storage reliability and performance in large-scale data center operations.
Overview
Syllabus
USENIX ATC '24 - MSFRD: Mutation Similarity based SSD Failure Rating and Diagnosis for Complex...
Taught by
USENIX