Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

How Tracing Uncovers Half-truths in Slack's CI Infrastructure

Strange Loop Conference via YouTube

Overview

Explore how tracing uncovers half-truths in Slack's CI infrastructure in this 23-minute conference talk from Strange Loop. Discover why traditional monitoring tools like logs and metrics were insufficient for debugging CI system failures. Learn how traces provided critical capabilities for understanding fault occurrences in interconnected systems such as GHE, Checkpoint, and Cypress. Gain insights into shared tooling for high-dimensionality event traces using SlackTrace and SpanEvents, and how they increased velocity in diagnosing code and debugging complex system interactions. Follow the journey from early incidents that motivated investment in internal tooling to improvements in performance and resiliency across Slack's infrastructure. Delve into topics including developer productivity, span event structure, shared dimensions, use cases, fuzzy service boundaries, incident command systems, and testing changes.

Syllabus

Intro
Developer Productivity
Span Event Structure
Whats Next
Shared Dimensions
Use Cases
The Root Challenge
The Results
Fuzzy Service Boundaries
Incident Command System
Testing Changes
Summary

Taught by

Strange Loop Conference

Reviews

Start your review of How Tracing Uncovers Half-truths in Slack's CI Infrastructure

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.