Courses from 1000+ universities
Two years after its first major layoff round, Coursera announces another, impacting 10% of its workforce.
600 Free Google Certifications
Web Development
Software Development
Graphic Design
Functional Programming Principles in Scala
Mountains 101
Industrial Pharmacy-I
Organize and share your learning with Class Central Lists.
View our Lists Showcase
Explore all talks and presentations from SREcon. Dive deep into the latest insights, research, and trends from the world's leading experts.
Explore canarying best practices, pitfalls, and strategies for safe production changes. Learn to balance priorities, handle diverse scenarios, and implement effective canary processes in software deployment.
Humorous talk contrasting idealized SRE practices with real-world challenges. Speakers debunk perfect environments, offering practical insights and relatable experiences for SRE professionals.
Explore Netflix's multi-region strategy for improved availability and latency, including algebraic models, incident management, and design considerations for efficient failovers and user steering.
Explore Wikipedia's server-side architecture, from routers to microservices, and learn how open-source technologies power one of the world's top websites.
Discover strategies to enhance organizational resilience through improved incident learning. Explore research-backed approaches to post-incident reviews and avoid common investigation pitfalls.
Practical guidance on implementing Site Reliability Engineering in smaller organizations, addressing unique challenges, gaining buy-in, and fostering a culture of continuous improvement and experimentation.
Introductory overview of formal verification techniques in industry, focusing on safety-critical systems. Explores tools, applications, and adaptability to existing infrastructures.
Explore Pinterest's journey in scaling observability tools, from metrics to log search and distributed tracing, as the company grew from startup to web-scale platform.
Explore Adaptive Paging, an innovative alert handler that uses tracing and heuristics to identify and notify the team closest to the problem, reducing alert fatigue in complex distributed systems.
Exploring distributed tracing in real-time data streaming systems, focusing on challenges and solutions for trading platforms, including session tracking, data flow management, and storage optimization.
Explore principles and tools for safer production environments through automation, safe proxies, and audited break-glass, reducing human errors and insider threats in system operations.
Explores limitations of Machine Learning in production engineering, debunking common misconceptions and discussing potential feasible applications for SREs.
Learn how Squarespace's team adopted SRE practices to transform their unreliable logging platform into a trusted system with 99.9% uptime, sharing valuable insights and strategies for improving service reliability.
Explore systems thinking for safety and cybersecurity, integrating approaches to manage emergent properties and control problems in complex systems.
Explore strategies for efficient systems data management, including sampling and aggregation techniques, to maintain crucial information while reducing data volume and costs.
Get personalized course recommendations, track subjects and courses with reminders, and more.