Courses from 1000+ universities
Two years after its first major layoff round, Coursera announces another, impacting 10% of its workforce.
600 Free Google Certifications
Web Development
Software Development
Graphic Design
Functional Programming Principles in Scala
Mountains 101
Industrial Pharmacy-I
Organize and share your learning with Class Central Lists.
View our Lists Showcase
Explore all talks and presentations from SREcon. Dive deep into the latest insights, research, and trends from the world's leading experts.
Explore the journey of defining effective SLOs for data-intensive services, focusing on search engines. Learn about monitoring processes, consistency, and automated mitigation strategies for complex systems.
Explore Google's SRE training program, featuring hands-on exercises in a safe environment. Learn how SRE principles were applied to improve the curriculum, minimize toil, and enhance reliability through automation and monitoring.
Learn to quickly estimate system performance using base rates and napkin math, enabling informed decision-making in technical discussions and design processes without building systems first.
Learn to create a PID controller for autoscaling Kubernetes deployments, ensuring smooth scaling based on custom targets. Explore control theory principles and their application in SRE practices.
Transforming engineering culture: One SRE's journey from chaos to improved reliability, featuring practical tips on implementing SLIs, reducing incident response times, and fostering organizational change.
Discover how to identify and mitigate hidden vulnerabilities in microservice architectures using OpenTelemetry, with real-world examples from Google Maps' high-risk dependencies.
Explore biases in SRE, their impact on organizations, and strategies for mitigation. Learn to identify, discuss, and address cognitive biases and stereotypes to improve workplace equity and SRE integration.
Explore how Microsoft Teams adapted to enable remote education for millions during COVID-19, discussing challenges, solutions, and engineering strategies for large-scale online learning.
Explore a new model of operational debt in SRE, focusing on process gaps and risk. Learn to prioritize and address issues for improved service reliability and team collaboration.
Discover USAA's journey in establishing an IT-wide Postmortem Review meeting, with tips for implementing your own large-scale review process. Learn from their experiences and insights.
Explore SRE principles in high-frequency trading, balancing innovation with risk management. Learn strategies for safeguarding nanosecond performance in a regulated, failure-prone environment at scale.
Explore the impact of language in Site Reliability Engineering, examining key terms and their role in shaping industry practices and communication with stakeholders.
Explore how medical field strategies can enhance incident response in tech systems, from algorithm-guided decisions to standardized protocols for effective management and resolution.
Explore the limitations of digital troubleshooting compared to analog electronics, and discover why our mental models for digital systems are constrained.
Discover how LinkedIn uses spike detection in alert correlation to quickly identify root causes of outages in their complex microservices architecture, reducing false positives and engineer toil.
Get personalized course recommendations, track subjects and courses with reminders, and more.