Courses from 1000+ universities
Two years after its first major layoff round, Coursera announces another, impacting 10% of its workforce.
600 Free Google Certifications
Web Development
Software Development
Graphic Design
Functional Programming Principles in Scala
Mountains 101
Industrial Pharmacy-I
Organize and share your learning with Class Central Lists.
View our Lists Showcase
Explore all talks and presentations from SREcon. Dive deep into the latest insights, research, and trends from the world's leading experts.
Explore Google's SRE team's approach to sublinear scaling, automation, and cultural shifts for managing 1000+ services efficiently without increasing staff.
Strategies for resilient data pipelines: observability, immutable inputs, declarative pipelines, and data validation. Reduce operational complexity, update risks, and accuracy concerns in complex systems.
Explore pragmatic automation strategies for reducing toil in cloud operations, drawing insights from a large public cloud provider's experiences and applicable to various work environments.
Explore scaling Kafka with limited resources, covering challenges, failures, and tactical approaches for managing complex distributed systems in a fast-growing business environment.
Explore core SRE principles, focusing on incident management, monitoring, and proactive approaches. Gain insights from a young engineer's journey transitioning from academia to industry.
Discover Pinterest's scalable API ownership framework, designed to manage 1700+ endpoints across 70+ teams, addressing challenges and implementing effective solutions for large-scale code management.
Learn from real Kubernetes production incidents and best practices for maintaining high cluster availability, focusing on challenges with large-scale deployments and common operational pitfalls.
Explore common pitfalls in Site Reliability Engineering and learn strategies to build successful SRE programs, drawing insights from industry leaders like Netflix and Google.
Explore strategies for repurposing existing tools and frameworks to establish an effective SRE organization, drawing from real-world experiences in building comprehensive delivery pipelines.
Explore principles of Chaos Engineering to improve resilience in distributed systems. Learn techniques for surfacing inherent chaos and building confidence in system behavior at scale.
Practical guide for new SRE leads on organizing teams, managing expectations, and implementing effective strategies. Covers phased approaches, lessons learned, and pitfalls to avoid in SRE journey.
Explore DNSControl, a DNS DSL and compiler enabling DevOps practices for DNS management. Learn how StackOverflow.com automates complex configurations, simplifies updates, and enhances reliability.
Insights on building an SRE team in a large enterprise, addressing challenges, establishing standards, and focusing on business priorities like security and cloud cost control.
Explores challenges and impacts of pervasive automation in SRE, discussing human factors, potential harm, and strategies for effective implementation in complex socio-technical systems.
Strategies for maximizing SRE impact with limited resources. Learn to leverage data, tools, and communication for effective small-scale SRE implementation and organizational influence.
Get personalized course recommendations, track subjects and courses with reminders, and more.