Courses from 1000+ universities
Two years after its first major layoff round, Coursera announces another, impacting 10% of its workforce.
600 Free Google Certifications
Digital Marketing
Computer Science
Graphic Design
Mining Massive Datasets
Making Successful Decisions through the Strategy, Law & Ethics Model
The Science of Well-Being
Organize and share your learning with Class Central Lists.
View our Lists Showcase
Explore all talks and presentations from SREcon. Dive deep into the latest insights, research, and trends from the world's leading experts.
Explore pragmatic automation strategies for reducing toil in cloud operations, drawing insights from a large public cloud provider's experiences and applicable to various work environments.
Explore scaling Kafka with limited resources, covering challenges, failures, and tactical approaches for managing complex distributed systems in a fast-growing business environment.
Explore best practices for implementing and improving code review processes, focusing on organizational culture, author communication, and reviewer strategies for high-quality software development.
Comprehensive overview of load balancing techniques across network layers, exploring technologies and tradeoffs for fast, reliable multi-region services at Internet scale.
Transforming junior engineers into SREs: creating a supportive environment, fostering a 'Culture of Error', encouraging mentorship, and prioritizing skill development for successful transitions and career growth.
Interactive workshop on designing distributed systems, covering scaling, failure handling, reliability, and consistency. Participants apply concepts to real-world scenarios using cloud components.
Explore categories of unforeseen, catastrophic system failures and strategies to fortify against them, including capacity testing, incident management, and communication.
Explore real-world insights on implementing testing in production for large-scale microservices, focusing on architecture, capacity planning, and infrastructure components.
Explores challenges of SRE autonomy, tool diversity, and development duplication. Presents solutions using a widely-adopted internal tool's history, addressing selection autonomy and automation strategies.
Explore how software engineering can learn from NASA's Challenger incident, drawing parallels to modern reliability challenges and navigating complexity in system architecture.
Learn to predict resource exhaustion in external systems using linear regression, enabling proactive planning and sizing for improved application performance.
Explore cognitive science techniques to enhance SRE learning, incident response, and team collaboration. Optimize human observability for better system understanding and problem-solving.
Exploring hidden complexities in SRE incident response, revealing surprising findings on coordination strategies, tooling impacts, and adaptive choreography in managing service outages.
Discover how Honeycomb improved system reliability through intentional node termination, uncovering bugs and progressing towards continuous experimentation for enhanced resilience and scalability.
Explore how technical decisions shape DevOps culture, focusing on Two Sigma's COIN platform. Learn strategies for fostering collaboration, innovation, and community in tech environments.
Get personalized course recommendations, track subjects and courses with reminders, and more.