Courses from 1000+ universities
Discover an easier way to explore affordable, credit-worthy online courses with our expanded community college catalog.
600 Free Google Certifications
Web Development
Python
Graphic Design
Astronomy: Exploring Time and Space
Inglés empresarial: ventas, gestión y liderazgo
AI and Big Data in Global Health Improvement
Organize and share your learning with Class Central Lists.
View our Lists Showcase
Explore all talks and presentations from SREcon. Dive deep into the latest insights, research, and trends from the world's leading experts.
Explore strategies for building distributed service ownership in software teams, focusing on documentation, telemetry, and empowering teams to drive improvements in system reliability and performance.
Learn from real Kubernetes production incidents and best practices for maintaining high cluster availability, focusing on challenges with large-scale deployments and common operational pitfalls.
Explore the intersection of SRE and Machine Learning, discussing its importance, challenges in ML reliability, and potential changes to the SRE profession. Skeptically examines ML automation in production.
Exploring sociotechnical systems in SRE, balancing operational load with learning, and examining knowledge sharing and risk reduction. Offers new perspectives on organizational structures shaping work practices.
Explore how Equinix Metal implemented OpenTelemetry tracing for bare metal provisioning, improving debugging and reliability across their global infrastructure.
Explore techniques to enhance observability data visualization for SREs, including multivariate relationships, small multiples, and sparklines, while avoiding common pitfalls in engineering presentations.
Explore the relationship between DevOps and SRE, their effectiveness, and how elite software delivery teams can benefit from modernizing technical operations.
Explore the future of Above-the-Line tooling in complex systems, its challenges, and potential functions. Learn from experts about improving system monitoring and human performance in critical environments.
Challenges industry-standard incident metrics, proposing alternative approaches for better system resilience. Explores VOID database insights to improve incident response and organizational learning.
Explore eBPF, a revolutionary Linux kernel technology for infrastructure observation and protection. Learn its capabilities, use cases, and implementation for enhanced system reliability and security.
Strategies for effective mentorship in SRE, focusing on remote work challenges and the importance of guiding principles, documentation, and sponsorship for career growth.
Explore innovative analytical methods for high-fidelity insights in distributed systems, focusing on performance analytics, statistical approaches, and practical applications in complex services at scale.
Automated OS certification at LinkedIn: Reducing toil, increasing velocity. Discusses project evolution, challenges, and early results in enhancing certification processes across data centers and Azure.
Discover how McGraw Hill applied 'Upstream Thinking' to thrive during the pandemic, scaling services and implementing SRE principles to support millions of remote learners without compromising security or incurring high costs.
Strategies for demonstrating ROI in SRE and availability work, addressing challenges and potential regulatory issues to ensure continued investment and positive outcomes.
Get personalized course recommendations, track subjects and courses with reminders, and more.