Explore the challenges of ensuring consistent reliability in microservices communication and learn how to achieve fault tolerance in Istio using observability-driven load management. Dive into the limitations of traditional mitigation strategies like circuit breakers and rate-limiting when dealing with metastable failures such as cascading failures and retry storms. Discover Aperture, an open-source load management system that introduces adaptive service protection and workload prioritization. Understand how Aperture Agent integrates with Istio using Envoy's External Authorization API to gain traffic insights and make informed decisions to safeguard against failures. Witness a real-world deployment showcasing Aperture's ability to protect multi-tenant databases like Apache Druid and PostgreSQL from overloads by adaptively scheduling GRPC and GraphQL traffic.
Achieving Fault Tolerance in Istio with Observability-Driven Load Management
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Syllabus
Achieving Fault Tolerance in Istio with Observability-Driven Load Management - Tanveer Gill
Taught by
CNCF [Cloud Native Computing Foundation]