Completed
Autoscaling for ML Models
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
State of Ray Serve in 2.0 - Features and Updates for Multi-model Inference
Automatically move to the next video in the Classroom when playback concludes
- 1 Intro
- 2 Working Example: Content Understanding
- 3 Content Understanding Architecture
- 4 Requirements for Online Inference
- 5 Basic Solution: Multi-model Monolith
- 6 Ray Serve is built for Multi-model Inference
- 7 Model Composition Requirements
- 8 Solution: Model Composition API
- 9 Model Composition Pattern
- 10 Ray Serve Model Composition API
- 11 Autoscaling for ML Models
- 12 Production Hardening
- 13 Chaos Testing: 99.99% uptime