Completed
- Intelligent routing
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Azure OpenAI Deployment Types and Resiliency - Understanding Models, Capacity, and High Availability
Automatically move to the next video in the Classroom when playback concludes
- 1 - Introduction
- 2 - Generative API is stateless
- 3 - Regional Azure OpenAI resource
- 4 - Capacity pools
- 5 - Responsible AI
- 6 - Model deployment types
- 7 - Standard
- 8 - Global
- 9 - Network vs inference latency
- 10 - Intelligent routing
- 11 - Quota vs available capacity
- 12 - Data zone and data residency
- 13 - Availability benefits?
- 14 - Resource is regional
- 15 - Multiple regional resources
- 16 - Enabling in the application
- 17 - API Management
- 18 - Prompt caching impact
- 19 - Provisioned service
- 20 - PayGo features
- 21 - PTU features
- 22 - Azure reservations
- 23 - Batch service
- 24 - Summary
- 25 - Close