Azure OpenAI Deployment Types and Resiliency - Understanding Models, Capacity, and High Availability

John Savill's Technical Training via YouTube

Classroom Contents

  1. Introduction
  2. Generative API is stateless
  3. Regional Azure OpenAI resource
  4. Capacity pools
  5. Responsible AI
  6. Model deployment types
  7. Standard
  8. Global
  9. Network vs inference latency
  10. Intelligent routing
  11. Quota vs available capacity
  12. Data zone and data residency
  13. Availability benefits?
  14. Resource is regional
  15. Multiple regional resources
  16. Enabling in the application
  17. API Management
  18. Prompt caching impact
  19. Provisioned service
  20. PayGo features
  21. PTU features
  22. Azure reservations
  23. Batch service
  24. Summary
  25. Close
