Deploy LLMs More Efficiently with vLLM and Neural Magic

Deploy LLMs More Efficiently with vLLM and Neural Magic

Neural Magic via YouTube Direct link

Quantization

15 of 18

15 of 18

Quantization

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Deploy LLMs More Efficiently with vLLM and Neural Magic

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Introduction
  2. 2 Our Vision and Mission
  3. 3 History of Open Source AI
  4. 4 Advantages of Open Source
  5. 5 Deployment Paradigms
  6. 6 What is a VM
  7. 7 Who Neural Magic is
  8. 8 Our Mission
  9. 9 Why vLLM
  10. 10 VM Adoption
  11. 11 Hardware Support
  12. 12 Neural Magics Role in VM
  13. 13 Neural Magics Business
  14. 14 Stable Distribution of vLLM
  15. 15 Quantization
  16. 16 Case Study
  17. 17 Model Registry
  18. 18 Scalable Deployment

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.