Explore the challenges and solutions of building a cloud-agnostic, privately hosted Large Language Model (LLM) serving platform on Kubernetes in this 26-minute conference talk. Discover how Predibase tackled the complexities of hosting LLMs, including their large size and heavy GPU resource requirements. Learn about the architecture of their control plane and data plane, secured with an Istio service mesh, and their use of KEDA for event-driven autoscaling to support serverless inference of open-source models. Gain practical insight into deploying LLMs and applying these tools and techniques to your own organization's LLM hosting needs.
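The event-driven autoscaling pattern the talk describes can be sketched with a KEDA ScaledObject that scales an inference Deployment to zero when idle. The deployment name, namespace, Prometheus address, and metric query below are illustrative assumptions, not details from the talk:

```yaml
# Hypothetical KEDA ScaledObject: scales an LLM inference Deployment
# between 0 and 4 replicas based on pending inference requests.
apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: llm-inference-scaler          # illustrative name
  namespace: serving                  # assumed namespace
spec:
  scaleTargetRef:
    name: llm-inference               # assumed Deployment running the model server
  minReplicaCount: 0                  # scale to zero when idle (serverless inference)
  maxReplicaCount: 4                  # capped by available GPUs
  cooldownPeriod: 300                 # seconds idle before scaling back to zero
  triggers:
    - type: prometheus
      metadata:
        serverAddress: http://prometheus.monitoring:9090  # assumed endpoint
        query: sum(llm_pending_requests)                  # hypothetical metric
        threshold: "1"                # roughly one replica per pending request
```

Once applied with `kubectl apply -f`, the KEDA operator manages a Horizontal Pod Autoscaler for the Deployment and drives the replica count from the Prometheus query, which is what enables scale-to-zero for expensive GPU-backed model servers.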