Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

CNCF [Cloud Native Computing Foundation]

Enhancing the Performance Testing Process for gRPC Model Inferencing at Scale

CNCF [Cloud Native Computing Foundation] via YouTube

Overview

Explore the intricacies of performance testing for gRPC model inferencing at scale in this informative conference talk. Discover how to set up a Kubernetes cluster with KServe's ModelMesh for high-density deployment of machine learning models. Learn about load testing thousands of models and utilizing Prometheus and Grafana for monitoring key performance metrics. Gain insights into the complexities of model deployment, scalability challenges, and the features of Model Mesh. Delve into the automation of performance testing, including the setup of testing environments, QFlow pipeline, and K6 load tools. Witness a demonstration of the testing process, analyze testing logs and results, and understand the implications of cashmiss actions. Evaluate the benefits of using Model Mesh for your specific use case.

Syllabus

Introduction
Model Deployment
Kubernetes
Complexities
Kserve
Scalability
Model Mesh
Model Mesh Features
Performance Testing Automation
Performance Testing Setup
Performance Testing Environment
QFlow Pipeline
K6 Load Tools
GRPC
Prometheus
Demo
Testing
Testing Log
Testing Results
Cashmiss Action
Should I use Model Mesh

Taught by

CNCF [Cloud Native Computing Foundation]

Reviews

Start your review of Enhancing the Performance Testing Process for gRPC Model Inferencing at Scale

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.