DevOps to MLOps: Scaling ML Models to 2 Million+ Requests per Day

Overview

Explore the journey from DevOps to MLOps in this conference talk that delves into scaling machine learning models to handle over 2 million requests per day. Learn about the key steps in MLOps, including model development, deployment, and monitoring. Discover a real-world case study on eKYC SaaS APIs, examining the cloud-agnostic architecture and scaling strategies employed. Gain insights into eliminating single points of failure, capacity planning, and cost optimization through autoscaling. Analyze production issues related to GPU utilization and high latency, and extract valuable lessons for implementing MLOps at scale. Enhance your understanding of the challenges and solutions in transitioning from traditional DevOps to MLOps for large-scale machine learning applications.

Syllabus

intro
preamble
chinmay naik
agenda
what is mlops
mlops steps
simpelst mlops flow
production work ahead
case study - ekyc saas apis
ml model apis
architecture
ekyc saas apis - requirements
cloud agnostic architecture
why cloud agnostic?
scaling journey
eliminate single points of failure
capacity planning
cost optimization and autoscaling
production issue 1 - gpu utilization in nomad
production issue 2 - high latency issue
lessons
keep learning

Taught by

Conf42

Reviews

Start your review of DevOps to MLOps: Scaling ML Models to 2 Million+ Requests per Day

100 Most Popular Courses For October

Most common

Popular subjects

Popular courses

DevOps to MLOps: Scaling ML Models to 2 Million+ Requests per Day

Overview

Syllabus

Taught by

Reviews

100 Most Popular Courses For October

Taught by

DevOps, DataOps, MLOps

MLOps Platforms: Amazon SageMaker and Azure ML

AI Inference Workloads - Solving MLOps Challenges in Production

10 Best Machine Learning Courses for 2024: Scikit-learn, TensorFlow, and more

9 Best Microservices Courses for 2024: Scalability, Block by Block

Never Stop Learning.