Overview
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the intricacies of deploying deep learning models in production at Booking.com in this 31-minute EuroPython 2017 conference talk. Delve into the challenges and solutions for serving model predictions, covering topics such as training models in Docker containers, automated retraining processes, and deployment using Kubernetes. Learn about optimizing prediction serving for both latency and throughput in a containerized environment. Gain valuable insights into the lifecycle of a model, from initial training on a laptop to full-scale production deployment. Discover practical applications in image tagging and recommendation engines, and understand how to overcome bottlenecks in putting machine learning models into production. Perfect for data scientists and engineers looking to bridge the gap between model development and real-world implementation.
Syllabus
Intro
Agenda
Image Tagging
Recommendation Engine
Lifecycle of a model
Training a Model - on laptop
Deploying a Model
Performance
Optimizing for Latency
Optimizing for Throughput
Summary
Want to get in touch?
Taught by
EuroPython Conference