Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore how to elevate MLflow serving to production-grade status in this 28-minute conference talk from Databricks. Learn about deploying machine learning models as REST API endpoints and advancing them to containerized production environments. Discover techniques for implementing custom middlewares, monitoring, logging, and performance optimization for high-scale applications. Gain insights into Yotpo's approach to making MLflow serving production-ready, covering topics such as continuous delivery, request transformation, exporting metrics, deployment strategies, and monitoring best practices. Delve into optimizations and control mechanisms to enhance your MLflow serving capabilities for real-world, high-performance scenarios.