Methods for Evaluating Your GenAI Application Quality

Overview

This 37-minute conference talk from Databricks covers methods for evaluating the quality of Generative AI applications. It walks through a suite of tools, including inference tables, Lakehouse Monitoring, and MLflow, for evaluating and quality-checking model responses, and it covers both offline evaluation and real-time monitoring. Topics include best practices for using LLMs as judges, MLflow for experiment tracking, and inference tables and Lilac for model management. Presented by Alkis Polyzotis and Michael Carbin, the talk is aimed at developers and data scientists working with Generative AI.

Syllabus

Methods for Evaluating Your GenAI Application Quality

Taught by

Databricks
