Towards Robust GenAI: Techniques for Evaluating Enterprise LLM Applications

MLOps World: Machine Learning in Production via YouTube

Overview

Explore techniques for evaluating enterprise LLM applications in this 45-minute conference talk from MLOps World: Machine Learning in Production. Delve into the challenges of assessing the performance and safety of increasingly capable language models. Examine the limitations of traditional human evaluation methods and their impact on enterprise AI adoption. Discover emerging automated evaluation solutions that combine real-time "micro evaluators" with strategic human feedback loops. Learn how to gain continuous insight into a model's strengths, weaknesses, and blind spots. By the end of the talk, you will have strategies for confidently deploying language models in your applications and products and for making your generative AI systems more robust.

Syllabus

Towards Robust GenAI: Techniques for Evaluating Enterprise LLM Applications

Taught by

MLOps World: Machine Learning in Production
