Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Evaluating Language Models - Challenges and Best Practices

MLOps.community via YouTube

Overview

Explore the challenges and solutions for evaluating language models in this 23-minute lightning talk from the AI in Production Conference. Delve into the metrics and datasets available for assessment, and examine the difficulties of continuous evaluation in production environments. Learn about common pitfalls to avoid and gain insights from Matthew Sharp, author of "LLMs in Production" and a seasoned professional with over a decade of experience in ML/AI and deploying models to production. Discover the importance of contributing to public evaluation datasets and join the call for a community-wide effort to reduce harmful bias in language models. Gain valuable takeaways for improving language model evaluation practices in your own projects or organizations.

Syllabus

Evaluating Language Models // Matthew Sharp // AI in Production Conference Lightning Talk

Taught by

MLOps.community

Reviews

Start your review of Evaluating Language Models - Challenges and Best Practices

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.