Evaluation Techniques for Large Language Models

MLOps World: Machine Learning in Production via YouTube

Overview

Explore practical tools and best practices for evaluating and choosing Large Language Models (LLMs) in this tutorial presented by Rajiv Shah, Machine Learning Engineer at Hugging Face. See how the capabilities of LLMs compare with those of traditional ML models, and learn several evaluation techniques, including evaluation suites, head-to-head competition approaches, and using LLMs to evaluate other LLMs. Delve into the subtle factors that affect evaluation, such as the role of prompts, tokenization, and requirements for factual accuracy. Examine model bias and ethical considerations through working examples. Come away with an understanding of the tradeoffs among LLM evaluation methods, with reusable code provided in Jupyter Notebooks for each technique discussed.

Taught by

MLOps World: Machine Learning in Production
