Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Evaluating LLMs for AI Risk - Techniques for Red Teaming Generative AI

MLOps.community via YouTube

Overview

Explore cutting-edge techniques for evaluating and stress-testing Large Language Models (LLMs) in this lightning talk from the LLMs in Production Conference III. Learn about a comprehensive framework for managing AI risk throughout the model lifecycle, from data collection to production deployment. Discover methods for red-teaming generative AI systems and building validation engines that algorithmically probe models for security, ethics, and safety issues. Gain insights into failure modes, automation strategies, and specific testing approaches such as prompt injection attacks, prompt extraction, data transformation, and model alignment tests. Equip yourself with the knowledge to effectively assess and mitigate potential risks associated with LLMs in production environments.

Syllabus

Introduction
Why Red teaming
What is Red teaming
What to test
Failure modes
Automation
Red teaming
Prompt injection attack
Prompt extraction
Data transformation
Model alignment test
Summary

Taught by

MLOps.community

Reviews

Start your review of Evaluating LLMs for AI Risk - Techniques for Red Teaming Generative AI

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.