Explore reliable hallucination detection techniques for large language models in this 35-minute AI in Production talk by Jiaxin Zhang. Examine what trustworthiness means for modern language models and where existing detection approaches based on self-consistency fall short. Discover two types of hallucinations, arising from question-level and model-level issues, that self-consistency checks alone cannot reliably identify. Learn about SAC3 (semantic-aware cross-check consistency), a novel sampling-based method that extends the principle of self-consistency checking with two additional mechanisms: semantically equivalent question perturbation, which targets question-level hallucinations, and cross-model response consistency checking, which targets model-level ones. Gain insights from extensive empirical analysis demonstrating SAC3's superior ability to distinguish factual from non-factual statements across multiple question-answering and open-domain generation benchmarks.
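To make the cross-check idea concrete, here is a minimal, illustrative sketch (not the talk's actual implementation): it assumes a hypothetical model-call interface and hard-coded paraphrases, where in practice the paraphrases would be generated by an LLM and the answers compared by semantic equivalence rather than exact match.

```python
# Sketch of SAC3-style cross-check consistency (illustrative only).
# The "models" below are toy stand-ins for real LLM calls, and the
# paraphrases would normally be generated automatically.

def consistency_score(answers):
    """Fraction of answer pairs that agree (1.0 = fully consistent)."""
    pairs = [(a, b) for i, a in enumerate(answers) for b in answers[i + 1:]]
    if not pairs:
        return 1.0
    return sum(a == b for a, b in pairs) / len(pairs)

def sac3_check(question, paraphrases, models):
    """Cross-check the same question, plus semantically equivalent
    perturbations of it, across several models.

    Low agreement across paraphrases suggests a question-level
    hallucination; low agreement across models suggests a model-level
    one. Returns the combined pairwise agreement score.
    """
    answers = [model(q) for q in [question, *paraphrases] for model in models]
    return consistency_score(answers)

# Toy stand-ins: one reliable model, one sensitive to question phrasing.
good_model = lambda q: "Paris"
flaky_model = lambda q: "Paris" if "capital" in q else "Lyon"

score = sac3_check(
    "What is the capital of France?",
    [
        "Which city serves as the seat of government in France?",
        "Name France's principal administrative city.",
    ],
    [good_model, flaky_model],
)
print(score < 1.0)  # → True: inconsistency flags a hallucination risk
```

The key design point the talk highlights is that checking one model's answers to one question (plain self-consistency) can miss both failure modes above; perturbing the question and adding a second model exposes them.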
Syllabus
Reliable Hallucination Detection in Large Language Models // Jiaxin Zhang // AI in Production Talk
Taught by
MLOps.community