Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Distortion-Free Mechanisms for Language Model Provenance - Watermarking and Training Independence

Simons Institute via YouTube

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Watch a research lecture exploring mechanisms for establishing provenance in language model artifacts, focusing on both text and model weights. Learn about innovative watermarking techniques for autoregressive language model outputs that remain robust even when a constant fraction of text is edited, developed through collaboration with John Thickstun, Tatsu Hashimoto, and Percy Liang. Discover methods for testing the independence of language model training processes by examining model weights, presented from research conducted with Sally Zhu, Ahmed Ahmed, and Percy Liang. Gain insights into the latest developments in language model security, alignment, and copyright protection through this technical presentation from Stanford University researcher Rohith Kuditipudi at the Simons Institute.

Syllabus

Distortion-free mechanisms for language model provenance

Taught by

Simons Institute

Reviews

Start your review of Distortion-Free Mechanisms for Language Model Provenance - Watermarking and Training Independence

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.