Beyond Accuracy: Behavioral Testing of NLP Models with CheckList

Overview

Explore the CheckList methodology for testing NLP models in this 35-minute talk by Marco Túlio Ribeiro, Senior Researcher at Microsoft Research. CheckList is a task-agnostic approach, inspired by software engineering principles, that goes beyond traditional accuracy metrics. Learn about the bugs it uncovered in both commercial and research models, including systems from Microsoft, Amazon, and Google and popular models such as BERT and RoBERTa. Case studies and user feedback from researchers and engineers illustrate how CheckList can improve the testing and debugging of NLP models for practitioners and researchers alike.

Syllabus

Beyond Accuracy: Behavioral Testing of NLP Models with CheckList

Taught by

Toronto Machine Learning Series (TMLS)

Related Courses

Natural Language Processing with Attention Models

Advanced Natural Language Processing with Apache Spark NLP

Hugging Face Transformers - The Basics - Practical Coding Guides - NLP Models (BERT/RoBERTa)

Building World-Class NLP Models with Transformers and Hugging Face

Building Better Language Models - Paradigms and Techniques

Cohere AI's LLM for Semantic Search in Python
