Overview
Learn about data annotation techniques, tools, and best practices for multilingual natural language processing in this 53-minute lecture from CMU's CS11-737 course. Explore topics such as finding and managing labelers, defining annotation tasks, interactive dictionary learning, and ensuring consistency and efficiency in the labeling process. Gain insights into popular labeling toolkits like BRAT and discover strategies for building effective multilingual NLP datasets.
Syllabus
Data Annotation
Finding Labelers
Non-native labeling
What can be labeled
Defining the task
Dictionary: Interactive Learning
Spice: Lex Learner Build Your User awb Language eng Project aug 19 Logout
Checking the Labeling
Checking for consistency
Checking for efficiency
Checking for usefulness
Choose the Task
Labeling Toolkits
BRAT Labeling Tool
Taught by
Graham Neubig