Sentencepiece Tokenizer With Offsets for T5, ALBERT, XLM-RoBERTa and Many More

Abhishek Thakur via YouTube

Overview

Learn how to implement Google's SentencePiece tokenizer with offsets for question-answering systems in this 25-minute video tutorial. See how to use the tokenizer with ALBERT and other transformer-based models by modifying the data processing functions from previous lessons. The video walks through encoding, offsets, building a class, and formatting data, with practical code examples. The complete implementation is available on Kaggle, and the tutorial builds on related lessons about transformer models and question-answering systems.
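
To give a feel for what "offsets" means here, the following is a minimal sketch and not the code from the video. It assumes the pieces have already been produced by a SentencePiece model (for example with `SentencePieceProcessor.encode_as_pieces`) and that normalization did not alter any characters of the input. The function name `piece_offsets` is hypothetical. Each piece is mapped to a (start, end) character span in the original text, which is the information a question-answering model needs to turn predicted token positions back into an answer string.

```python
from typing import List, Tuple

# SentencePiece marks whitespace with U+2581 ("▁") at the start of a piece.
SPACE_MARKER = "\u2581"


def piece_offsets(text: str, pieces: List[str]) -> List[Tuple[int, int]]:
    """Map each piece to a (start, end) character span in `text`.

    Hypothetical helper for illustration. Assumes the pieces reproduce the
    characters of `text` exactly (no normalization changes).
    """
    offsets = []
    cursor = 0  # position in `text` just past the previous piece
    for piece in pieces:
        surface = piece.replace(SPACE_MARKER, "")
        if not surface:
            # A bare whitespace marker has no visible characters of its own.
            offsets.append((cursor, cursor))
            continue
        start = text.find(surface, cursor)
        if start == -1:
            raise ValueError(f"piece {piece!r} not found after position {cursor}")
        end = start + len(surface)
        offsets.append((start, end))
        cursor = end
    return offsets


if __name__ == "__main__":
    text = "Hello world"
    pieces = ["\u2581Hello", "\u2581wor", "ld"]  # illustrative pieces, not real model output
    for piece, (start, end) in zip(pieces, piece_offsets(text, pieces)):
        print(piece, (start, end), repr(text[start:end]))
```

Slicing the original text with a span, as in `text[start:end]`, recovers the surface form of each piece. If the tokenizer's normalization rewrites characters, this simple search can fail, so treat it as an illustration of the idea only.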

Syllabus

Introduction
First Guest
The Problem
Encoding
Offsets
Class
Format Data
Outro

Taught by

Abhishek Thakur