Completed
Dhivehi Dataset
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Building Transformer Tokenizers - Dhivehi NLP #1
Automatically move to the next video in the Classroom when playback concludes
- 1 Intro
- 2 Dhivehi Project
- 3 Hurdles for Low Resource Domains
- 4 Dhivehi Dataset
- 5 Download Dhivehi Corpus
- 6 Tokenizer Components
- 7 Normalizer Component
- 8 Pre-tokenization Component
- 9 Post-tokenization Component
- 10 Decoder Component
- 11 Tokenizer Implementation
- 12 Tokenizer Training
- 13 Post-processing Implementation
- 14 Decoder Implementation
- 15 Saving for Transformers
- 16 Tokenizer Test and Usage
- 17 Download Dhivehi Models
- 18 First Steps