Building Transformer Tokenizers - Dhivehi NLP #1

Building Transformer Tokenizers - Dhivehi NLP #1

James Briggs via YouTube Direct link

Dhivehi Dataset

4 of 18

4 of 18

Dhivehi Dataset

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Building Transformer Tokenizers - Dhivehi NLP #1

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 Dhivehi Project
  3. 3 Hurdles for Low Resource Domains
  4. 4 Dhivehi Dataset
  5. 5 Download Dhivehi Corpus
  6. 6 Tokenizer Components
  7. 7 Normalizer Component
  8. 8 Pre-tokenization Component
  9. 9 Post-tokenization Component
  10. 10 Decoder Component
  11. 11 Tokenizer Implementation
  12. 12 Tokenizer Training
  13. 13 Post-processing Implementation
  14. 14 Decoder Implementation
  15. 15 Saving for Transformers
  16. 16 Tokenizer Test and Usage
  17. 17 Download Dhivehi Models
  18. 18 First Steps

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.