Completed
Preprocessing
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
How Replit Trained Their Own LLMs - LLM Bootcamp
Automatically move to the next video in the Classroom when playback concludes
- 1 Why train your own LLMs?
- 2 The Modern LLM Stack
- 3 Data Pipelines: Databricks & Hugging Face
- 4 Preprocessing
- 5 Tokenizer Training
- 6 Running Training: MosaicML, Weights & Biases
- 7 Testing & Evaluation: HumanEval, Hugging Face
- 8 Deployment: FasterTransformer, Triton Server, k8s
- 9 Lessons learned: data-centrism, eval, and collaboration
- 10 What makes a good LLM engineer?