Completed
- Decompressing all of the gzip archives
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Building an LLM Fine-Tuning Dataset - From Reddit Comments to QLoRA Training
Automatically move to the next video in the Classroom when playback concludes
- 1 - Introduction to Dataset building for fine-tuning.
- 2 - The Reddit dataset options Torrent, Archive.org, BigQuery
- 3 - Exporting BigQuery Reddit and some other data
- 4 - Decompressing all of the gzip archives
- 5 - Re-combining the archives for target subreddits
- 6 - How to structure the data
- 7 - Building training samples and saving to database
- 8 - Creating customized training json files
- 9 - QLoRA training and results