Fine-Tuning Llama 3 on a Custom Dataset for RAG Q&A - Training LLM on a Single GPU

Venelin Valkov via YouTube

Overview

Learn how to fine-tune Llama 3 8B Instruct on a custom dataset for a RAG Q&A use case on a single GPU in this 33-minute tutorial. It covers why fine-tuning helps, an overview of the process, and the practical steps: preparing the dataset, setting up Google Colab, loading the model and tokenizer, building a custom dataset, establishing a baseline, training on completions, and configuring LoRA. After training, the fine-tuned model is pushed to the Hugging Face Hub and evaluated against the base model, showing how a smaller model fine-tuned for a specific task can outperform larger ones.
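
As a rough illustration of why LoRA makes single-GPU fine-tuning practical, the sketch below counts the trainable parameters LoRA adds for one adapted weight matrix and compares that with the full matrix. The 4096 x 4096 layer shape and rank 16 are illustrative assumptions, not values taken from the tutorial.

```python
def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameters in the two low-rank factors: A (rank x d_in) and B (d_out x rank)."""
    return rank * (d_in + d_out)


def full_params(d_in: int, d_out: int) -> int:
    """Parameters in the original dense weight matrix (d_out x d_in)."""
    return d_in * d_out


# Assumed shapes for illustration only.
d_in = d_out = 4096
rank = 16

lora = lora_trainable_params(d_in, d_out, rank)
full = full_params(d_in, d_out)
print(f"LoRA: {lora:,} trainable vs full: {full:,} ({lora / full:.2%})")
```

Because only the low-rank factors receive gradients, optimizer state and gradient memory shrink in proportion, which is what lets an 8B model fit on one GPU.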

Syllabus

- Why fine-tuning?
- Text tutorial on MLExpert.io
- Fine-tuning process overview
- Dataset
- Llama 3 8B Instruct
- Google Colab Setup
- Loading model and tokenizer
- Create custom dataset
- Establish baseline
- Training on completions
- LoRA setup
- Training
- Load model and push to Hugging Face Hub
- Evaluation comparing vs the base model
- Conclusion
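
The "Training on completions" step in the syllabus usually means computing the loss only on the answer tokens, not on the prompt. A minimal sketch of that idea, assuming the common convention of masking prompt positions with the label -100 (the default `ignore_index` of PyTorch's `CrossEntropyLoss`); the function name and token ids are hypothetical and not taken from the tutorial:

```python
IGNORE_INDEX = -100  # label value skipped by PyTorch's CrossEntropyLoss by default


def mask_prompt_labels(prompt_ids, completion_ids):
    """Build input ids and labels so that only completion tokens contribute to the loss."""
    input_ids = list(prompt_ids) + list(completion_ids)
    # Prompt positions are masked out; completion positions keep their token ids.
    labels = [IGNORE_INDEX] * len(prompt_ids) + list(completion_ids)
    return input_ids, labels


# Hypothetical token ids standing in for a tokenized prompt and answer.
input_ids, labels = mask_prompt_labels([101, 102, 103], [201, 202])
print(input_ids)
print(labels)
```

In practice a training library's data collator typically handles this masking; the sketch only shows the effect on the labels.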

Taught by

Venelin Valkov
