Fine-Tuning Llama 3 on a Custom Dataset for RAG Q&A - Training LLM on a Single GPU

Overview

Learn how to fine-tune Llama 3 on a custom dataset for a RAG Q&A use case using a single GPU in this 33-minute tutorial. It covers why fine-tuning helps, gives an overview of the process, and walks through the practical steps: dataset preparation, loading the model and tokenizer, building a custom dataset, and LoRA setup. Follow along in Google Colab to establish a baseline, train the model, and evaluate it against the base model. The final sections show how to push the fine-tuned model to the Hugging Face Hub and discuss how a smaller model, fine-tuned for a specific task, can outperform larger ones on that task.

Syllabus

- Why fine-tuning?
- Text tutorial on MLExpert.io
- Fine-tuning process overview
- Dataset
- Llama 3 8B Instruct
- Google Colab Setup
- Loading model and tokenizer
- Create custom dataset
- Establish baseline
- Training on completions
- LoRA setup
- Training
- Load model and push to Hugging Face Hub
- Evaluation: comparison with the base model
- Conclusion
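
The "Training on completions" item in the syllabus usually means computing the loss only on the answer tokens, not on the prompt (question plus retrieved context). The sketch below shows one common way to do that: prompt positions in the labels are set to -100, the default `ignore_index` of PyTorch's cross-entropy loss. The function name `mask_prompt_labels` and the token ids are hypothetical and chosen for illustration; the tutorial itself may rely on a library-provided collator instead.

```python
# Label value skipped by PyTorch's CrossEntropyLoss by default (ignore_index=-100).
IGNORE_INDEX = -100


def mask_prompt_labels(prompt_ids, completion_ids):
    """Build (input_ids, labels) so that only completion tokens contribute to the loss.

    Hypothetical helper for illustration; not taken from the tutorial.
    """
    # The model sees the full sequence: prompt followed by completion.
    input_ids = list(prompt_ids) + list(completion_ids)
    # Prompt positions are masked out; completion positions keep their token ids.
    labels = [IGNORE_INDEX] * len(prompt_ids) + list(completion_ids)
    return input_ids, labels


# Made-up token ids standing in for a tokenized prompt and answer.
prompt_ids = [101, 7, 8, 9]
completion_ids = [42, 43, 2]

input_ids, labels = mask_prompt_labels(prompt_ids, completion_ids)
print(input_ids)
print(labels)
```

Masking the prompt this way keeps the model from being trained to reproduce the question and context, so the gradient signal comes only from the answer it is supposed to generate.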

Taught by

Venelin Valkov


Related Courses

Fine-Tuning with Llama 3

Fine-tuning Llama 2 for Tone or Style Using Shakespeare Dataset

Fine-tuning Llama 3.2 for Mental Health Sentiment Analysis Using TorchTune

Fine-tuning Llama 3 on Wikipedia Datasets for Low-Resource Languages

Custom RAG Pipelines and LLM Fine-Tuning - A Gradient Tutorial

Fine-Tuning Alpaca - Train Alpaca LoRA for Sentiment Analysis on a Custom Dataset
