YouTube videos curated by Class Central.
Classroom Contents
Training and Serving Custom Multi-modal Models - IDEFICS 2 and LLaVA Llama 3
- 1 Fine-tuning and server setup for multi-modal models
- 2 Prerequisites pre-watching
- 3 IDEFICS 2 Model Overview
- 4 Model loading, evaluation and LoRA setup
- 5 Evaluating OCR performance
- 6 Evaluating multiple image inputs
- 7 Training / Fine-tuning
- 8 LLaVA Llama 3 Model Review
- 9 Multi-modal inference endpoint
- 10 VRAM Requirements for multi-modal models
- 11 IDEFICS 2 - my recommended model to build on