Mistral 7B: Architecture, Evaluation, and Advanced Techniques

Trelis Research via YouTube

Overview

Explore the capabilities and architecture of Mistral 7B in this 19-minute video tutorial. Dive into the model's design, setup process on Runpod, and comprehensive evaluation through various tests including random sequence reversal, code generation, passkey retrieval, and fine-tuning. Gain insights into the model's performance and understand advanced concepts like Grouped Query Attention and Sliding Window Attention. Access additional resources including a comparison notebook, Runpod setup guide, and a supervised fine-tuning tutorial to enhance your understanding of this powerful language model.

Syllabus

Intro
Video Overview
Mistral 7B architecture and design
Runpod setup
Mistral 7B Evaluation
Test 1: Random sequence reversal
Test 2: Code generation
Test 3: Passkey retrieval
Test 4: Fine-tuning
Evaluation Summary
EXTRA: Grouped Query Attention
EXTRA: Sliding Window Attention
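
For orientation on the two EXTRA topics, here is a minimal Python sketch of the general ideas, not code from the video. The function names, the toy head counts, and the toy window size are all illustrative assumptions. In Grouped Query Attention, several query heads share one key/value head; in Sliding Window Attention, each position attends only to itself and a fixed number of preceding positions.

```python
def kv_head_for_query_head(q_head, n_q_heads, n_kv_heads):
    """Grouped Query Attention: map a query head to the KV head it shares.

    Hypothetical helper; assumes n_q_heads is a multiple of n_kv_heads.
    """
    group_size = n_q_heads // n_kv_heads
    return q_head // group_size


def sliding_window_causal_mask(seq_len, window):
    """Sliding Window Attention: mask[i][j] is True when query position i
    may attend to key position j (causal, and at most `window` positions
    back, counting position i itself)."""
    return [
        [(j <= i) and (i - j < window) for j in range(seq_len)]
        for i in range(seq_len)
    ]


# Toy sizes chosen for readability only.
gqa_map = [kv_head_for_query_head(h, n_q_heads=8, n_kv_heads=2) for h in range(8)]
mask = sliding_window_causal_mask(seq_len=6, window=3)

print(gqa_map)
for row in mask:
    print("".join("x" if allowed else "." for allowed in row))
```

With these toy sizes, query heads 0 to 3 share one KV head and heads 4 to 7 share the other, and each row of the printed mask has at most three allowed positions ending at the diagonal.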

Taught by

Trelis Research
