Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Chinchilla Explained - Compute-Optimal Massive Language Models

Edan Meyer via YouTube

Overview

Explore the groundbreaking Chinchilla language model in this 33-minute video lecture. Delve into DeepMind's innovative approach to scaling large language models in a compute-optimal manner, resulting in Chinchilla's superior performance over GPT-3, Gopher, and Megatron-Turing NLG with only 70 billion parameters. Learn about the extensive research involving 400 large models to determine the optimal ratio of parameters and training data. Gain insights into the paper's introduction, methodology, scaling implications, and Chinchilla's overview and performance. Conclude with a summary and critical analysis of this significant advancement in natural language processing.

Syllabus

- Overview
- Paper Intro
- Methods
- Scaling Implications
- Chinchilla Overview
- Chinchilla Performance
- Summary
- Thoughts & Critiques

Taught by

Edan Meyer

Reviews

Start your review of Chinchilla Explained - Compute-Optimal Massive Language Models

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.