Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

The Inner Workings of Large Language Models - Visualizing Self-Attention Mechanisms

Discover AI via YouTube

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Explore the fundamental mechanics of Large Language Models (LLMs) in this 35-minute educational video that demystifies complex concepts through clear visualizations and beginner-friendly explanations. Dive into the self-attention mechanism powering models like ChatGPT and GPT-4, understand the key differentiators between various LLMs including their weights, pre-trained datasets, and architectural design structures, and learn about performance optimization through hardware, software, and architectural tuning. Master essential concepts about decoder-based Transformers, LangChain implementation, and Vector stores with their embeddings. Focus specifically on the decoder stack while examining real-world applications through examples like Claude from Anthropic and cutting-edge research from "AttentionViz: A Global View of Transformer Attention." Access interactive demonstrations and comprehensive documentation to reinforce understanding of these transformative AI technologies.

Syllabus

The inner workings of LLMs explained - VISUALIZE the self-attention mechanism

Taught by

Discover AI

Reviews

Start your review of The Inner Workings of Large Language Models - Visualizing Self-Attention Mechanisms

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.