Understanding Neural Attention in Deep Learning - From Basics to Transformers
Neural Breakdown with AVB via YouTube
Overview
Learn the foundational concepts of Neural Attention through an 18-minute video that breaks down the core mechanism powering Transformer architectures and Large Language Models (LLMs). Progress from building a basic recommendation system to understanding contrastive learning, mean pooling, and attention as a weighted mean. Master encoder-decoder attention and multiheaded attention through visual illustrations and practical examples. Explore real-world applications in machine translation while examining key research papers, including "Attention Is All You Need" and "Neural Machine Translation by Jointly Learning to Align and Translate." Part of a comprehensive series on neural networks, with the follow-up video focusing on Self Attention.
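To make the mean pooling and weighted attention ideas concrete, here is a minimal NumPy sketch. It is not code from the video; the toy item embeddings, the query vector, and the function names are illustrative assumptions.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over the last axis
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

# Toy item embeddings: 4 items, each a 3-dimensional vector (illustrative values)
items = np.array([
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
])

# Mean pooling: every item contributes equally to the summary vector
mean_pooled = items.mean(axis=0)

# Attention as a weighted mean: a query scores each item, softmax turns
# the scores into weights that sum to 1, and the pooled vector leans
# toward the items most relevant to the query.
query = np.array([1.0, 0.0, 0.0])   # hypothetical query vector
scores = items @ query              # dot-product relevance scores
weights = softmax(scores)           # attention weights
attended = weights @ items          # weighted mean of the item vectors

print("mean pooled:", mean_pooled)
print("weights:    ", weights)
print("attended:   ", attended)
```

With uniform weights the attended vector reduces to the plain mean; attention simply lets the query decide how much each item counts.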
Syllabus
- Intro
- Let's make a Recommendation System
- Contrastive Learning
- A Chatty Recommender
- Mean Pooling
- Attention as a weighted mean
- Math
- Machine Translation
- Encoder Decoder Attention
- More Neural Attention
- Multiheaded Attention
- Conclusion and Next Video!
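As a companion to the "Encoder Decoder Attention" and "Multiheaded Attention" chapters above, here is a minimal NumPy sketch of multiheaded encoder-decoder attention. This is a sketch under stated assumptions, not the video's implementation: random matrices stand in for learned projection weights, and all dimensions are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(x):
    # Numerically stable softmax over the last axis
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def attention(Q, K, V):
    # Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V
    d_k = Q.shape[-1]
    weights = softmax(Q @ K.T / np.sqrt(d_k))
    return weights @ V

def multi_head_attention(queries, keys_values, n_heads):
    # Split the model dimension across heads; each head gets its own
    # projections (randomly initialized here, learned in a real model).
    d_model = queries.shape[-1]
    d_head = d_model // n_heads
    heads = []
    for _ in range(n_heads):
        Wq = rng.normal(size=(d_model, d_head))
        Wk = rng.normal(size=(d_model, d_head))
        Wv = rng.normal(size=(d_model, d_head))
        heads.append(attention(queries @ Wq, keys_values @ Wk, keys_values @ Wv))
    # Concatenate per-head outputs; a real layer would also apply an
    # output projection after concatenation.
    return np.concatenate(heads, axis=-1)

# Encoder-decoder attention: each decoder state queries the encoder states,
# so the output mixes source-side information into every target position.
encoder_states = rng.normal(size=(6, 8))  # 6 source tokens, d_model = 8
decoder_states = rng.normal(size=(4, 8))  # 4 target tokens
context = multi_head_attention(decoder_states, encoder_states, n_heads=2)
print(context.shape)  # (4, 8): one context vector per target token
```

Self attention, the topic of the follow-up video, is the special case where the queries, keys, and values all come from the same sequence.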
Taught by
Neural Breakdown with AVB