What Learning Algorithm is In-Context Learning? - Understanding Transformer Models and Neural Sequence Learning
Harvard CMSA via YouTube
Overview
Syllabus
Introduction
In-Context Learning
Outline
In-Context Learning
Training
What is a good ICL
Not just with well-defined problems
Identifying skills
Learning French
Can Transformer Models Do Real Learning
Offline Transformations
Computing Dot Products
Nonlinearity
Generic Transformer
Read Attend Write Operator
Parameterization
Learning Setup
Quality of Predictions
Other Predictions
Nearest Neighbors
Summary
Natural Questions
Standard Transformer
Linear Self-Attention
Data Diversity
Real Models
Next Direction
Credits
Taught by
Harvard CMSA