Completed
– Transformer applications: 1. Multilingual transformer Architecture XML-R
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Differentiable Associative Memories, Attention, and Transformers
Automatically move to the next video in the Classroom when playback concludes
- 1 – Motivation for reasoning & planning
- 2 – Inference through energy minimization
- 3 – Disclaimer
- 4 – Planning through energy minimization
- 5 – Q&A Optimal control diagram
- 6 – Differentiable associative memory and attention
- 7 – Transformers
- 8 – Q&A Other differentiable attention architectures
- 9 – Transformer architecture
- 10 – Transformer applications: 1. Multilingual transformer Architecture XML-R
- 11 – 2. Supervised symbol manipulation
- 12 – 3. NL understanding & generation
- 13 – 4. DETR
- 14 – Planing through optimal control
- 15 – Conclusion