Overview
Syllabus
– Motivation for reasoning & planning
– Inference through energy minimization
– Disclaimer
– Planning through energy minimization
– Q&A Optimal control diagram
– Differentiable associative memory and attention
– Transformers
– Q&A Other differentiable attention architectures
– Transformer architecture
– Transformer applications: 1. Multilingual transformer Architecture XML-R
– 2. Supervised symbol manipulation
– 3. NL understanding & generation
– 4. DETR
– Planing through optimal control
– Conclusion
Taught by
Alfredo Canziani