TransformerFAM and BSWA: Understanding Feedback Attention Memory and Block Sliding Window Attention
Discover AI via YouTube
Overview
Syllabus
3 videos on infinity context length
Visualization of new transformerFAM
Pseudocode for two new transformer
Basics of Attention calculations
TransformerBSWA - Block Sliding Window Attention
TransformerFAM - Feedback Attention Memory
Symmetries in operational feedback code
Time series visualization of new FAM and BSWA
Outlook on Reasoning w/ TransformerFAM
Taught by
Discover AI