Reformer - The Efficient Transformer

Overview

Explore the groundbreaking Reformer model in this informative video, which addresses the resource-intensive nature of the famous Transformer architecture. Learn how the Reformer combines Locality Sensitive Hashing and concepts from Reversible Networks to significantly reduce memory usage and enable processing of much longer input sequences. Discover how this innovative approach allows for handling up to 16K tokens with just 16GB of memory, making it a game-changer for natural language processing tasks. Delve into the technical details of the model's O(LlogL) complexity, reversible residual layers, and their impact on efficiency. Gain insights into the Reformer's performance, which rivals traditional Transformer models while offering substantial improvements in memory efficiency and processing speed for long sequences.

Syllabus

Reformer: The Efficient Transformer

Taught by

Yannic Kilcher

Reviews

Start your review of Reformer - The Efficient Transformer

Taught by

Reversible Transformer - GPU Memory Optimization Using ReFORMER and Reversible Residual Layers

Longformer - The Long-Document Transformer

Scaling Transformer to 1M Tokens and Beyond with RMT - Paper Explained

Efficient Inference of Extremely Large Transformer Models

The Narrated Transformer Language Model

Not All Memories Are Created Equal - Learning to Forget by Expiring

10 Best Machine Learning Courses for 2024: Scikit-learn, TensorFlow, and more

Never Stop Learning.