LIMoE- Learning Multiple Modalities with One Sparse Mixture-of-Experts Model

Overview

This 17-minute video walks through LIMoE (Learning Multiple Modalities with One Sparse Mixture-of-Experts Model), a large-scale multimodal architecture that processes both images and text using sparsely activated experts. It covers the research paper introduction, the topics addressed, LIMoE internals, the training system, multimodal contrastive learning, an analysis of the model's behavior, and its performance. Additional resources, including GitHub repositories and research papers, are provided for further study.

Syllabus

- Research Paper intro
- Topics Covered
- LIMoE Internals
- Training System
- Multimodal Contrastive Learning
- LIMoE Behavior Understanding
- LIMoE Performance
- Conclusion

Taught by

Prodramp

Related Courses

Research Paper Deep Dive - The Sparsely-Gated Mixture of Experts

Mixture of Experts (MoE) in Large Language Models - A Simple Guide

Stanford Seminar - Mixture of Experts Paradigm and the Switch Transformer

Machine Learning: Regression

Google Gemini - Technical Report on Model Architecture, Dataset, and Training

Building Makemore - MLP
