AdaEmbed - Adaptive Embedding for Large-Scale Recommendation Models

Overview

Explore a cutting-edge approach to optimizing deep learning recommendation models (DLRMs) in this conference talk from OSDI '23. Dive into AdaEmbed, an innovative system designed to reduce embedding size while maintaining model accuracy through in-training embedding pruning. Learn how this technique leverages heterogeneous access patterns and weights across embedding rows to dynamically identify and prune less important embeddings at scale. Discover the potential of AdaEmbed to significantly reduce deployment costs and improve model execution speed in large-scale recommendation systems. Gain insights into the challenges of working with DLRMs containing billions of embeddings and how AdaEmbed addresses these issues in industrial settings. Understand the impact of this approach on embedding size reduction, model execution speed, and accuracy gains in real-world applications.

Syllabus

OSDI '23 - AdaEmbed: Adaptive Embedding for Large-Scale Recommendation Models

Taught by

USENIX

Reviews

Start your review of AdaEmbed - Adaptive Embedding for Large-Scale Recommendation Models

Taught by

Check-N-Run - A Checkpointing System for Training Deep Learning Recommendation Models

OPER: Optimality-Guided Embedding Table Parallelization for Large-scale Recommendation Models

Adaptive In-Context Learning with Large Language Models for Bundle Generation - Recommendation Systems

Hydro - Surrogate-Based Hyperparameter Tuning Service in Datacenters

Defcon - Preventing Overload with Graceful Feature Degradation

AlpaServe - Statistical Multiplexing with Model Parallelism for Deep Learning Serving

Never Stop Learning.