Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation

USENIX via YouTube

Overview

Explore Ladder, a compiler for efficient low-precision deep learning computing, in this 16-minute conference talk from OSDI '24. Learn how Ladder bridges the gap between rapidly evolving custom data types and the fixed precision formats supported by current hardware through hardware-aware tensor transformation. Discover how its general type system, tType, and an extended tensor expression let Ladder transform deep neural network computations into optimized computing pipelines. Understand how new tensor scheduling primitives and a hardware-aware optimization policy navigate the complex transformation space to deliver strong performance across memory layers and DNN operators. Gain insight into Ladder's ability to systematically support a wide range of low-bit custom data types, improving DNN computation performance on modern accelerators without hardware modifications, and see how this lets model designers explore data type optimizations while giving hardware vendors a flexible way to support diverse precision formats.
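To make the core idea concrete, here is a minimal sketch of the general technique the talk addresses: storing a custom low-bit data type (here signed int4) packed inside a hardware-native type (int8), then converting to a hardware-supported compute type (fp32) just before the operation. This is an illustrative example only; the function names and packing layout are assumptions for this sketch, not Ladder's actual API or pipeline.

```python
import numpy as np

# Hypothetical helpers for illustration -- not Ladder's implementation.

def pack_int4(values):
    """Pack signed 4-bit integers (-8..7) two per byte of int8 storage."""
    v = np.asarray(values, dtype=np.int8)
    assert v.size % 2 == 0, "need an even number of nibbles"
    u = (v & 0x0F).astype(np.uint8)          # keep two's-complement nibbles
    return (u[0::2] | (u[1::2] << 4)).astype(np.uint8)

def unpack_int4(packed):
    """Unpack packed bytes back into signed 4-bit integers."""
    p = np.asarray(packed, dtype=np.uint8)
    lo = (p & 0x0F).astype(np.int16)
    hi = ((p >> 4) & 0x0F).astype(np.int16)
    lo = np.where(lo >= 8, lo - 16, lo)      # sign-extend the nibbles
    hi = np.where(hi >= 8, hi - 16, hi)
    out = np.empty(p.size * 2, dtype=np.int8)
    out[0::2], out[1::2] = lo, hi
    return out

def int4_matvec(packed_w, shape, scale, x):
    """Dequantize packed int4 weights to fp32, then run a normal matvec.

    The compute itself uses a precision the hardware supports (fp32);
    only the storage uses the custom 4-bit format.
    """
    w = unpack_int4(packed_w).astype(np.float32).reshape(shape) * scale
    return w @ x
```

A compiler like Ladder automates decisions this sketch hard-codes: where in the memory hierarchy the packed and converted forms live, and how the conversion is fused into the surrounding operator pipeline.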

Syllabus

OSDI '24 - Ladder: Enabling Efficient Low-Precision Deep Learning Computing through...

Taught by

USENIX
