Near Memory Compute for AI Inferencing - Optimizing Data Center Design and TCO
Open Compute Project via YouTube
Overview
Explore a 21-minute technical presentation on Near-Memory Compute (NMC) as an approach to the memory challenges of AI inferencing data centers. Learn how low-cost remote memory, connected through low-latency interconnects, can optimize inferencing operations. Discover the benefits of offloading specific inferencing tasks to smaller cores positioned near the remote memory, supported by simulation data showing reduced execution latency. Understand how cost-effective remote memory pools can reduce the Total Cost of Ownership (TCO) of inferencing data centers. Gain insights into data center design principles that balance sustainability with operational efficiency.
Syllabus
Near Memory Compute for AI Inferencing
Taught by
Open Compute Project