Architectural Challenges and Innovation for Compute Infrastructure Co-Design in Generative AI

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Grab it

Explore a technical presentation that delves into the computing infrastructure challenges posed by large-scale generative AI models and potential solutions through heterogeneous computing approaches. Learn about the key challenges facing modern AI systems, including managing massive model sizes like GPT-3's 175 billion parameters and addressing data movement bottlenecks. Discover how combining CPU, GPU, and FPGA technologies can achieve greater energy efficiency compared to GPU-only solutions. Examine future possibilities with chiplet-based systems, understanding how enhanced inter-chiplet bandwidth could enable more effective workload partitioning within system-in-package designs. Gain insights into the architectural innovations necessary for supporting next-generation AI applications through this 14-minute talk delivered by Peipei Zhou from the University of Pittsburgh at the Open Compute Project.

Syllabus

Architectural Challenges and Innovation for Compute Infrastructure Co-Design

Taught by

Open Compute Project

Reviews

Start your review of Architectural Challenges and Innovation for Compute Infrastructure Co-Design in Generative AI

Taught by

Optical Compute Interconnect: Co-Packaged Optics for AI and Computing Infrastructure

The Challenges of Scaling Beyond Moore's Law - From Monolithic Dies to 3D Heterogeneous Integration

Enabling Sustainable AI Datacenters with ARM-based Chiplets

Optical Interconnects for Large-Scale AI Clusters - A Meta Perspective

Optimizing High Density Storage for AI and ML Applications

Never Stop Learning.