Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Scaling Systems for Generative AI - Building and Deploying Large Language Models

Open Compute Project via YouTube

Overview

Learn how to efficiently scale model deployment and service for Generative AI and large language models in this 15-minute conference talk from Intel's VP of AI Systems Product Management. Explore the critical requirements for AI compute systems and discover Intel's vision for building scalable solutions that enable seamless training and deployment of models across configurations ranging from single nodes to thousands of nodes. Gain insights into how Intel technologies can be leveraged to address the unique challenges of scaling AI systems for optimal performance and efficiency.

Syllabus

Scaling Systems for Gen AI

Taught by

Open Compute Project

Reviews

Start your review of Scaling Systems for Generative AI - Building and Deploying Large Language Models

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.