Scaling Systems for Generative AI - Building and Deploying Large Language Models
Open Compute Project via YouTube
Overview
Learn how to scale model deployment and serving efficiently for Generative AI and large language models in this 15-minute conference talk from Intel's VP of AI Systems Product Management. Explore the critical requirements for AI compute systems and Intel's vision for building scalable solutions that support training and deploying models across configurations ranging from a single node to thousands of nodes. See how Intel technologies can be applied to the challenges of scaling AI systems for performance and efficiency.
Syllabus
Scaling Systems for Gen AI
Taught by
Open Compute Project