Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

PeriFlow: High-Performance Generative AI Serving Engine

SK AI SUMMIT 2024 via YouTube

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Discover an innovative technical presentation from SK TECH SUMMIT 2023 exploring PeriFlow, the fastest available generative AI serving engine in the market. Learn how this groundbreaking technology reduces GPU resource requirements by 70-90% when serving generative AI models like Llama 2. Explore PeriFlow's specialized batching technology that significantly improves throughput while maintaining low latency, protected by patents in the United States and Korea. Gain insights from Dr. Kyungin Yu, who holds a Ph.D. in Computer Science from Seoul National University and specializes in developing efficient systems for AI models including LLMs. Understand how PeriFlow is delivered both as a container and cloud (SaaS) solution, making it accessible for various implementation needs. The 20-minute talk demonstrates how today's technology shapes a more convenient and secure tomorrow through the expertise shared by leading technology companies and professionals.

Syllabus

[SK TECH SUMMIT 2023] FriendliAI PeriFlow 소개

Taught by

SK AI SUMMIT 2024

Reviews

Start your review of PeriFlow: High-Performance Generative AI Serving Engine

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.