OpenYurt and Dragonfly - Enhancing Efficient Distribution of LLMs in Cloud-Edge Collaborative Scenarios
CNCF [Cloud Native Computing Foundation] via YouTube
Overview
Explore how OpenYurt and Dragonfly make the distribution of Large Language Models (LLMs) more efficient in cloud-edge collaborative scenarios. Learn about the challenges of deploying and delivering ever-larger LLMs in edge computing environments with thousands of nodes. Discover how OpenYurt distributes LLM applications across dispersed edge nodes, and how Dragonfly's P2P image distribution technology cuts the public network bandwidth consumed by cross-site transmission. See how the combined solution can reduce public network traffic by up to 90% compared with conventional LLM distribution while enabling fast sharing of LLMs within physically isolated environments. Container service experts from Alibaba Cloud and Ant Group share practical experience of combining OpenYurt with Dragonfly for LLMs in edge computing scenarios.
Syllabus
OpenYurt & Dragonfly: Enhancing Efficient Distribution of LLMs in Cloud-Edge... - Linbo He & Jim Ma
Taught by
CNCF [Cloud Native Computing Foundation]