High Performance Storage Solution for Large-scale ML Systems

CNCF [Cloud Native Computing Foundation] via YouTube

Overview

Explore a conference talk on building high-performance storage for large-scale machine learning systems. Discover how I/O bottlenecks can lengthen training time and limit system scalability, especially when training data must be moved from global filesystems. Learn about approaches to these challenges, including the adoption of high-speed hardware and software improvements such as thread models, load-balancing SDKs, read/write splitting, and read path optimization. Gain insights into achieving lower latency and higher throughput for more efficient ML model training and data processing.

Syllabus

High Performance Storage Solution for Large-scale ML Systems - Hongjian Yu & Pengfei Zheng

Taught by

CNCF [Cloud Native Computing Foundation]
