Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Transparent GPU Sharing in Container Clouds for Deep Learning Workloads

USENIX via YouTube

Overview

Explore a cutting-edge solution for GPU sharing in container clouds designed specifically for deep learning workloads. This 15-minute conference talk introduces TGS (Transparent GPU Sharing), an innovative system operating at the OS layer that addresses the challenge of GPU underutilization in datacenters. Learn how TGS leverages adaptive rate control and transparent unified memory to achieve high GPU utilization and performance isolation, ensuring minimal impact on production jobs while significantly improving throughput for opportunistic jobs. Discover the advantages of TGS over existing application-layer and OS-layer solutions, and gain insights into its integration with Docker and Kubernetes. Understand the potential of this technology to revolutionize resource management in container clouds and optimize deep learning training processes.

Syllabus

NSDI '23 - Transparent GPU Sharing in Container Clouds for Deep Learning Workloads

Taught by

USENIX

Reviews

Start your review of Transparent GPU Sharing in Container Clouds for Deep Learning Workloads

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.