AntMan: Dynamic Scaling on GPU Clusters for Deep Learning

USENIX via YouTube

Classroom Contents

  1. Intro
  2. Deep Learning in productions
  3. Observations: Low utilization
  4. Opportunities
  5. Outline
  6. Dynamic scaling memory
  7. Dynamic scaling computation Exclusive mode
  8. AntMan architecture
  9. Micro-benchmark: Memory grow-shrink
  10. Micro-benchmark: Adaptive computation
  11. Trace experiment
  12. Large-scale experiment
  13. Conclusion
