Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Introducing Amazon SageMaker HyperPod for Foundation Model Training

AWS Events via YouTube

Overview

Learn about accelerating foundation model (FM) training in this AWS re:Invent 2023 conference session that introduces Amazon SageMaker HyperPod. Explore how to conduct uninterrupted FM training over extended periods of weeks and months using this purpose-built solution. Discover the system's intelligent cluster health monitoring capabilities that automatically repair and replace faulty nodes while maintaining training progress. Gain insights into the preconfigured SageMaker distributed training libraries that optimize FM training performance by efficiently splitting training data and models into smaller segments for parallel processing across cluster nodes, maximizing compute and network infrastructure utilization.

Syllabus

AWS re:Invent 2023 - [LAUNCH] Introducing Amazon SageMaker HyperPod (AIM362)

Taught by

AWS Events

Reviews

Start your review of Introducing Amazon SageMaker HyperPod for Foundation Model Training

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.