Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Scalable Training of Language Models Using Ray, JAX, and TPUv4

Anyscale via YouTube

Overview

Explore the challenges and design decisions associated with developing a scalable training framework for large language models in this 34-minute conference talk from Ray Summit 2022. Delve into the quantitative analysis of efficiency improvements resulting from adopting new software and hardware solutions, including Ray, JAX pjit, and TPUv4. Learn about the distributed training strategies required for modern large language models due to their size, and gain insights into the rapid developments on both software and hardware frontiers that address the challenges of efficient and robust training.

Syllabus

Scalable training of language models using Ray, JAX, and TPUv4 at Cohere

Taught by

Anyscale

Reviews

Start your review of Scalable Training of Language Models Using Ray, JAX, and TPUv4

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.