Overview
Watch a conference talk from Ray Summit 2024 where IBM's Dean Wampler explores the implementation of Ray for large-scale data processing in AI and scientific applications. Discover the capabilities of the Data Prep Kit, an open-source project by IBM Research and the AI Alliance, which leverages Ray as its primary engine for data processing operations essential to LLM training and fine-tuning. Learn how Ray enables seamless scalability from local development environments to large-scale parallel processing of terabyte-scale datasets, and understand its integral role in the AI Alliance's Open Trusted Data initiative for data curation.
Syllabus
Ray at IBM: Transforming Large-Scale Data Processing for AI and Science | Ray Summit 2024
Taught by
Anyscale