Overview
This lightning talk from Ray Summit 2024 shows how Daft, a Python/Rust library, adds distributed ETL and analytics capabilities to Ray clusters. It covers how Daft integrates with the Ray ecosystem, including its performance and feature set. Learn how to use Ray's object store to process larger-than-memory datasets, transfer data to Ray Data with zero copies, and connect to Ray's ML/AI tools. Jay Chia then walks through an end-to-end example that combines Daft and Ray for data exploration, cleaning, processing, and ML/AI training.
Syllabus
Ray Meets Daft: Supercharging ETL and Analytics | Ray Summit 2024
Taught by
Anyscale