Discover techniques to optimize Pandas for larger datasets in this 30-minute EuroPython 2020 conference talk. Learn how to compress data to fit more rows into RAM, explore methods to accelerate common DataFrame operations using numpy and Numba, and investigate multi-core processing. Examine solutions for handling datasets that exceed memory limitations, including Dask on Pandas and the Vaex alternative. Gain valuable insights to reduce DataFrame size and boost data processing speed, applicable to Pandas users of all levels. Based on the speaker's work updating the O'Reilly book "High Performance Python," this talk equips you with strategies to enhance your data analysis capabilities and improve job effectiveness.
Overview
Syllabus
Ian Ozsvald - Making Pandas Fly
Taught by
EuroPython Conference