Discover techniques for optimizing Pandas on larger datasets in this 30-minute EuroPython 2020 conference talk. Learn how to compress data to fit more rows into RAM, speed up common DataFrame operations with NumPy and Numba, and spread work across multiple cores. Examine options for datasets that exceed available memory, including Dask on top of Pandas and Vaex as an alternative. The techniques for reducing DataFrame size and increasing processing speed apply to Pandas users of all levels. The talk is based on the speaker's work updating the O'Reilly book "High Performance Python."
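As a rough illustration of what "compressing data to fit more rows into RAM" can look like, here is a minimal sketch that shrinks a DataFrame by choosing smaller dtypes. It is not taken from the talk: the column names (`city`, `visits`, `price`) and the synthetic data are assumptions made for the example, and whether `float32` precision is acceptable depends on your data.

```python
import numpy as np
import pandas as pd

# Synthetic data; the columns are hypothetical and exist only for illustration.
rng = np.random.default_rng(0)
n = 100_000
df = pd.DataFrame({
    "city": rng.choice(["London", "Paris", "Berlin"], size=n),  # few distinct strings
    "visits": rng.integers(0, 100, size=n),                     # small non-negative ints
    "price": rng.random(n) * 100,                               # float64 by default
})

before = df.memory_usage(deep=True).sum()

small = df.assign(
    # Repeated strings are stored once, with compact integer codes per row.
    city=df["city"].astype("category"),
    # Pick the smallest unsigned integer type that holds every value.
    visits=pd.to_numeric(df["visits"], downcast="unsigned"),
    # Halves the storage of float64, at the cost of reduced precision.
    price=df["price"].astype("float32"),
)

after = small.memory_usage(deep=True).sum()
print(f"before: {before:,} bytes, after: {after:,} bytes")
```

Passing `deep=True` to `memory_usage` makes Pandas count the string payloads as well, which makes the comparison between the two frames fairer.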