Classroom Contents
Using Pandas and Dask to Work with Large Columnar Datasets in Apache Parquet
- 1 Intro
- 2 Outline
- 3 Business Model
- 4 Data Flow
- 5 Conclusion
- 6 Why do I care?
- 7 Other technologies
- 8 Blob storage
- 9 Data sharing
- 10 Pocky
- 11 Why Parquet
- 12 Python implementations
- 13 Parquet file structure
- 14 Predicate pushdown
- 15 Dictionary encoding
- 16 Compression
- 17 Partitioning
- 18 Storage
- 19 ODBC
- 20 Azure Blob Storage
- 21 Questions