Classroom Contents
Recent Parquet Improvements in Apache Spark - Vectorized Complex Types and Column Index Support
- 1 Intro
- 2 Short Intro
- 3 Outline
- 4 Introduction on Apache Parquet
- 5 Parquet: Glossary
- 6 Parquet: Data Page
- 7 Background
- 8 Non-Vectorized Parquet Reader
- 9 Advantages of Vectorized Approach
- 10 High Level Idea
- 11 Parquet Schema Conversion
- 12 SPARK-34863: Complex type support
- 13 Complex Type - Performance
- 14 Perf: vectorized vs non-vectorized
- 15 Parquet Predicate Pushdown
- 16 Column Index Filtering
- 17 Column Index Support in Spark
- 18 Column Index - Performance
- 19 Future Work