Classroom Contents
The Apache Spark File Format Ecosystem - Optimizing Storage for Performance
- 1 Intro
- 2 Session Goals
- 3 File Formats
- 4 Row-wise Storage
- 5 Columnar (Column-wise) Storage
- 6 Hybrid Storage
- 7 Example Data
- 8 About: CSV
- 9 About: JSON
- 10 About: Avro
- 11 Inspecting: Avro
- 12 About: ORC
- 13 Structure: ORC
- 14 Inspecting: ORC
- 15 Config: ORC
- 16 Structure: Parquet
- 17 Inspecting: Parquet (1)
- 18 Inspecting: Parquet (2)
- 19 Config: Parquet
- 20 Case Study: Veraset
- 21 Looking Forward: Apache Arrow
- 22 Final Thoughts