Breaking Out of the Proprietary Cage - Real-time Data Warehouses in Open Source

Breaking Out of the Proprietary Cage - Real-time Data Warehouses in Open Source

Linux Foundation via YouTube Direct link

Intro

1 of 22

1 of 22

Intro

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Breaking Out of the Proprietary Cage - Real-time Data Warehouses in Open Source

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 Presenter Bio
  3. 3 What makes analytic applications special?
  4. 4 SQL data warehouses run analytic queries
  5. 5 What ClickHouse is not
  6. 6 Merge Tree is the workhorse table engine
  7. 7 Merge Tree data layout
  8. 8 Detailed storage layout within a single part /var/lib/clickhouse/data/airline/ontime
  9. 9 Adding CPUs boosts parallelized execution
  10. 10 I/O drives ClickHouse performance
  11. 11 Compression and codecs reduce I/O
  12. 12 Effect on storage is dramatic
  13. 13 Materialized views restructure/reduce data
  14. 14 Pattern: TTLs + downsampled views
  15. 15 Alternative pattern: Tiered storage
  16. 16 More table engines for clustering!
  17. 17 How do distributed queries work?
  18. 18 Pattern: Kafka-based ingestion pipelines
  19. 19 Alternative ingest pattern: Kafka engine
  20. 20 Pattern: Grafana visualization
  21. 21 Pattern: Operation on Kubernetes
  22. 22 Wrap-up . ClickHouse meets/beats proprietary SQL data warehouses in head-to-head comparisons

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.