Breaking Out of the Proprietary Cage - Real-time Data Warehouses in Open Source

Breaking Out of the Proprietary Cage - Real-time Data Warehouses in Open Source

Linux Foundation via YouTube Direct link

More table engines for clustering!

16 of 22

16 of 22

More table engines for clustering!

Class Central Classrooms beta

YouTube videos curated by Class Central.

Classroom Contents

Breaking Out of the Proprietary Cage - Real-time Data Warehouses in Open Source

Automatically move to the next video in the Classroom when playback concludes

  1. 1 Intro
  2. 2 Presenter Bio
  3. 3 What makes analytic applications special?
  4. 4 SQL data warehouses run analytic queries
  5. 5 What ClickHouse is not
  6. 6 Merge Tree is the workhorse table engine
  7. 7 Merge Tree data layout
  8. 8 Detailed storage layout within a single part /var/lib/clickhouse/data/airline/ontime
  9. 9 Adding CPUs boosts parallelized execution
  10. 10 I/O drives ClickHouse performance
  11. 11 Compression and codecs reduce I/O
  12. 12 Effect on storage is dramatic
  13. 13 Materialized views restructure/reduce data
  14. 14 Pattern: TTLs + downsampled views
  15. 15 Alternative pattern: Tiered storage
  16. 16 More table engines for clustering!
  17. 17 How do distributed queries work?
  18. 18 Pattern: Kafka-based ingestion pipelines
  19. 19 Alternative ingest pattern: Kafka engine
  20. 20 Pattern: Grafana visualization
  21. 21 Pattern: Operation on Kubernetes
  22. 22 Wrap-up . ClickHouse meets/beats proprietary SQL data warehouses in head-to-head comparisons

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.