Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Petabyte-Scale Data Analytics with ClickHouse, S3 Storage, and Data Lakes

Altinity via YouTube

Overview

Learn how to handle petabyte-scale data processing in real-time using ClickHouse®, S3 object storage, and data lakes in this comprehensive technical webinar. Explore essential design patterns for data ingestion, aggregation, and querying while mastering best practices for S3 storage policies, Parquet data handling, backup strategies, monitoring, and high-performance cluster setup in cloud environments. Starting with fundamental challenges of real-time big data analytics, progress through implementing flexible compute solutions, scaling MergeTree tables with S3 storage, and leveraging Parquet in read-only data lakes. Gain practical insights through detailed demonstrations and real-world examples, concluding with a valuable cheatsheet for managing petabyte-scale ClickHouse clusters and an interactive Q&A session. Access additional resources including detailed documentation, knowledge base articles, and community support channels to further enhance your ClickHouse expertise.

Syllabus

Introduction -
Challenges of Real-time Big Data -
Overview of ClickHouse and Its Features -
Designing analytics on real-time big data -
Implementing Flexible Compute -
Scaling MergeTree Tables with S3 Storage -
Use Parquet in Read-only Data Lakes -
Cheatsheet For Petabyte-Scale ClickHouse Clusters -
Q&A and Closing Remarks -

Taught by

Altinity

Reviews

Start your review of Petabyte-Scale Data Analytics with ClickHouse, S3 Storage, and Data Lakes

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.