Inside the VertiPaq Engine

Overview

Dive deep into the VertiPaq engine's architecture and functionality in this comprehensive 1-hour 10-minute conference talk from PASS Data Community Summit. Explore the columnar database used by SQL Server Analysis Services Tabular, Power BI, and Power Pivot, known for its impressive performance in speed and compression. Discover how VertiPaq stores information, gaining valuable insights into its inner workings and learning optimal data warehouse modeling techniques. Examine common and effective methods to enhance compression ratios and boost performance in Tabular data models. Cover topics such as row and column storage layouts, querying techniques, value encoding, hash encoding, Run Length Encoding (RLE), segmentation, processing phases, calculated columns and tables, query parallelization, hierarchies, memory usage, and materialization strategies.

Syllabus

Intro
Marco Russo
What is VertiPaq?
Tabular query architecture
Row storage layout
Column storage layout
Querying a columnar database
Column vs row storage
Value encoding
Hash encoding
Run Length Encoding (RLE)
Compression of one column
VertiPaq compression
Reducing column size
Segmentation
Processing phases
Special Case of 3rd Segment
Calculated columns and tables
Query parallelization
Hierarchies
Memory Usage During Process
Data memory usage
VertiPaq Analyzer
Query memory usage
Early materialization
Late materialization

Taught by

PASS Data Community Summit

Reviews

Start your review of Inside the VertiPaq Engine

Taught by

Columnstore Technical Deep Dive

Improving Analytics Performance With Columnstore Indexes

Never Stop Learning.