Overview
Explore Azure's big data and Massively Parallel Processing (MPP) offerings in this comprehensive conference talk from the PASS Data Community Summit. Dive into the concept of data lakes, Lambda Architecture, and various Azure services including Blob Storage, Data Lake Store, HDInsight, and Data Lake Analytics. Learn about batch processing, federated queries, and the intricacies of Azure SQL Data Warehouse, including Data Warehouse Units (DTUs), data partitioning, and important considerations for unsupported data types and features. Gain valuable insights into leveraging Azure's powerful tools for handling large-scale data processing and analytics workloads.
Syllabus
Intro
Technical Assistance
What is a generic data lake?
Lambda Architecture
Azure Blob Storage
Azure Data Lake Store (ADLS)
Azure HDinsight
Batch Processing - Azure Data Lake Analytics
Data Lake Analytics Workloads
Federated Query
Store & Process: Azure SQL Data Warehouse
What is a DTU (Data Warehouse Unit)?
Partitioning Data
Non-supported data types
Unsupported Features
Taught by
PASS Data Community Summit