Overview
Discover how columnar databases are changing data warehousing and analytics in this 26-minute Devoxx conference talk. Explore the fundamental differences between columnar and traditional row-based databases, with a focus on how column-oriented storage speeds up data retrieval and processing at scale. Learn to use open-source technologies, including Apache Arrow, Apache Parquet, and pandas, to build high-performance analytics applications. Examine real-world case studies that demonstrate improved query performance and reduced storage costs after adopting columnar storage. Gain insights into implementation best practices and discover open-source resources for improving analytics workflows and supporting data-driven decision making within organizations.
Syllabus
Columnar Storage: Redefining Data Management for the Modern Era by Zoe Steinkamp
Taught by
Devoxx