Overview
Learn about the cost implications and optimization challenges of migrating data analytics to the cloud in this conference presentation, which examines Uber's large-scale SQL analytics platform running on HDFS and GCS. Explore why traditional performance-focused optimization strategies need adapting once cloud storage API operations carry a cost, one that can escalate rapidly in production. Discover key findings about the unexpected cost impact of standard I/O optimizations in operations such as table scans, filters, and broadcast joins when they run against cloud storage. Gain insight into the shift in thinking needed to optimize data-intensive applications in the cloud, balancing performance against cost. Through this real-world case study, understand the complexities of managing analytics workloads at scale and develop strategies for more cost-effective cloud deployments.
Syllabus
A Case Study in API Cost of Running Analytics in the Cloud at Scale with an... - Bin Fan & Hope Wang
Taught by
Linux Foundation