Overview
Explore a conference talk presenting Opaque, an innovative distributed data analytics platform designed to provide robust security for sensitive data processing in cloud environments. Learn about the platform's ability to support a wide range of queries while offering strong security guarantees through hardware enclaves and oblivious relational operators. Discover how Opaque addresses the challenge of access pattern leakage and implements new query planning techniques to optimize performance. Gain insights into the platform's implementation on Spark SQL, its encryption and authentication features, and its performance compared to vanilla Spark SQL and state-of-the-art oblivious protocols. Understand the benefits of Opaque's query optimization techniques and their impact on improving performance in various scenarios, including complex analytics and PageRank computations.
Syllabus
Intro
Complex analytics run on sensitive data
Problem: access pattern
Oblivious aggregation
Rule-based optimization
Observation: not all tables are sensitive
Cost-based optimization
Big Data Benchmark (distributed)
PageRank: comparison with GraphSC (single machine)
Taught by
USENIX