Overview
Explore the challenges and solutions for implementing Spark in a secure, productive Mesos environment in this conference talk by Jorge Lopez-Malla and Marcos Peñate from Stratio. Delve into the evolution of Big Data usage in companies and the need for enhanced security measures. Learn about Stratio's modifications to Apache Spark over Apache Mesos, including the implementation of Software Defined Networks (SDN) for improved isolation and changes to Spark's core network layer. Discover a smart approach to handling secrets without user interaction, enhancing Apache Spark's security module. Gain insights into Kerberizing Spark, integrating with HDFS, and implementing mutual TLS. Watch a live demonstration showcasing network profiling, Spark dispatch, and HDFS access in a secured environment. Understand the importance of network parameters and Spark properties in maintaining a secure Big Data ecosystem.
Syllabus
Intro
Presentation Outline
Presentation Introduction
Marcos Introduction
What is DC/OS
Preconditions
Integration
Kerberos
HDFS
Read from HDFS
Key management
Security
Security Problem
mundial
Mutual TLS
TLS in Wikipedia
We think different
Demo
Network Isolation
Network Solution
Network Profiling
Live Demo
Spark Dispatch
Access HDFS
Network Parameters
Spark Properties
Questions
Dessert Questions
Taught by
Linux Foundation