Overview
Explore the analysis of compromised passwords using Apache Spark in this conference talk from GrrCon 2018. Dive into big data concepts, Resilient Distributed Datasets (RDDs), and DataFrames while examining the current state of password security. Learn about data visualization techniques, password policies, and the benefits and challenges of using Spark for large-scale data processing. Discover insights on common password patterns, lengths, and suffixes, and understand the importance of addressing password reuse and credential stuffing attacks. Gain valuable knowledge on balancing security measures with user experience in the ever-evolving landscape of cybersecurity.
Syllabus
Introduction
Kelly Robinson Introduction
Agenda
Apache Spark
Big Data
RDDs
DataFrames
Performance
State of passwords
TryHunt
Zeppelin
Top Passwords
Lengths
Data visualizations
Suffixes
Transform Data
Password Policies
Spark Benefits
Spark Challenges
Java Stack Traces
Big Data Security
Security is on everyones mind
Nobody security is perfect
Users have bad passwords
Password reuse
Password security
Seamless user experience
Credential stuffing
Wrap up