Spark 2.0

Scala Days Conferences via YouTube

Overview

Explore the evolution of Apache Spark's API in this keynote presentation from Scala Days New York 2016. Dive into the upcoming features of Spark 2.0, including more declarative APIs for automatic optimizations and improved links between Scala data types and binary data formats for efficient processing. Learn about Spark's journey as a large-scale Scala project, its functional API, and its impact on distributed programming. Discover the challenges faced in API design, data representation, and performance optimization. Gain insights into DataFrames, Datasets, and Structured Streaming APIs. Understand Project Tungsten's role in improving space efficiency and runtime code generation. Get a glimpse of Spark's long-term vision and versioning strategy, and find resources to further your Apache Spark knowledge.

Syllabus

Intro
What is Apache Spark?
A Large Community
Apache Spark Users
Original Spark Vision
Motivation: Unification
Motivation: Concise API
How Did the Vision Hold Up?
Libraries Built on Spark
Which Libraries Do People Use?
Top Applications
Main Challenge: Functional API
Which API Call Causes Most Tickets?
Example Problem
Challenge: Data Representation
Why Structure?
DataFrames and Datasets
Execution Steps
DataFrame API
Why DataFrames?
What Structured APIs Enable
Performance
Dataset API Details
Data Sources
Data Source API
Examples
Hardware Trends
Project Tungsten
Tungsten's Compact Encoding
Space Efficiency
Runtime Code Generation
Long-Term Vision
Versioning in Spark
Major Features in 2.0
Background
Structured Streaming: High-Level Streaming API Built on DataFrames/Datasets
Structured Streaming API
Example: Batch Aggregation
Example: Continuous Aggregation
Incrementalized By Spark
Release Timeline
Conclusion
Want to Learn Apache Spark?

Taught by

Scala Days Conferences