Overview
Syllabus
Intro
What is Spark?
The different pieces of Spark
Why should we consider Spark SQL?
What is the performance like?
How is it so fast?
How much more space efficient?
Getting started
Loading some simple JSON data
Sample case class for schema
Then apply some type magic
What do relational transforms look like?
Writing a relational transformation
What can the optimizer do now?
Using Datasets to mix functional & relational style
And functional style maps
What is Dataset functional performance like?
Build the recipe for each query
Windowed operations
Window specs
Summary: Why to use Datasets
The next book...
Taught by
Scala Days Conferences