Apache Spark Essential Training

Overview

Learn how to make Apache Spark work with other Big Data technologies and put together an end-to-end project that can solve a real-world business problem.

Syllabus

Introduction

Driving big data engineering with Apache Spark
Course prerequisites
Setting up the exercise files

1. Data Engineering Concepts

What is data engineering?
Data engineering vs. data analytics vs. data science
Data engineering functions
Batch vs. real-time processing
Data engineering with Spark

2. Spark Capabilities for ETL

Spark architecture review
Parallel processing with Spark
Spark execution plan
Stateful stream processing
Spark analytics and ML

3. Batch Processing Pipelines

Batch processing use case: Problem statement
Batch processing use case: Design
Setting up the local DB
Uploading stock to a central store
Aggregating stock across warehouses

4. Real-Time Processing Pipelines

Real-time use case: Problem
Real-time use case: Design
Generating a visits data stream
Building a website analytics job
Executing the real-time pipeline

5. Data Engineering with Spark: Best Practices

Batch vs. real-time options
Scaling extraction and loading operations
Scaling processing operations
Building resiliency

6. End-to-End Exercise Project

Project exercise requirements
Solution design
Extracting long last actions
Building a scorecard

Conclusion

More about Apache Spark

Taught by

Ben Sullins

Reviews

4.6 rating at LinkedIn Learning based on 471 ratings

Start your review of Apache Spark Essential Training

Taught by

Apache Spark with Scala – Hands-On with Big Data!

Scalable Machine Learning on Big Data using Apache Spark

Apache Spark for Data Engineering and Machine Learning

Applying the Lambda Architecture with Spark, Kafka, and Cassandra

Big Data, Hadoop, and Spark Basics

Perform data engineering with Azure Synapse Apache Spark Pools

From Data to Insights: 10 Best Data Analysis Courses for 2024

10 Best Machine Learning Courses for 2024: Scikit-learn, TensorFlow, and more

110+ Hours of Free LinkedIn Learning Courses with Free Certification

Never Stop Learning.