Explore Apache Spark's architecture, data lineage, and Direct Acyclic Graph (DAG) in this 39-minute session. Dive into the multi-language engine designed for executing data engineering, data science, and machine learning tasks on both single-node machines and clusters. Gain insights into distributed computing concepts and learn how Apache Spark facilitates efficient data processing across various applications.
Overview
Syllabus
Getting Started with Apache Spark
Taught by
NashKnolX