Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Distributed Real Time Stream Processing - Why and How

Scala Days Conferences via YouTube

Overview

Explore distributed real-time stream processing frameworks in this 42-minute conference talk from Scala Days New York 2016. Dive into popular open-source solutions like Spark Streaming, Storm, Samza, and Flink, comparing their similarities, differences, and trade-offs. Gain insights into theoretical foundations, common pitfalls, and popular architectures for handling the increasing demand for fast processing of immense data from disparate sources. Learn how to choose the right framework for various use cases, including trading, social networks, Internet of Things, and system monitoring. Discover comprehensive overviews of modern streaming solutions, runtime and programming models, fault tolerance, state management, and performance considerations. Examine project maturity for different frameworks and receive general guidelines and recommendations for implementing streaming solutions.

Syllabus

Intro
The Data Deluge
Distributed Stream Processing
Points of interest
Runtime and Programming Model
Native Streaming
Micro-batching
Apache Streaming Landscape
System Comparison
Fault Tolerance
Managing State
Counting Words Revisited
Performance
Project Maturity [Storm & Trident]
Project Maturity [Spark Streaming]
Project Maturity [Samza]
Project Maturity [Flink]
Summary
General Guidelines
Recommendations [Storm & Trident]
Recommendations [Spark Streaming]
Recommendations Samza
Recommendations [Apex]
Recommendations [Flink]
Dataflow and Apache Beam
Questions

Taught by

Scala Days Conferences

Reviews

Start your review of Distributed Real Time Stream Processing - Why and How

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.