Overview
Explore Apache Pulsar, a distributed messaging and queuing system, in this 38-minute video presentation from Databricks. Dive into Pulsar's two-layered architecture, which separates message storage from message serving so that each layer can scale independently without sacrificing durability. Learn how Pulsar unifies messaging, queuing, and lightweight computing with the scalable properties of log storage systems. Discover the benefits of using Apache BookKeeper as the underlying data store, which provides data consistency and durability while maintaining strict SLAs. Gain insights into Pulsar Functions, a serverless framework for real-time data processing. Understand Pulsar's components, client libraries, architecture, and features such as repeaters, latency optimization, partitioning, and multitenancy. Explore practical applications through use cases, Pulsar IO, connectors, and the Pulsar SQL implementation. By the end of this talk, understand how Apache Pulsar enables the development of end-to-end streaming applications and how it can be deployed in various scenarios.
Syllabus
Introduction
Messaging vs Streaming
Use Cases
Pulsar Components
Client Libraries
Architecture
Repeaters
Latency and Throughput
Segments
Partitions
Offloading
Registry
Schema
Application
Subscriptions
Multitenancy
Pulsar State Example
Pulsar IO
Pulsar Connectors
Data Stored in Pulsar
Pulsar SQL
Pulsar SQL Implementation
About Apache Pulsar
Taught by
Databricks