Serverless Data Processing with Dataflow - Advanced Streaming Analytics Pipeline with Cloud Dataflow (Java)
Google via Google Cloud Skills Boost
Overview
In this lab you read deal with late and malformed streaming data using advanced Apache Beam concepts.
Syllabus
- Overview
- Setup and requirements
- Lab part 1. Dealing with late data
- Task 1. Prepare the environment
- Task 2. Set allowed lateness
- Task 3. Set a trigger
- Lab part 2. Dealing with malformed data
- Task 1. Collect malformed data
- Task 2. Make code more modular with a composite transform
- Task 3. Write malformed data for later analysis
- Task 4. Run your pipeline
- Task 5. Test your pipeline
- End your lab