This lab will teach you how to use the Pipeline Studio in Cloud Data Fusion to build an ETL pipeline. Pipeline Studio exposes the building blocks and built-in plugins for you to build your batch pipeline, one node at a time. You will also use the Wrangler plugin to build and apply transformations to your data that goes through the pipeline.
Overview
Syllabus
- GSP807
- Overview
- Setup and requirements
- Task 1. Load the data
- Task 2. Add the necessary permissions for your Cloud Data Fusion instance
- Task 3. Build a batch pipeline
- Task 4. Take in the pipeline studio
- Task 5. Configure the pipeline
- Task 6. Test the pipeline
- Task 7. View the results
- Congratulations!