Completed
Informatica ETL Pipeline
Class Central Classrooms beta
YouTube videos curated by Class Central.
Classroom Contents
Faster Data Integration Pipeline Execution Using Spark-Jobserver
Automatically move to the next video in the Classroom when playback concludes
- 1 Intro
- 2 Informatica ETL Pipeline
- 3 Dealing with buggy pipelines
- 4 Data Preview - Feature Requirements
- 5 What spark-submit based data preview achieved?
- 6 Execution Profiling Results - Spark-submit
- 7 Compare Spark-submit with Spark Job Server
- 8 Spark-submit based Architecture
- 9 SJS based Architecture
- 10 Execution Flow
- 11 Spark Job Server vs Spark-submit
- 12 Setup Details
- 13 Getting started
- 14 Environment Variables (local.sh. template)
- 15 Application Code Migration
- 16 WordCount Example
- 17 Running Jobs
- 18 Handling Job Dependencies
- 19 Multiple Spark Job Servers
- 20 Concurrency
- 21 Support for Kerberos
- 22 HTTPS/SSL Enabled Server
- 23 Logging
- 24 Key Takeaways
- 25 Timeouts (in local.conf. template)
- 26 Complex Data Representation in Informatica Developer Tool
- 27 Monitoring: Binaries
- 28 Monitoring: Spark Context
- 29 Monitoring: Jobs
- 30 Monitoring: Yarn Job