ETL or Extract Transform Load, is the process of gathering data to a central data warehouse for analytics. This course will discuss the steps required for each step in the ETL process and look at different solutions for some real-world scenarios. We will explore extracting data from multiple sources and formats, performing transformations on the data, and finally loading the data into the final location. This course does assume some experience with relational databases, as well as standard Unix command line tools such as sed, grep and awk.
Overview
Syllabus
- Introduction
- Extracting Data
- Transforming Data
- Loading Data
- Process Performance
- Process Challenges
- Conclusion
Taught by
David Thomas