Overview
Whether you’re an aspiring data engineer, data architect, business analyst, or data scientist, strong data warehousing skills are a must. With the hands-on experience and competencies, you gain on this course, your resume will catch the eye of employers and power up your career opportunities.
A data warehouse centralizes and organizes data from disparate sources into a single repository, making it easier for data professionals to access, clean, and analyze integrated data efficiently.
This course teaches you how to design, deploy, load, manage, and query data warehouses, data marts, and data lakes. You’ll dive into designing, modeling, and implementing data warehouses, and explore data warehousing architectures like star and snowflake schemas. You’ll master techniques for populating data warehouses through ETL and ELT processes, and hone your skills in verifying and querying data, and utilizing concepts like cubes, rollups, and materialized views/tables.
Additionally, you’ll gain valuable practical experience working on hands-on labs, where you’ll apply your knowledge to real data warehousing tasks. You’ll work with repositories like PostgreSQL and IBM Db2, and complete a project that you can refer to in interviews.
Syllabus
- Data Warehouses, Data Marts, and Data Lakes
- Welcome to your first module! This module provides an introduction to data warehouse systems, data lakes, and data marts. When you complete this module, you’ll be able to identify and compare data warehouse systems, data marts, and data lakes based on their architecture, and understand how organizations can benefit from each of these three data storage entities. Then, you’ll learn about three types of data warehouse systems and popular data warehouse system vendors. You will learn to help your organization assess new data warehouse system offerings when you know the five essential, critical criteria, including the total cost of ownership, to evaluate before changing to a new data warehouse system.
- Designing, Modeling, and Implementing Data Warehouses
- In this knowledge-packed module, you’ll explore general and reference enterprise data warehousing architecture. You’ll discover how data cubes relate to star schemas. Then, you’ll learn how to slice, dice, drill up or down, roll up, and pivot relative to data cubes. Next, you will examine the capabilities of materialized views, their benefits, and how to apply them. You’ll learn how a data organization using facts and dimensions and their related tables organizes information. Then, you will explore how to use normalization to create a snowflake schema as an extension of the star schema. You will also learn about populating a data warehouse, incremental data updates, verifying data, querying data, and interpreting an entity-relationship diagram for a star schema. Finally, the module will delve into the creation of a materialized view, the application of cube and rollup options, and examine the advantages organizations gain from implementing staging.
- Final Assignment and Final Quiz
- In this module, you’ll complete your practice project and final course project, which bring together concepts and practices you previously learned in the first two modules. In the final project, you will design and load data into a data warehouse using facts and dimension tables. Then you’ll write aggregation queries using cube and rollup functions and create a materialized view. In the optional lesson, you will explore the workings of IBM Db2 data warehouse system architecture, view use cases, and understand the key capabilities and integrations available with IBM Db2 Warehouse. The hands-on labs in this lesson will enable you to gain practical knowledge on how to create a Db2 service instance, how to populate a data warehouse using IBM Db2, how to query the data warehouse using IBM Db2.
Taught by
Ramesh Sannareddy and Rav Ahuja
Tags
Reviews
2.0 rating, based on 1 Class Central review
4.4 rating at Coursera based on 194 ratings
Showing Class Central Sort
-
Unfortunately, there are mistakes in hands on labs, which create problems in understanding the practical aspects of this course. In my opinion, Coursera should improve all hands on labs of this course. It appears that in most labs, the lefthand instruction panel has been created by one version of the pgadmin and the righthand panel in another version. The coursera may look into this matter.
Theoretical concepts have been covered in only one module. For further understanding, more videos or modules may be added.
Overall, I did not find this course very useful.