Overview
This Professional Certificate is intended to help you develop the job-ready skills and portfolio for an entry-level Business Intelligence (BI) or Data Warehousing Engineering position. Throughout the online courses in this program, you will immerse yourself in the in-demand role of a Data Warehouse Engineer and acquire the essential skills you need to work with a range of tools and databases to design, deploy, operationalize and manage Enterprise Data Warehouses (EDW).
By the end of this Professional Certificate, you will be able to perform the key tasks required in a data warehouse engineering role. You will work with Relational Database Management Systems (RDBMS) and query data using SQL statements.
You will use Linux/UNIX shell scripts to automate repetitive tasks, and build data pipelines with tools like Apache Airflow and Kafka to Extract, Transform and Load (ETL) data. You will gain experience with managing databases and data warehouses.
Finally, you will design and populate data warehouse systems and utilize Business Intelligence tools to analyze and extract insights using reports and dashboards.
This program is suitable for anyone with a passion for learning and is suitable for you whether you have a college degree or not and does not require any prior data engineering, or programming experience.
Syllabus
Course 1: Introduction to Data Engineering
- Offered by IBM. Start your journey in one of the fastest growing professions today with this beginner-friendly Data Engineering course! You ... Enroll for free.
Course 2: Introduction to Relational Databases (RDBMS)
- Offered by IBM. Are you ready to dive into the world of data engineering? In this beginner level course, you will gain a solid understanding ... Enroll for free.
Course 3: SQL: A Practical Introduction for Querying Databases
- Offered by IBM. Much of the world's data lives in databases. SQL (or Structured Query Language) is a powerful programming language that is ... Enroll for free.
Course 4: Hands-on Introduction to Linux Commands and Shell Scripting
- Offered by IBM. This course provides a practical understanding of common Linux / UNIX shell commands. In this beginner friendly course, you ... Enroll for free.
Course 5: Relational Database Administration (DBA)
- Offered by IBM. Get started with Relational Database Administration and Database Management in this self-paced course! This course begins ... Enroll for free.
Course 6: ETL and Data Pipelines with Shell, Airflow and Kafka
- Offered by IBM. Delve into the two different approaches to converting raw data into analytics-ready data. One approach is the Extract, ... Enroll for free.
Course 7: Data Warehouse Fundamentals
- Offered by IBM. Whether you’re an aspiring data engineer, data architect, business analyst, or data scientist, strong data warehousing ... Enroll for free.
Course 8: BI Dashboards with IBM Cognos Analytics and Google Looker
- Offered by IBM. Business Intelligence (BI) Analyst is one of the top 3 fastest growing roles, according to Statista in its ‘Which Jobs Have ... Enroll for free.
Course 9: Data Warehousing Capstone Project
- Offered by IBM. In this course you will apply a variety of data warehouse engineering skills and techniques you have learned as part of the ... Enroll for free.
- Offered by IBM. Start your journey in one of the fastest growing professions today with this beginner-friendly Data Engineering course! You ... Enroll for free.
Course 2: Introduction to Relational Databases (RDBMS)
- Offered by IBM. Are you ready to dive into the world of data engineering? In this beginner level course, you will gain a solid understanding ... Enroll for free.
Course 3: SQL: A Practical Introduction for Querying Databases
- Offered by IBM. Much of the world's data lives in databases. SQL (or Structured Query Language) is a powerful programming language that is ... Enroll for free.
Course 4: Hands-on Introduction to Linux Commands and Shell Scripting
- Offered by IBM. This course provides a practical understanding of common Linux / UNIX shell commands. In this beginner friendly course, you ... Enroll for free.
Course 5: Relational Database Administration (DBA)
- Offered by IBM. Get started with Relational Database Administration and Database Management in this self-paced course! This course begins ... Enroll for free.
Course 6: ETL and Data Pipelines with Shell, Airflow and Kafka
- Offered by IBM. Delve into the two different approaches to converting raw data into analytics-ready data. One approach is the Extract, ... Enroll for free.
Course 7: Data Warehouse Fundamentals
- Offered by IBM. Whether you’re an aspiring data engineer, data architect, business analyst, or data scientist, strong data warehousing ... Enroll for free.
Course 8: BI Dashboards with IBM Cognos Analytics and Google Looker
- Offered by IBM. Business Intelligence (BI) Analyst is one of the top 3 fastest growing roles, according to Statista in its ‘Which Jobs Have ... Enroll for free.
Course 9: Data Warehousing Capstone Project
- Offered by IBM. In this course you will apply a variety of data warehouse engineering skills and techniques you have learned as part of the ... Enroll for free.
Courses
-
Start your journey in one of the fastest growing professions today with this beginner-friendly Data Engineering course! You will be introduced to the core concepts, processes, and tools you need to know in order to get a foundational knowledge of data engineering. as well as the roles that Data Engineers, Data Scientists, and Data Analysts play in the ecosystem. You will begin this course by understanding what is data engineering as well as the roles that Data Engineers, Data Scientists, and Data Analysts play in this exciting field. Next you will learn about the data engineering ecosystem, the different types of data structures, file formats, sources of data, and the languages data professionals use in their day-to-day tasks. You will become familiar with the components of a data platform and gain an understanding of several different types of data repositories such as Relational (RDBMS) and NoSQL databases, Data Warehouses, Data Marts, Data Lakes and Data Lakehouses. You’ll then learn about Big Data processing tools like Apache Hadoop and Spark. You will also become familiar with ETL, ELT, Data Pipelines and Data Integration. This course provides you with an understanding of a typical Data Engineering lifecycle which includes architecting data platforms, designing data stores, and gathering, importing, wrangling, querying, and analyzing data. You will also learn about security, governance, and compliance. You will learn about career opportunities in the field of Data Engineering and the different paths that you can take for getting skilled as a Data Engineer. You will hear from several experienced Data Engineers, sharing their insights and advice. By the end of this course, you will also have completed several hands-on labs and worked with a relational database, loaded data into the database, and performed some basic querying operations.
-
Are you ready to dive into the world of data engineering? In this beginner level course, you will gain a solid understanding of how data is stored, processed, and accessed in relational databases (RDBMSes). You will work with different types of databases that are appropriate for various data processing requirements. You will begin this course by being introduced to relational database concepts, as well as several industry standard relational databases, including IBM DB2, MySQL, and PostgreSQL. Next, you’ll utilize RDBMS tools used by professionals such as phpMyAdmin and pgAdmin for creating and maintaining relational databases. You will also use the command line and SQL statements to create and manage tables. This course incorporates hands-on, practical exercises to help you demonstrate your learning. You will work with real databases and explore real-world datasets. You will create database instances and populate them with tables and data. At the end of this course, you will complete a final assignment where you will apply your accumulated knowledge from this course and demonstrate that you have the skills to: design a database for a specific analytics requirement, normalize tables, create tables and views in the database, load and access data. No prior knowledge of databases or programming is required. Anyone can audit this course at no-charge. If you choose to take this course and earn the Coursera course certificate, you can also earn an IBM digital badge upon successful completion of the course.
-
Delve into the two different approaches to converting raw data into analytics-ready data. One approach is the Extract, Transform, Load (ETL) process. The other contrasting approach is the Extract, Load, and Transform (ELT) process. ETL processes apply to data warehouses and data marts. ELT processes apply to data lakes, where the data is transformed on demand by the requesting/calling application. In this course, you will learn about the different tools and techniques that are used with ETL and Data pipelines. Both ETL and ELT extract data from source systems, move the data through the data pipeline, and store the data in destination systems. During this course, you will experience how ELT and ETL processing differ and identify use cases for both. You will identify methods and tools used for extracting the data, merging extracted data either logically or physically, and for loading data into data repositories. You will also define transformations to apply to source data to make the data credible, contextual, and accessible to data users. You will be able to outline some of the multiple methods for loading data into the destination system, verifying data quality, monitoring load failures, and the use of recovery mechanisms in case of failure. By the end of this course, you will also know how to use Apache Airflow to build data pipelines as well be knowledgeable about the advantages of using this approach. You will also learn how to use Apache Kafka to build streaming pipelines as well as the core components of Kafka which include: brokers, topics, partitions, replications, producers, and consumers. Finally, you will complete a shareable final project that enables you to demonstrate the skills you acquired in each module.
-
Get started with Relational Database Administration and Database Management in this self-paced course! This course begins with an introduction to database management; you will learn about things like the Database Management Lifecycle, the roles of a Database Administrator (DBA) as well as database storage. You will then discover some of the activities, techniques, and best practices for managing a database. You will also learn about database optimization, including updating statistics, slow queries, types of indexes, and index creation and usage. You will learn about configuring and upgrading database server software and related products. You’ll also learn about database security; how to implement user authentication, assign roles, and assign object-level permissions. And gain an understanding of how to perform backup and restore procedures in case of system failures. You will learn how to optimize databases for performance, monitor databases, collect diagnostic data, and access error information to help you resolve issues that may occur. Many of these tasks are repetitive, so you will learn how to schedule maintenance activities and regular diagnostic tests and send automated messages of the success or failure of a task. The course includes both video-based lectures as well as hands-on labs to practice and apply what you learn. This course ends with a final project where you will assume the role of a database administrator and complete a number of database administration tasks across many different databases.
-
This course provides a practical understanding of common Linux / UNIX shell commands. In this beginner friendly course, you will learn about the Linux basics, Shell commands, and Bash shell scripting. You will begin this course with an introduction to Linux and explore the Linux architecture. You will interact with the Linux Terminal, execute commands, navigate directories, edit files, as well as install and update software. Next, you’ll become familiar with commonly used Linux commands. You will work with general purpose commands like id, date, uname, ps, top, echo, man; directory management commands such as pwd, cd, mkdir, rmdir, find, df; file management commands like cat, wget, more, head, tail, cp, mv, touch, tar, zip, unzip; access control command chmod; text processing commands - wc, grep, tr; as well as networking commands - hostname, ping, ifconfig and curl. You will then move on to learning the basics of shell scripting to automate a variety of tasks. You’ll create simple to more advanced shell scripts that involve Metacharacters, Quoting, Variables, Command substitution, I/O Redirection, Pipes & Filters, and Command line arguments. You will also schedule cron jobs using crontab. The course includes both video-based lectures as well as hands-on labs to practice and apply what you learn. You will have no-charge access to a virtual Linux server that you can access through your web browser, so you don't need to download and install anything to complete the labs. You’ll end this course with a final project as well as a final exam. In the final project you will demonstrate your knowledge of course concepts by performing your own Extract, Transform, and Load (ETL) process and create a scheduled backup script. This course is ideal for data engineers, data scientists, software developers, and cloud practitioners who want to get familiar with frequently used commands on Linux, MacOS and other Unix-like operating systems as well as get started with creating shell scripts.
-
Much of the world's data lives in databases. SQL (or Structured Query Language) is a powerful programming language that is used for communicating with and manipulating data in databases. A working knowledge of databases and SQL is a must for anyone who wants to start a career in Data Engineering, Data Warehousing, Data Analytics, Data Science or Business Intelligence. The purpose of this course is to help you learn and apply foundational and intermediate knowledge of the SQL language, and become familiar with many relational database (RDBMS) concepts along the way. You will start with performing basic Create, Read, Update and Delete (CRUD) operations using CREATE, SELECT, INSERT, UPDATE and DELETE statements. You will then learn to filter, order, sort, and aggregate data. You will work with functions, perform sub-selects and nested queries, as well as JOIN data in multiple tables. You will also work with VIEWS, transactions and create stored procedures. The emphasis in this course is on hands-on, practical learning. As such, you will work with real database systems, use real tools, and real-world datasets. You will create a database instance in the cloud. Through a series of hands-on labs, you will practice building and running SQL queries. At the end of the course you will apply and demonstrate your skills with a final project. The SQL skills you learn in this course will be applicable to a variety of RDBMSes such as MySQL, PostgreSQL, IBM Db2, Oracle, SQL Server and others. No prior knowledge of databases, SQL or programming is required, however some basic data literacy is beneficial.
-
In this course you will apply a variety of data warehouse engineering skills and techniques you have learned as part of the previous courses in the IBM Data Warehouse Engineer Professional Certificate. You will assume the role of a Junior Data Engineer who has recently joined the organization and be presented with a real-world use case that requires a data warehouse engineering solution.
-
Business Intelligence (BI) Analyst is one of the top 3 fastest growing roles, according to Statista in its ‘Which Jobs Have a Future’ update. IBM Cognos Analytics and Google Looker Studio are powerful BI tools used for data visualization, analytics, and reporting. This short course helps you to build IBM Cognos Analytics and Google Looker Studio skills that can open up opportunities in business analytics, data science, and BI across industries. The course introduces you to the features and capabilities of IBM Cognos Analytics and Google Looker Studio. You’ll learn the basics of visualizing data without writing code, plus how use both to create interactive dashboards. You’ll also gain practical experience through hands-on labs, and you’ll complete a final project in which you’ll create data visualizations and an interactive dashboard that you can share with prospective employers to highlight your skills. If you’re looking to get started as a data analyst, BI analyst or data warehouse specialist, this course provides the ideal introduction to two high profile tools used in these roles. Enroll in this self-paced course today, and develop valuable BI Dashboard skills you can talk about in interviews.
-
Whether you’re an aspiring data engineer, data architect, business analyst, or data scientist, strong data warehousing skills are a must. With the hands-on experience and competencies, you gain on this course, your resume will catch the eye of employers and power up your career opportunities. A data warehouse centralizes and organizes data from disparate sources into a single repository, making it easier for data professionals to access, clean, and analyze integrated data efficiently. This course teaches you how to design, deploy, load, manage, and query data warehouses, data marts, and data lakes. You’ll dive into designing, modeling, and implementing data warehouses, and explore data warehousing architectures like star and snowflake schemas. You’ll master techniques for populating data warehouses through ETL and ELT processes, and hone your skills in verifying and querying data, and utilizing concepts like cubes, rollups, and materialized views/tables. Additionally, you’ll gain valuable practical experience working on hands-on labs, where you’ll apply your knowledge to real data warehousing tasks. You’ll work with repositories like PostgreSQL and IBM Db2, and complete a project that you can refer to in interviews.
Taught by
IBM Skills Network Team, Jeff Grossman, Lavanya Thiruvali Sunderarajan, Priya Kapoor, Ramesh Sannareddy, Rav Ahuja, Sabrina Spillner, Sam Prokopchuk, Sandip Saha Joy, Shubhra Das and Yan Luo