Overview
Syllabus
Introduction
Introduction to Data Engineering
Data Engineer vs Data Scientist
Important Skills for Data Engineering
Data Engineering Lifecycle
Data Structures in Python: Tuple, List, Dictionary & Set
Flow Control Statements
Inheritance in Python
Python Numpy
Python Pandas
Data Visualization with Python: Matplotlib
Data Visualization with Python: SeaBorn
Python Project on IPL Data
SQL - Introduction and Installation
Data Types in SQL
Hands-on based on HR Database Management System
Introduction to Big Data
ETL Extract-Transform-Load
Introduction to Hadoop
Distributed Computing
Hadoop Architecture
HDFS File Storage
Introduction to Oozie and HDFS Processing
Hadoop Clusters
Hadoop Ecosystem
Introduction to Spark
Introduction to Real-Time Analytics
Difference between Batch & Real-Time Systems
Input Output Connectors
Twitter Streaming in Real-Time - Demo
What is Data Warehouse for Data Engineer
Need of Data Warehouse
Top Data Warehouse Tools
What is Cloud Computing?
World before Cloud Computing
World before Cloud Computing
Need For Cloud Computing
Working of Cloud Computing
Cloud Providers
Basics of AWS
AWS CloudFront
Demo: AWS CloudFront
AWS HoneyCode
AWS Amplify
Demo: AWS Honeycode & AWS Amplify
Summary
Taught by
Great Learning