What you'll learn:
- Understanding of AWS Glue Data Catalog and creating AWS Glue Database, Glue Tables and Crawlers
- Using AWS Glue Studio, creating the ETL pipeline along with scheduled triggers, conditional triggers and glue workflow
- KMS, IAM Role, SNS, S3 and other associated AWS resources associated with Glue. Understanding and creation of all the resources
- Understanding of AWS Glue Data Quality and creating the associated Glue ETL pipeline
- Understanding AWS Glue Data Brew , creating the recipe, project and job to curate the dataset
- Understanding the AWS Glue streaming, creating the stream using the Python shell job and load the stream using the Spark streaming
- Different ways AWS Glue job can fail and debugging the failure and fix
- Creating the AWS resources for AWS Glue Pipeline using the AWS console and cloudformation
Learn the latest in AWSGlue - And learn to use it with other AWSresources.
In this growing world of data and growing cloud computing, it is necessary to have the core competency in cloud ETLtool also. AWSGlue come with the in built Spark support, Data Quality and data curation using Data brew. The top technology, finance and insurance companies like JPMC, Vanguard, BCBS, Amazon, Capital One, Capgemini, FINRA and more are all using AWSGlue to run their ETLon PetaBytes scale of data everyday.
AWSGlue provides server less and scalable ETLsolution where scripts can be written in Python, Spark and currently using Ray. It also provides the visual drag and drop options to create the ETL pipelines. As now more and more companies are migrating to cloud it has caused an explosion in demand for this skill! With the mastery of AWSGlue, you now have the ability to quickly become one of the most knowledgeable people in the job market!
This course will teach the basics in AWS Glue Data Catalog, AWSGlue Studio, AWSresources such as IAM, SNS, KMS, CloudFormation, CloudWatch and continuing on to learning how to use AWSGlue to build ETLsolution for the organization! Once we've done that we'll go through how to use the Glue Data Quality, Glue Streaming and Glue Data Brew ETLpipelines. All along the way you'll have multiple labs to create all the resources and ETLpipelines using AWSconsole and CloudFormation templates that you put you right into a real world situation where you need to use your new skills to solve a real problem!
If you're ready to jump into the data engineering world of AWSGlue, this is the course for you!