Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Amazon Web Services

Amazon EMR Getting Started

Amazon Web Services and Amazon via AWS Skill Builder

Overview

Languages Available: Español (España) | 日本語 | 한국어 | 中文(简体)


Amazon EMR is the industry-leading cloud big data solution for petabyte-scale data processing, interactive analytics, and machine learning using open-source frameworks such as Apache Spark, Apache Hive, and Presto. You can use Amazon EMR to set up, operate, and scale your big data environments and automate time-consuming tasks like provisioning capacity.

In this course, you will learn Amazon EMR Serverless which is a new option in Amazon EMR that makes it efficient and cost-effective for data engineers and analysts to run applications built using open-source big data frameworks without having to tune, operate, optimize, secure, or manage clusters. Additionally, you will learn the benefits, typical use cases, and technical concepts of Amazon EMR. You will have an opportunity to try Amazon EMR Serverless and Amazon EMR Cluster through tutorials using the AWS Management Console.

  • Course level: Fundamental
  • Duration: 1 Hour


Course objectives

This course includes presentations, graphics, tutorials, and demonstrations with the option to follow along.


Course objectives

In this course, you will learn to:

  • Understand different deployment options available with Amazon EMR.
  • Understand how Amazon EMR works.
  • Understand the technical concepts of Amazon EMR Serverless.
  • List typical use cases for Amazon EMR Serverless.
  • Understand the technical concepts of Amazon EMR Cluster.
  • List typical use cases for Amazon EMR Cluster.
  • Specify what it would take to implement Amazon EMR in a real-world scenario.
  • Recognize the benefits of Amazon EMR.
  • Explain the cost structure of Amazon EMR.
  • Use Amazon EMR Serverless and Amazon EMR Cluster


Intended audience

This course is intended for:

  • Developers

  • Solutions architects
  • Data engineers
  • Data architects


Prerequisites

AWS Technical Essentials

Data Analytics Fundamentals


Course outline

Introduction

  • Introduction to Amazon EMR
  • Amazon EMR Serverless Architecture and Use Cases
  • Amazon EMR Cluster Architecture and Use Cases

Using Amazon EMR Serverless

  • How Do I Run a Spark Job on Amazon EMR Serverless?

Using Amazon EMR

  • How Do I Create an Amazon EMR on EC2 Cluster?
  • How Do I Create an Amazon EMR Studio?
  • How Do I Create an Amazon EMR Workspace?
  • How Do I Run a Spark Job with Amazon EMR Studio Notebook?

Resources

  • Learn More

Reviews

Start your review of Amazon EMR Getting Started

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.