Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

FutureLearn

Hadoop Ecosystem Essentials

Packt via FutureLearn

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!

Learn the skills needed to succeed as a data analyst

For data analysts, Hadoop is an extremely powerful tool to help process large amounts of data and is used by successful companies such as Google and Spotify.

On this four-week course, you’ll learn how to use Hadoop to its full potential to make it easier for you to store, analyse, and scale big data.

Through step-by-step guides and exercises, you’ll gain the knowledge and practical skills to take into your role in data analytics.

Understand how to manage your Hadoop cluster

You’ll understand how to manage clusters with Yet Another Resource Negotiator (YARN), Mesos, Zookeeper, Oozie, Zeppelin, and Hue.

With this knowledge, you’ll be able to ensure high performance, workload management, security, and more.

Learn how to analyse streams of data

Next, you’ll uncover the techniques to handle and stream data in real-time using Kafka, Flume, Spark Streaming, Flink, and Storm.

This understanding will help you to react and respond quickly to any issues that may arise.

Hone your data handling skills

Finally, you’ll learn how to design real-world systems using the Hadoop ecosystem to ensure you can use your skills in practice.

By the end of the course, you’ll have the knowledge to handle large amounts of data using Hadoop.

This course is designed for anyone who wants to hone their data handling skills using Hadoop.

You’ll be shown how to use a variety of open source utilities within the Hadoop environment. We assume you’ve already installed the Hadoop environment. If you haven’t, check out Introduction to Big Data Analytics with Hadoop.

Syllabus

  • Querying data interactively in Hadoop
    • Introduction to the course
    • Apache Drill
    • Apache Phoenix
    • Presto
    • Wrap up
  • Managing your cluster in Hadoop
    • Introduction to Week 2
    • Managing resources
    • Managing clusters and tasks
    • Other technologies
    • Wrap up
  • Feeding and analysing data in Hadoop
    • Introduction to Week 3
    • Kafka
    • Apache Flume
    • Spark Streaming
    • Introducing Apache Storm
    • Flink
    • Wrap up
  • Designing real-world systems
    • Introduction to Week 4
    • Architecture design
    • Wrap up

Taught by

Astrid deRidder

Reviews

Start your review of Hadoop Ecosystem Essentials

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.