Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

University of California, San Diego

Introduction to Big Data Analytics

University of California, San Diego via Coursera

This course may be unavailable.

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
*********
A new, improved version of the Big Data Specialization will become available on June 6! As such, enrollment for this course and all courses in this original Big Data Specialization will close on June 6.

The original Big Data Specialization will continue to run until September 2016, when the Capstone will be offered for learners in this version of the Specialization.

If you are in the middle of the Specialization and have purchased the entire original Big Data Specialization before June 6, Coursera will reach out to you to offer you the option of staying in the original Specialization or taking the new version.

If you are just getting started on this Specialization, we recommend that you wait until June 6 to enroll in the new version.

*********


This course is for novice programmers or business people who'd like to understand more advanced tools used to wrangle and analyze big data. In this course you will be guided in basic approaches to querying and exploring data using higher level tools built on top of a Hadoop Platform. You will be walked through query interfaces, environments, and the canonical situations for tools like HBASE, HIVE, Pig, as well as more general tools like Spark-SQL. After this course you will be able to identify the kinds of analysis you can get of big data and how to interpret these results.

Syllabus

HBASE: Hadoop's database
In this Module we will talk about HBase - Hadoop’s database, a distributed, scalable, big data store.

HIVE: a Hadoop-based data warehouse
In this Module we will learn about Hive, the data warehousing infrastructure based on Hadoop that facilitates querying and managing large datasets residing in distributed storage.

PIG: A Dataflow Engine for Hadoop
Wouldn’t it be nice if you could quickly gather/sample/view and perform simple analysis on HDFS data? Then PIG might fly for you! This module will introduce the essential concepts and basic execution of PIG.

Splunk: Log Analysis and More
Welcome to the data analytics with Splunk module- this week, we’re going to go further in our quest to examine different ways and tools to deploy analytics on large dataset and gain valuable insight.

Spark for Analytics
Welcome to Spark for Analytics, this week we will learn about Spark SQL. Spark SQL provides a higher level interface to process your data and write more expressive code. We'll focus on data exploration, cleaning and plotting.

Taught by

Paul Rodriguez, Andrea Zonca and Natasha Balac

Reviews

1.0 rating, based on 8 Class Central reviews

Start your review of Introduction to Big Data Analytics

  • Part 3 of the Big Data Specialization was definitely mediocre. I no way do I question the technical expertise of the lecturers, but from the perspective of instructional design, the course was awful. The contents were presented in a dull fashion, th…
  • Profile image for Pablo Torre
    Pablo Torre
    The course is implemented in an unprofessional and mediocre manner. All the practice "hands on sections" are walk throughs of the tutorials included in the cloudera QuickStart VM. The quizzes have questions that are just wrong from cryptic multiple choice questions that rely on obscure grammar to "appear harder", to concepts that are included in the quiz before they are taught in class and even one case where the correct answer is not included in the multiple choice and one has to choose an answer that reflects a misconception and poor understanding of the material, in order to get it right.
    The one thing that worked with this course was coursera's refund mechanism.
  • Anonymous
    I paid only for the first "course" (the intro) and after 2 weeks asked for a refund. Compared to other Coursera courses, this one should be called a "lecture" rather than a course.
    The content is extremely limited and you are asked to wait for the next courses to really get answers to your questions.
  • Anonymous
    teaching / slides / assignments and quizzes have no relation among st each other. Specially Pig module. The lecture videos are like just slide reading and makes no sense even at the end of the lecture. Again Pig module
  • Anonymous
    The course is absolutely unprofessional. Videos contain obvious mistakes and instructions provided by teachers just do not work. Even more, the virtual machine that is used for hands-on tasks misses some files that are required to pass the assignments.

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.