The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing. Learn the fundamental principles behind it, and how you can use its power to make sense of your Big Data.
Overview
Syllabus
- Big Data
- What is Big Data?,The problems big data creates.,How Apache Hadoop addresses these problems.
- HDFS and MapReduce
- Discover how HDFS distributes data over multiple computers.,Learn how MapReduce enables analyzing datasets in parallel across multiple machines.
- MapReduce code
- Write your own MapReduce code.
- MapReduce Design Patterns
- Use common patterns for MapReduce programs to analyze Udacity forum data.
Taught by
Ian Wrigley and Sarah Sproehnle
Tags
Reviews
4.0 rating, based on 21 Class Central reviews
Showing Class Central Sort
-
Debugging Hadoop code is pain, but otherwise very nice class. I appreciate that it brings real-world examples and lets you try the code on live system, not only udacity coding platform.
-
Can anyone tell what the benefit end users usually forum moderators will get from the extracted information in the final project?? i.e, Benefits of
1-List of students per thread
2-Finding Top Ten Tags.
3- Student Timings i.e. at which they are most active
4-Co-relation between question and answer.
-
This is a solid introduction to Hadoop and MapReduce concepts. The assignments are a good exercise in getting familiar with the basics. There's a lot that this course doesn't cover, but it's enough to get your feet wet with Hadoop and MapReduce concepts.
-
Very nice, introductory course; Video content is not much; very short and fast, but the exercise questions are well designed and illustrate some real-world applications of MapReduce
-
Ridiculously tiny videos, no detailed explanations, disappointed. Please add some more content and don't rush thorough them. Udacity could have done better
-
This is good place to start with Hadoop HDFS filesystem and map reduce concepts. Good balance between theory and practice and approach is comfortable for novice.
-
-
-
-
-
-