Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

XuetangX

Big Data Analysis

Beijing Institute of Technology via XuetangX

Overview

The course of Big Data Analysis Technology in English builds a learner competency hierarchy according to the BLOOM 's Taxonomy. It systematically explains the basic knowledge and necessary skills of big data analysis. It develops students' application and analytical skills based on the knowledge memorizing and understanding, in further develop the evaluating and creating ability.&nbspIt combined the theoretical explanation and engineering practice training.

1.&nbsp&nbsp&nbsp&nbsp&nbspTheory&nbsp:

Big Data Analysis Technology theoretical knowledge is carried out following the general architecture: 1 data storing system, data processing system and data application system. Shown as the figure.

In 1 data storing system, it can be divided into 4 parts

1.1 Data collection and modeling;

1.2 Distributed file system;

1.3 Distributed Database and data warehouse;

1.4 Unified Data Access Interface

In 2 data processing system, it can be divided into 3 parts

2.1 Data analysis algorithm

2.2 Computing model;

2.3 Computing Engine and platform;

In 2 data application system, it can be divided into 3 parts

3.1 big data visualization

3.2 big data product and services

3.3 big data application

In the big data application, the relevant principles and algorithms are explained, taking two typical big data applications of recommendation system and social network as an example.

And in Theoretical Explanation, the whole course content roadmap helps students to systematically establish a knowledge system of big data analysis. When explaining the complex knowledge points, the excellent explanation videos selected from the whole network facilitate the students clearly understand what they have learned.&nbspBecause the excellent video is not only a summary of the rich experience of the explainer, but also equipped with an explanation animation display, which explains the principle of the relevant knowledge very intuitively and clearly.

2. Experiments

Five experiments were designed in this part. includes:

1.&nbsp&nbsp&nbsp&nbsp Dynamic web crawler

2.&nbsp&nbsp&nbsp&nbsp Spark MLlib learning and applying

3.&nbsp&nbsp&nbsp&nbsp TensorFlow learning and applying

4.&nbsp&nbsp&nbsp&nbsp Recommendation system understanding and construction

5.&nbsp&nbsp&nbsp&nbsp Social Network Analysis and Visualization

For the above five topics, several experiments from simple to complex, from the shallow to the deep are designed to train students' hands-on ability of big data analysis. All experiments are equipped with complete case explanations, including experimental design ideas and steps, experimental manuals and source codes.

Through the theoretical explanation of big data analysis technology and engineering experiment training, students can build a knowledge system,In-depth understanding of the concepts, principles, platforms, technologies, etc. in big data analysis technology. Through the hands-on practice of the experiment, the practical application of the theoretical knowledge of big data is realized, the understanding of the principle and concept is deepened, and the analysis and solution ability to solve engineering problems of big data analysis is improved. At the same time, the big data analysis relevant English ability can be greatly improved.

Syllabus

  • Chapter 1 Introduction
    • Big Data Analysis 1-1 Basic Concept
    • Big Data analysis 1-2 structured data and unstructured data
    • Big Data Analysis 1-3 the Fourth Paradigm
    • Big Data analysis 1-4 Big Data Characters
    • Big Data Analysis 1-5 Big Data Lifecycle
    • Big Data Analysis 1-6 Processing Flow
    • Big Data Analysis 1-7 Architecture
  • Chapter 2 Data Collection
    • Big Data Analysis 2-1 Data Resources
    • Big data analysis 2-2 Internal Data Acquisition
    • Big data analysis 2-3 External Data Acquisition
    • Big Data Analysis 2-4 Deep Web
  • Chapter 3 Data Preprocessing
    • Big Data Analysis 3-1 Data Preprocessing Overview
    • Big Data Analysis 3-2 Data Quality
    • Big data analysis 3-3 Data Cleaning Technology
    • Big Data Analysis 3-4 Data Tranform
    • Big Data Analysis 3-5 Data Reduction
  • Chapter 4 Data Storing System
    • 4-1 Data Modeling
    • 4-2 Distributed File System
    • 4-3 No SQL DataBase
    • 4-4 Characters of No SQL DB
    • 4-5 Four Types of No SQL DB
    • 4-6 UDAI
  • Chapter 5 Data Processing System
    • BDA 5-1 Data Processing System Architecture
    • BDA 5-2 Data Processing Algorithms
    • BDA 5-22 Big Data Analysis Algorithms
    • 5.2.1 Linear Regression Using Least Square Method
    • 5.2.2 K-Nearest Neighbors
    • 5.2.3 K-means clustering
    • 5.2.4 Naive Bayes
    • 5.2.5 Apriori Algorithm Explained Association Rule Mining
    • 5.2.6 Support Vector Machines
    • 5.2.7 CNN
    • 5.2.8 Principal Component Analysis
    • 5.3 Batch processing
    • 5.4 Streaming processing
    • 5.5 Massively Parallel Processing for Structured Data
    • 5.6 In Memory Computing-Spark
    • 5.7 In Memory Computing-HANA
    • 5.8 Distributed Graph computing-Pregel
  • Chapter 6 Big Data Computing Platforms and Applications
    • Chapter 6-1 Spark MLlib
    • Chapter 6-2 TensorFlow
    • Chapter 6-3 Recommendation System
    • Chapter 6-4 Recommendation system 2
    • Chapter 6-5 Social Network Analysis

Taught by

Haiying Che

Tags

Reviews

Start your review of Big Data Analysis

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.