Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Pluralsight

Coping with Missing, Invalid, and Duplicate Data in R

via Pluralsight

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Learn about the most essential steps of data preparation: Missing value imputation, outlier detection, and duplicate removal.

Data preparation is part of nearly any data analytics project, therefore the skills are highly valuable. In this course, Coping with Missing, Invalid, and Duplicate Data in R, you will learn the main steps of data preparation. First, you will learn how to handle duplicate data. Next, you will discover that missing values prevent a lot of R functions from working properly, therefore you are limited in your R toolset as long as you do not take care of all these NA's. Finally, you will explore outlier and invalid data detection and how they can introduce bias into your analysis. When you’re finished with this course, you will understand why missing values, outliers, and duplicates are problematic, how to detect them, and how to remove them from the dataset.

Taught by

Martin Burger

Reviews

4.5 rating at Pluralsight based on 10 ratings

Start your review of Coping with Missing, Invalid, and Duplicate Data in R

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.