Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Pluralsight

Data Cleaning and Quality Assurance in R

via Pluralsight

Overview

Data preparation is a critical step in turning raw information into actionable insights. This course will teach you how to clean, structure, and validate data in R to solve real-world problems efficiently.

Preparing data for analysis can be a daunting task, especially when dealing with missing values, outliers, and inconsistent formats that compromise the integrity of your insights. In this course, Data Cleaning and Quality Assurance in R, you’ll gain the ability to handle inconsistent, real-world datasets and transform them into reliable and analyzable formats. First, you’ll explore strategies to identify and address missing data, including summarizing missing patterns and imputing values using statistical and conditional techniques. Next, you’ll discover how to detect and manage outliers in both numerical and categorical data using visualizations, statistical methods, and targeted replacements. Finally, you’ll learn how to ensure data consistency by converting data types, standardizing units, and implementing validation checks to maintain data integrity. When you’re finished with this course, you’ll have the skills and knowledge of data cleaning and preparation needed to confidently preprocess datasets for analysis.

Syllabus

  • Identifying and Addressing Missing Data 24mins
  • Detecting and Handling Outliers 17mins
  • Handling Data Consistency and Type Correctness 10mins

Taught by

Janani Ravi

Reviews

Start your review of Data Cleaning and Quality Assurance in R

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.