Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Pluralsight

Validate Data Cleanliness Using Asserts in Python

via Pluralsight

Overview

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
This course teaches how to use asserts in Python to validate data cleanliness. Learn to compare indexes, Series, and DataFrames, compose quantitative and logical tests, and apply them for data cleaning.

Inaccurate or inconsistent data can lead to poor business decisions. However, manually validating data can be time-consuming and error-prone. Tools and technologies available today can help automate the process of validating and cleaning data. In this course, Validate Data Cleanliness Using Asserts in Python, you will learn how to use asserts in Python to validate the cleanliness of data. First, you will be introduced to the numpy.testing module and how it can be used to verify data tidiness. Next, you will discover how to verify the equality of two indexes, two Series, and two DataFrames using the various testing functions available in the numpy.testing module. Finally, you will explore how to compose quantitative and logical tests for clean data using asserts and apply them for data cleaning. When you are finished with this course, you will have the skills needed to use asserts to validate data cleanliness in Python.

Syllabus

  • Course Overview 1min
  • Validating and Verifying Data Using Asserts 28mins
  • Using Assert-based Tests for Data Cleaning 12mins

Taught by

Pinal Dave

Reviews

4.8 rating at Pluralsight based on 10 ratings

Start your review of Validate Data Cleanliness Using Asserts in Python

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.