Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Pluralsight

Data Science with Python: Munging, Outliers, and Feature Engineering

via Pluralsight

Overview

Learn to preprocess and engineer features from raw data using Python. This course will teach you to handle missing values, detect outliers, create relevant features, and optimize datasets for effective data science projects.

Raw data often contains missing values, outliers, and irrelevant features that can hinder the success of data science projects. In this course, Data Science with Python: Munging, Outliers, and Feature Engineering, you'll gain the ability to preprocess and engineer features from raw data effectively using Python. First, you'll explore techniques for handling missing data and imputing missing values to ensure your datasets remain informative and reliable for analysis. Next, you'll discover methods for detecting and treating outliers using statistical and machine learning approaches, developing strategies to handle them appropriately. Finally, you'll learn how to create meaningful features from raw data, transform categorical variables, and optimize datasets for efficient analysis by dropping irrelevant columns and reordering them for better readability and processing. When you're finished with this course, you'll have the skills and knowledge of data munging and feature engineering needed to tackle real-world data science problems, transform raw data into clean, informative datasets, and enhance the performance of your machine learning models.

Syllabus

  • Course Overview 2mins
  • Data Munging 26mins
  • Feature Engineering 18mins

Taught by

Dr. Ali Feizollah

Reviews

Start your review of Data Science with Python: Munging, Outliers, and Feature Engineering

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.