Python can be a powerful tool for data preparation. In this course, we will quickly cover how to connect to various database types. Then, we will jump into using the pandas Python package for data preparation. We will look at examples of cleansing missing and outlying data as well as data visualizations and exploration. In addition to the pandas package, we will also look at preprocessing data for machine learning using the scikit-learn Python package.Before beginning this course, you should have a strong knowledge of Python and data approaches. Check out the Prerequisite and Related Courses lesson in the Introduction section for a starting point.
Overview
Syllabus
- Introduction
- Database Access
- Data Visualization
- Data Cleansing
- Preprocessing Data for Machine Learning
- Conclusion
Taught by
David Thomas