- Import data sets
- Clean and prepare data for analysis
- Manipulate pandas DataFrame
- Summarize data
- Build machine learning models using scikit-learn
- Build data pipelines
- Data Analysis libraries: will learn to use Pandas DataFrames, Numpy multi-dimentional arrays, and SciPy libraries to work with a various datasets. We will introduce you to pandas, an open-source library, and we will use it to load, manipulate, analyze, and visualize cool datasets. Then we will introduce you to another open-source library, scikit-learn, and we will use some of its machine learning algorithms to build smart models and make cool predictions.