Get a practical, project-based look at using Python for data analysis.
Overview
Syllabus
Introduction
- Introduction
- Prerequisites
- CoderPad tour
- The Who, What, Where & Why
- Who
- What
- Where
- Why
- Case Study Introduction
- Define the problem (the three Ds)
- Get Familiar: Domain
- Domain: Applied
- Get Familiar: Data
- Get Familiar: Deliverable
- Set the Right Expectations
- Getting Setup in Codespaces
- Read from CSV
- General Cleaning Techniques
- General Cleaning Techniques: High Level Checks
- General Cleaning Techniques: Missing Values
- Data Transformations: Binning
- Solution: Prepare data for analysis
- Intro to EDA
- Summary Statistics
- Distributions: Histograms
- Data Transformations: Normalization and Log
- Other Distribution Types
- Data Visualizations: Comparing Categories
- Data Visualization: Data Tables
- Data Visualization: Relationships
- Create a ridgeplot
- Solution: Building summary table using pandas
- Visual Best Practices: Part 1
- Visual Best Practices: Part 2
- Leverage Exploratory and Explanatory Visualizations
- Choose a medium
- Storyboarding
- Whats in a story?
- Putting it all together
- Solution: Visualizing with Matplotlib
- Wrap-up and next steps
Taught by
Sarah Om