Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Coursera

Advanced Data Analysis and Visualization with Pandas

Packt via Coursera

Overview

This advanced Pandas course delves deep into date-time manipulation, covering Timestamps, DatetimeIndex objects, and pd.date_range for effective time series handling. - You'll master techniques like using the dt attribute and DateOffset objects for arithmetic operations and timedeltas. - Learn essential input-output operations, including exporting DataFrames to CSV and Excel files using openpyxl, and seamless file imports. - Enhance data presentation with Matplotlib for basic visualizations, customizing aesthetics with templates, and creating bar and pie charts. Ideal for data analysts, scientists, and Python enthusiasts with intermediate to advanced Pandas skills, this course enriches data workflows and visualization capabilities.

Syllabus

  • Working with Dates and Times
    • In this module, we will explore how to handle dates and times in Pandas, starting with an introduction to the concepts and a review of Python's datetime module. You will learn to utilize Timestamp and DatetimeIndex objects for manipulating date-time data and create ranges of dates using the pd.date_range function. We will cover accessing date and time properties using the dt attribute, selecting DataFrame rows based on date-time indexes, and performing time-based arithmetic operations with the DateOffset object. Additionally, you'll master specialized date offsets and understand the concept of timedeltas for representing durations of time.
  • Input and Output
    • In this module, we will explore input and output operations in Pandas, starting with an overview of essential data exchange techniques. You will learn how to export DataFrames to CSV files, a common format for data sharing. We will guide you through installing the openpyxl library to enable reading and writing Excel files in Pandas. Additionally, you'll master importing data from Excel files into Pandas and exporting DataFrames to Excel for effective data reporting and sharing.
  • Visualization
    • In this module, we will delve into data visualization techniques using Pandas and Matplotlib. You will begin with installing the Matplotlib library, a crucial tool for creating diverse visualizations in Python. We will explore the plot method in Pandas for basic line plots and demonstrate how to modify plot aesthetics using templates. Additionally, you'll learn to create bar charts for comparing groups or tracking changes over time, and construct pie charts to effectively display proportions of a whole.
  • Options and Settings
    • In this module, we will explore how to customize Pandas' behavior and output through various options and settings. You will learn to change Pandas options using attributes, adjusting settings to suit different analysis needs. We will also cover how to change options using functions, providing greater flexibility and control over your data analysis environment. Additionally, you'll understand the precision option to control the output display precision of floating-point numbers, ensuring data clarity and readability.
  • Conclusion
    • In this module, we will wrap up the course by summarizing the key concepts and techniques you've learned. We'll reinforce the comprehensive skill set you have acquired in data analysis with Pandas and Python, providing final insights and encouragement for your continued learning and application of these skills in real-world scenarios.

Taught by

Packt

Reviews

Start your review of Advanced Data Analysis and Visualization with Pandas

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.