Overview
Embark on a comprehensive live coding session that guides you through the process of creating a flight delay dataset using Python and Pandas on Kaggle. Learn how to pull airline flight data, explore existing datasets, and navigate public flight information sources. Gain hands-on experience in data cleaning, feature understanding, and visualization techniques using Plotly Express. Follow along as the instructor encounters and resolves real-time challenges, including dealing with partial data and downloading additional information. By the end of this tutorial, you'll have created a Kaggle dataset, generated visualizations, and gained valuable insights into the intricacies of working with flight delay data.
Syllabus
Intro
Existing Kaggle Datasets
Finding Public Flight Data
Checking the CSV in IPython
Creating a Kaggle Dataset
Dalle2 Dataset Image
Data Exploration
Understanding Features
More Data Cleaning
Plotly Express Plot
Plotting Cancellation Rates
Realizing it's Partial Data
Downloading More Data
Data Filtering
Final Plot Works
Bye Bye
Taught by
Rob Mulla