tezzytezzy/us-flight-delay

US Flight Delay

Objective

Demonstrate data munging on US flight delay data through the following actions:

1️⃣ Distribution assessment: check for, and eliminate, samples with Not a Number (NaN) fields
2️⃣ Outlier elimination via Tukey fences and z-scores
3️⃣ Categorical value transformation and encoding
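
The three actions above can be sketched on a toy frame as follows. This is a minimal illustration under stated assumptions, not the repository's actual code: the column names `carrier` and `dep_delay` and the sample values are hypothetical and are not taken from flight_data.csv, the fence multiplier 1.5 and the z-score cut-off 3 are the conventional defaults, and `pd.get_dummies` stands in for whichever encoder the project uses.

```python
import pandas as pd

# Hypothetical toy data; column names and values are illustrative only.
df = pd.DataFrame({
    "carrier": ["AA", "DL", "AA", "UA", "DL", "UA", "AA", "DL"],
    "dep_delay": [5.0, 7.0, None, 6.0, 8.0, 4.0, 500.0, 6.0],
})

# 1. Count NaN fields per column, then drop the incomplete rows.
nan_counts = df.isna().sum()
clean = df.dropna()

# 2a. Tukey fences: keep values inside [Q1 - 1.5 * IQR, Q3 + 1.5 * IQR].
q1, q3 = clean["dep_delay"].quantile([0.25, 0.75])
iqr = q3 - q1
tukey = clean[clean["dep_delay"].between(q1 - 1.5 * iqr, q3 + 1.5 * iqr)]

# 2b. Z-scores: keep values within 3 sample standard deviations of the mean.
z = (clean["dep_delay"] - clean["dep_delay"].mean()) / clean["dep_delay"].std()
zscore = clean[z.abs() <= 3]

# 3. One-hot encode the categorical column (assumed stand-in encoder).
encoded = pd.get_dummies(tukey, columns=["carrier"])
```

On this toy frame the Tukey fences drop the 500-minute delay, while the z-score rule keeps it: with only seven rows the extreme value inflates the standard deviation so much that no point can reach |z| > 3. The z-score filter is therefore better suited to large samples such as the full data set.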

Installation

pandas                    0.25.2           py37he6710b0_0
pyarrow                   0.13.0           py37he6710b0_0
category-encoders         2.1.0            pypi_0            pypi

Data Source

flight_data.csv (393 MB)

Reference

Python Machine Learning by Wei-Meng Lee (ISBN: 978-1-119-54563-7)