nyc-taxi

Goal

This project is an entry for the Kaggle competition to predict NYC taxi travel times. The project report is in capstone.pdf.

Software and Libraries

  • Python 3
  • Scikit-learn: Python’s open source machine learning library
  • XGBoost: Python package for the XGBoost model

Datasets

The primary training dataset (train.csv) and test dataset (test.csv) are available at the Kaggle competition website.

The weather dataset is at: weather_data_nyc_centralpark_2016.csv.

The datasets for the fastest routes from OSRM can be found here. The files are fastest_routes_train_part_1.csv, fastest_routes_train_part_2.csv, and fastest_routes_test.csv.
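
Because the training routes are split across two files, they need to be combined and matched to the trips before modeling. The following is a minimal sketch of one way to do that using only the Python standard library. It assumes that the trip files and the route files share an `id` column, which this README does not confirm; `read_rows` and `join_routes` are hypothetical helper names, not code from this repository.

```python
import csv


def read_rows(path):
    """Read a CSV file into a list of dicts keyed by the header row."""
    with open(path, newline="") as f:
        return list(csv.DictReader(f))


def join_routes(trips, route_parts, key="id"):
    """Left-join route rows onto trip rows by `key`.

    `trips` is a list of dicts. `route_parts` is a list of such lists,
    for example the rows of part 1 and part 2 of the training routes.
    Trips without a matching route are kept unchanged.
    The `key` column name is an assumption about the file layout.
    """
    # Index every route row from every part by its key.
    routes_by_key = {}
    for part in route_parts:
        for row in part:
            routes_by_key[row[key]] = row

    joined = []
    for trip in trips:
        merged = dict(trip)  # copy so the input rows are not mutated
        merged.update(routes_by_key.get(trip[key], {}))
        joined.append(merged)
    return joined
```

With the files downloaded into the working directory, a call such as `join_routes(read_rows("train.csv"), [read_rows("fastest_routes_train_part_1.csv"), read_rows("fastest_routes_train_part_2.csv")])` would return one combined row per trip. All values remain strings, so numeric columns still need converting before they are passed to a model.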

Visualization Image

The final visualization image for the project report is visualization.pdf (.png). It is best viewed zoomed in.