This repository hold all the code, data and figures for the analysis and machine learning code of the New York Yellow Taxi Dataset
This project is accompanied with a blog post on my website that goes through the analysis and the results it can be found here 📖
├── Data
│ ├── Data Description <- Contains the PDF for the data dictionary with explanation for each column (not all though).
│ └── Pre-processed Data <- CSV files of the sampled and cleaned data (train and test).
|
│
├── Figures <- Generated graphics and figures to be used in reporting. Segmented by the different questions addressed
│
├── Source Code <- Jupyter notebooks for the project segmented by the project phase (collection, cleaning, exploration, etc.)
│ ├── Presentation Notebook <- This notebook is for presenting the finding using the RISE Jupyter extension more on it on the Readme file in this directory
│ └── rf_model <- Random Forest model for predicting the pickup density
│
└── G02 E-Above Arebi.pdf <- PDF file for the project presentation