This repository contains a comprehensive analysis and modeling of daily visitors to four different restaurants, utilizing various statistical and machine-learning techniques. The dataset, RestaurantVisitors.csv
, tracks visitor counts and holiday indicators over time.
The primary goal of this project is to predict daily restaurant visitors using various modeling techniques, comparing their performance based on the Root Mean Squared Error (RMSE) as the primary metric.
- Random Walk
- Random Walk with Drift
- ARIMA (AutoRegressive Integrated Moving Average)
- SARIMA (Seasonal ARIMA)
- AR (AutoRegressive)
- MA (Moving Average)
- ARIMAX (ARIMA with exogenous variables)
- SARIMAX (Seasonal ARIMA with exogenous variables)
- Exponential Smoothing
- Simple Exponential Smoothing
- Holt’s Linear Trend Model
- Holt-Winters Seasonal Model
- VAR (Vector AutoRegression)
- VARMA (Vector AutoRegressive Moving Average)
- VARMAX (Vector AutoRegressive Moving Average with exogenous variables)
- ETS (Error, Trend, Seasonality) Models
- Linear Regression
- Support Vector Regression (SVR)
- Decision Trees Regressor
- Random Forest
- Gradient Boosting
- XGBoost (Extreme Gradient Boosting)
- LightGBM (Light Gradient Boosting Machine)
- CatBoost
- K-Nearest Neighbors (KNN)
- Vanilla RNN(Recurrent Neural Network)
- LSTM (Long Short-Term Memory)
- GRU (Gated Recurrent Unit)
- LSTM-CNN (CNN combined with LSTM)
- Temporal Convolutional Networks (TCN)
- Transformer Models
- Transformers (e.g., BERT, GPT)
- Time Series Transformers
- Seq2Seq Models
- FBProphet (Facebook's forecasting tool)
- DeepAR (Deep Autoregressive Models for Forecasting)
- Temporal Fusion Transformers (TFT)
- Matrix Factorization Models (for multivariate time series)
The models were evaluated based on the RMSE as the primary metric to identify the best-performing model for predicting restaurant visitors.