Skip to content

This project aim to predict the possibility of a hotel customer to cancel a booking he/she has made. I was using booking data from a hotel in Portugal that is used to take insight and design a simple machine learning model.

Notifications You must be signed in to change notification settings

ReyhanR/Hotel-Cancellation-Booking-Prediction-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Purwadhika Final Project - Hotel Cancellation Booking Prediction

By : Reyhan Ryanafi

Project Brief Description


Hotel industry is one industry that can utilize data and machine learning in helping management to increase revenue. one of them by using machine learning as a tool to take preventive action against the possibility of a consumer to cancel their bookings.

So in this project I was using booking data from a hotel in Portugal that is used to take insight and design a simple machine learning model.

  • Dataset Source : Kaggle.com

  • Dataset Info : Source

Project Objectives
  • Create and choose one of several machine learning models, to predict hotel cancellation bookings.
  • Create a web application using the flask, from a model that was created and selected previously.

The Dataset


dataset

Brief Model Selection

  • Models Score
Model Score (Train Set) Model Score (Test Set)
model score model score
  • Models ROC Curve and AUC Score

ROC Curve

  • Models ROC Curve and AUC Score (Normal Data VS Oversampling)

ROC Curve oversampling

  • Gradient Boosting Precision and Recall Score (Normal Data VS Oversampling)
Precision Score Recall Score
model score recall score

The final choice of this classification model is that I chose the Gradient Boosting Classifier without oversampling the data as my model.

The Web Apps


Homepage


homepage

Prediction


data input

Feature Description
Country Country of origin. Categories are represented in the ISO 3155–3:2013 format.
Deposit Type Indication on if the customer made a deposit to guarantee the booking. This variable can assume three categories No-Deposit, Non-Refund and Refundable
Lead Time Number of days that elapsed between the entering date of the booking into the PMS and the arrival date.
Total of Special Request Number of special requests made by the customer (e.g. twin bed or high floor).
Average Daily Rate(ADR) Calculated by dividing the sum of all lodging transactions by the total number of staying nights.
Market Segment Market segment designation. In categories, the term “TA” means “Travel Agents” and “TO” means “Tour Operators”.
Arrival Date Day of Month Day of the month of the arrival date.
Arrival Date Week Number Week number of the arrival date.
Stays In Week Nights Number of week nights (Monday to Friday) the guest stayed or booked to stay at the hotel. Calculated by counting the number of week nights from the total number of nights.

Prediction Result


Result

Visualization Page


visualization

About

This project aim to predict the possibility of a hotel customer to cancel a booking he/she has made. I was using booking data from a hotel in Portugal that is used to take insight and design a simple machine learning model.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages