Skip to content

Determining the factors affecting Walmart sales and analyzing the impact of markdowns around holidays on the sales; Predicting the sales accurately with regression algorithms like Linear regression, Decision Trees and Random Forest

Notifications You must be signed in to change notification settings

mishika12/Regression-Walmart_Sales_Data_Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Walmart retail sales data analysis and prediction

Introduction

One of the leading retail stores in the US, Walmart is facing a challenge due to unforeseen demands and runs out of stock some times, due to the inappropriate machine learning algorithm. An ideal ML algorithm will predict demand accurately and ingest factors like economic conditions including CPI, Unemployment Index, etc. Walmart runs several promotional markdown events throughout the year. These markdowns precede prominent holidays, the four largest of all, which are the Super Bowl, Labour Day, Thanksgiving, and Christmas. The weeks including these holidays are weighted five times higher in the evaluation than non-holiday weeks.

Business Problem

The objective is to determine the factors affecting the sales and to analyze the impact of markdowns around holidays on the sales.

Methodology

In this project, we undertake the following two tasks:

a) Answer the following questions:

  1. Which store has maximum sales?
  2. Which store has maximum standard deviation i.e., the sales vary a lot. Also, find out the coefficient of mean to standard deviation?
  3. Which store/s has good quarterly growth rate in Q3’2012?
  4. Some holidays have a negative impact on sales. Find out holidays which have higher sales than the mean sales in non-holiday season for all stores together.
  5. Provide a monthly and semester view of sales in units and give insights.

b) Build the 3 prediction models to forecast demand and evaluate the most accurate model:

  1. Linear Regression
  2. Decision Trees
  3. Random Forest

The ipynb and html versions of the code are in 'Notebook'.

About

Determining the factors affecting Walmart sales and analyzing the impact of markdowns around holidays on the sales; Predicting the sales accurately with regression algorithms like Linear regression, Decision Trees and Random Forest

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published