An analysis of the shaving blades market in the United States. The project was done as part of Predictive Analysis class at the University of Texas at Dallas. The final presentation for the report can be found in project presentation. The entire summary of the project can be found in the project report.
- General info
- Screenshots
- Technologies and Tools
- Setup
- Code Examples
- Features
- Status
- Inspiration
- Contact
The dataset consists of sales data for various brands of shaving blades sold in the United States. We have analyzed the data from the standpoint of one of the prime blade manufacturer "BIC" to provide insights that can help them increase their revenue. Our analysis is focussed on providing answers to four questions using the data provided.
The entire presentation of the project can be found in project presentation and entire research findings can be found in project report.
- SAS
- Microsoft Excel
The data used for the analysis and its metadata can be found in the dataset folder. The entire code has been provided here can be used in the latest version of SAS to run replicate various operations provided and see the various outputs generated. A detailed explanation of the various operations and the interpretation of the outputs can be found in the project report.
Some examples of usage:
/*Investigating Product type = Razor Blades*/
proc freq data=a4;
where product_type = 'RAZOR BLADE';
table L2*product_type;
run;
proc freq data=a4;
table product_type;
run;
data a4;
set a4;
drop product_type;
run;
data a4;
set a4;
if L2 = 'CARTRIDGES' then type = 0;
else type = 1;
run;
proc format;
value type 0 = 'Cartridges';
value type 1 = 'Disposable';
run;
/*Running Logistic Regression*/
/*Using data=L1*/
proc print data=L1 (obs=10); run; quit;
/*Testing for Non-linearity*/
data L1;
set L1;
total_weightedprice2 = total_weightedprice*total_weightedprice;
run;
data L1;
set L1;
vol_eq2 = vol_eq*vol_eq;
run;
/*Business Logistic Regression*/
title1 'Refining Business Logistic Regression for Loyalty using Stepwise Selection';
proc logistic data=L1 outmodel=betas1 desc;
class outlet agm agf ethnicity status inc_status marital kids status / param=ref;
model loyal_customer = outlet agm agf ethnicity status inc_status marital kids status / selection=stepwise slentry= 0.3 slstay=0.2 expb;
output out=preds predprobs=individual;
run;
/*k-means clustering*/
/*the difference between hierachical - k-means is an iterative o(n) operation; hierachical is looking based on distances o(n2) operation*/
/*proc fastclus data=clus3 maxclusters=3 maxiter=20 list;
id panid;
var recency_score frequency_score monetary_score;
run;
proc fastclus data=clus3 maxclusters=4 maxiter=20 list;
id panid;
var recency_score frequency_score monetary_score;
run;
proc fastclus data=clus3 maxclusters=5 maxiter=20 list;
id panid;
var recency_score frequency_score monetary_score;
run;*/
The entire code can be found here.
We have answered four questions as follows:
- Effects of Advertising on unit sales
- Effects of Advertising on product switching
- Effects of Demographics on product loyalty
- Effects of Product Characteristics on product loyalty
Please refer to project presentation for detailed answers to the questions.
Project is: finished
The color theme for the project presentation is inspired by the brand color of "BIC", our assumed client in the analysis.
Created by me with my teammates Weiyang Sun, Gongzhan Hua, and Mengyun Xia.
If you loved what you read here and feel like we can collaborate to produce some exciting stuff, or if you just want to shoot a question, please feel free to connect with me on email, LinkedIn, or Twitter. My other projects can be found here.