Upgrade Propensity Model

Overview

This project focuses on predicting customer upgrade propensity based on provided data. The task includes data exploration, cleaning, feature engineering, model building, and performance evaluation. The project utilizes logistic regression, random forest, and XGBoost algorithms to identify the most effective model for predicting customer upgrades.

Problem Statement

The goal is to predict which customers are most likely to upgrade their services. Using the provided dataset, the task is to build a model that identifies customers with the highest propensity to upgrade, enabling targeted marketing and retention strategies.

Task List

Data Exploration, Cleaning, and Feature Engineering:
- Explore the dataset to understand its structure and content.
- Clean the data by addressing missing values, outliers, and inconsistencies.
- Engineer features to improve model performance.
Explanatory Analysis:
- Analyze the data to uncover key insights and trends.
- Document key features and their importance in predicting upgrades.
Modeling/Training:
- Implement and train logistic regression, random forest, and XGBoost models.
- Evaluate model performance using relevant metrics, with a focus on recall due to the classification nature of the problem.
Documentation and Reporting:
- Summarize findings and provide actionable insights for business strategies based on model performance.

Project Files

DATA.xlsx: Contains sample data used for analysis and model building.
UPGRADES.xlsx: Additional data related to customer upgrades.
CustomerUpgradePrediction.ipynb: Jupyter notebook with code for data exploration, cleaning, feature engineering, and model training.
result.csv: Output file containing the results of the model predictions.
README.md: Documentation for the project.

Data Exploration, Cleaning, and Feature Engineering

Data Exploration:
- Conducted exploratory data analysis to understand the dataset’s structure and content.
- Identified key features and their distributions.
Data Cleaning:
- Addressed missing values through imputation or removal.
- Handled outliers and corrected any inconsistencies.
Feature Engineering:
- Created features such as average spend, voice off-net duration, number of upgrades, and calls to customer care counts.
- Applied transformations to enhance model performance.

Explanatory Analysis

Key Insights:
- Important features for predicting customer upgrades include AVERAGE_SPEND, VOICE_OFFNET_DUR_L3M, NUM_OF_UPGRADES, and CALLS_CARE_CNTS_L6M.
- Coefficients for these features indicate their influence on predicting upgrades, with positive coefficients predicting class 1 and negative coefficients predicting class 0.

Modeling/Training

Model Building:
- Implemented logistic regression, random forest, and XGBoost algorithms.
- Focused on recall as the primary performance metric to maximize the detection of upgrade propensity.
Model Evaluation:
- Logistic Regression: Achieved a recall of 0.66, indicating the model successfully classifies 66% of the upgrade instances. Despite a lower precision of 0.23 and an F1-score of 0.35, the model's recall performance is deemed acceptable for this task.
- Random Forest and XGBoost: Both models showed signs of overfitting compared to logistic regression.

Key Findings and Recommendations

Feature Importance:
- AVERAGE_SPEND, VOICE_OFFNET_DUR_L3M, NUM_OF_UPGRADES, and CALLS_CARE_CNTS_L6M are crucial for predicting customer upgrades. These features should be prioritized in any customer targeting or marketing strategy.
Model Performance:
- Logistic regression provides a good balance between recall and complexity, making it the preferred model despite its lower precision and F1-score.
- Consider using logistic regression for predicting customer upgrades, given its performance and interpretability.
Business Recommendations:
- Focus on customers with high values in key features such as average spend and recent interactions with customer care.
- Use the model's predictions to tailor marketing strategies and retention efforts, aiming to convert high-propensity customers.

How to Run the Project

Install Dependencies:
- Ensure all required libraries are installed. Use the requirements.txt file if available or install packages manually.
Run the Notebook:
- Open CustomerUpgradePrediction.ipynb in Jupyter Notebook.
- Execute the cells to perform data exploration, cleaning, feature engineering, and model training.
View Results:
- Check result.csv for the model’s predictions and output.

Author

Ramesh S

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Upgrade Propensity Model

Overview

Problem Statement

Task List

Project Files

Data Exploration, Cleaning, and Feature Engineering

Explanatory Analysis

Modeling/Training

Key Findings and Recommendations

How to Run the Project

Author

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
CustomerUpgradePrediction.ipynb		CustomerUpgradePrediction.ipynb
README.md		README.md
result.csv		result.csv

rameshs-data/Customer_Upgrade_Prediction

Folders and files

Latest commit

History

Repository files navigation

Upgrade Propensity Model

Overview

Problem Statement

Task List

Project Files

Data Exploration, Cleaning, and Feature Engineering

Explanatory Analysis

Modeling/Training

Key Findings and Recommendations

How to Run the Project

Author

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages