Skip to content

Exploring E-commerce Transactions: Data Cleaning, Cohort Analysis, and Business Insights generation using Alteryx Designer Core.

License

Notifications You must be signed in to change notification settings

SQLicious/Alteryx-Project-Ecommerce-Transaction-Cohort-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

70 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Project Origin

In September 2023, guided by Alteryx expert Albert Bellamy, I undertook the 30-day Alteryx challenge, culminating in earning the Alteryx Designer Core certification. in just 15 days.

  • This project showcases my two critical capabilities:
    • Rapid Learning: Demonstrates my ability to swiftly master new tools, an asset I bring to any employer.
    • Practical Application: Proves I can effectively employ these skills to solve business challenges and educate others, as evidenced by the comprehensive documentation provided in this GitHub repository.

Project Overview

Embark on a informational journey through my Alteryx project,designed to enhance e-commerce operations. This project has three vital stages: Data Cleanup, Cohort Analysis, and Insights generation. Together, we'll unravel the nuances of online shopping, driving strategic decisions to maximize revenue.

Quick Alteryx Workflow Demo Before Diving Into the Project Details

Watch the Project Demo Video

Table of Contents

  1. Dataset: transaction_dataset.csv
  2. Data Cleanup
  3. Cohort Analysis
  4. Insights Generation
  5. Project Inspiration
  6. Methodology
  7. Upcoming Additions

1. Dataset{#dataset}

The project utilizes the trasaction_dataset.csv dataset, which contains detailed information about customer transactions, including transaction IDs, product details, customer IDs, dates, and order statuses, among others. This dataset was sourced from Hicounsellor.com/projects titled "Analyzing E-commerce Transactions". This CSV contains 20,000 rows across 13 columns.

Data Profile

The dataset provides the following key attributes:

  • Transaction ID
  • Product ID
  • Customer ID
  • Transaction Date
  • Online Order (True/False)
  • Order Status (Approved/Unapproved)
  • Brand
  • Product_Line
  • List Price
  • Standard Cost
  • Product First Sold Date

Phases

This Alteryx workflow is designed to perform comprehensive analysis on customer transactions using the provided dataset. The project encompasses three main phases: 1.Data Cleaning 2.Cohort Analysis 3.Business Insights Generation.

2. Phase 1- Data Cleanup {#data-cleanup}

In this phase, the following steps were performed:

  1. Filtered out undesired columns: Identified and eliminated extraneous columns not necessary for analysis such as "product_class" and "product_size using SELECT tool
  2. Data Type Conversion: Ensured data types were compatible for analysis using SELECT tool
  3. **Renamed Columns:**Revamped column names to enhance clarity using SELECT tool
  4. Quality Assurance: Checked for data integrity using temporary BROWSE tool

3. Phase 2 - Cohort Analysis {#cohort-analysis}

This phase involved the following steps:

  1. Customer Segmentation: Grouped customers into cohorts based on their transaction behavior.
  2. Cohort Metrics: Calculated key metrics (e.g., retention rate, average spend) for each cohort.
  3. Tabular Representation: Generated Cohort Retention rate to aid in cohort comparison.

For a detailed view of the Cohort Analysis process, refer to the Alteryx Workflow (.yxmd) ...

4. Phase 3 Insights Generation {#insights-generation}

The final phase encompassed building worflows to derive the following:

  1. Distinct Brands: Identified unique brands available in the dataset.
  2. Unique Customers: Counted the number of unique customers who made transactions.
  3. Approved vs. Unapproved Orders: Quantified approved and unapproved transactions.
  4. Average List Price by Product Line: Listed top product lines with the highest average list price.
  5. Top 5 Profitable Products: Identified products with the highest profit margin.
  6. Customer Spending Analysis: Evaluated customer transactions, including total spend and average profit per transaction.
  7. Top 5 Product Lines by Revenue Contribution: Identified product lines with the highest total revenue contribution.
  8. Customers Who Purchased from All Product Lines: Identified customers with transactions in all product lines.
  9. Total Sales Amount by Brand: Calculated the total sales amount for each brand.
  10. Top 5 Products with Highest Profit Margin: Identified the top 5 products with the highest profit margin.
  11. Customer Transaction Metrics: Calculated three metrics for each customer. Total number of transactions. Total amount spent, Average profit per transaction
  12. Top Product Lines by Revenue Percentage: Identified product lines contributing the highest percentage of revenue.
  13. Customers Engaged with Entire Product Range: Identified customers who made transactions in all distinct product lines.

For a detailed view of the Business Insights Generation process, refer to the Alteryx Workflow (.yxmd).

5. Project Inspiration {#project-inspiration}

This project draws inspiration from Hicounsellor's 'Analyzing E-commerce Transactions:Data Cleaning, Cohort Analysis, and SQL' that recommends a Python+SQL approach. However, I've taken a unique approach by exclusively utilizing Alteryx Designer. This showcases the efficiency of low-code solutions such as Alteryx Designer, offering a fresh perspective on streamlining processes and extracting actionable insights, even by a non-technical business user.

6. Methodology {#methodology}

In this project, I leveraged the skills and knowledge acquired during the Alteryx Designer Core certification and the pursuit of four microcredentials. This allowed me to harness a robust set of Alteryx tools, putting them to work in a cohesive workflow.

Throughout this project, I harnessed the full potential of Alteryx, seamlessly executing all phases. Leveraged a range of powerful tools from each tool palette

  • In/Out Tools:

    • Input
    • Output
  • Preparation Tools:

    • Data Cleansing
    • Filter
    • Formula
    • Multi-Field
    • Sample
    • Select
    • Sort
    • Unique
  • Join Tools:

    • Append
    • Join
  • Transform Tools:

    • Cross-Tab
    • Summarise

I ensured an efficient data transformation process. This would have taken me 3 weeks (120 man hours using the SQL and Python approach) and i built this worklow in just 3 days. This comprehensive utilization of Alteryx highlights its remarkable capabilities in ETL( Extract - Load - Transformation)

7. Upcoming Additions {#upcoming-additions}

I plan to automate the generation of a comprehensive PDF report combining Phase 2 cohort analysis results with answers to 13 business questions. This report will be sent automatically to a designated list of recipients. Additionally, I'm exploring the possibility of exporting Phase 2 cohort analysis data to Tableau using the Tableau output tool.

Project Created By: [Roopmathi Gunna] Date: [10-06-2023]


About

Exploring E-commerce Transactions: Data Cleaning, Cohort Analysis, and Business Insights generation using Alteryx Designer Core.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published