This report analyzes a dataset consisting of sales transactions to uncover key insights and patterns. The dataset includes various attributes such as order details, product types, sales amounts, customer information, and deal sizes. The purpose of this review is to identify trends and significant metrics that can inform business strategies and decisions.
The dataset was made available by HNG, link and consists of 2823 rows and 25 columns.
Before conducting the analysis, a copy of the dataset was made and imported into Excel where several data-cleaning steps were performed to ensure the dataset was accurate and ready for analysis:
- Checked for Duplicates: Verified the dataset and found no duplicate records.
- Checked Data Types: Ensured all columns had appropriate data types.
- Merged Name Columns: Combined the
CONTACTLASTNAME
andCONTACTFIRSTNAME
columns into a singleFULLNAME
column. - Renamed Columns: Renamed
PRICEEACH
toUNIT_PRICE
,SALES
toREVENUE
,QTR_ID
toQUARTER
, andYEAR_ID
toYEAR
. - Converted Month ID: Transformed
MONTH_ID
intoMONTH_NAME
using the first three characters of the month name. - Quarter Column Prefix: Added the prefix
Q-
to theQUARTER
column for better understanding. - Merged Address Columns: Combined
ADDRESSLINE1
andADDRESSLINE2
into a singleADDRESS
column. - Deleted Irrelevant Columns: Removed columns with blanks such as
STATE
andPOSTALCODE
as they were irrelevant to the analysis.
The cleaning process made the data more concise, focusing on essential fields and ensuring data integrity for more accurate analysis.
POWER QUERY EDITOR
The analysis of the dataset reveals the following insights:
-
Total Revenue and Orders:
- The total revenue generated is $10,032,628.85.
- A total of 2,823 orders were recorded.
- The average quantity per order is 35 units.
-
Product Popularity:
- Classic Cars are the most ordered product category with 967 orders.
- Trains are the least popular product with only 77 orders.
-
Geographical Distribution:
- The USA leads in the number of orders with a total of 1,004 orders.
- Ireland has the lowest number of orders, amounting to 16.
-
Revenue by Year:
- The year 2004 generated the highest revenue of $4,474,162.60, accounting for 47% of the total revenue.
- The year 2003 had the lowest revenue at $1,791,486.71, accounting for only 18% of total revenue
-
Monthly Revenue:
- November is the highest revenue-generating month with $2,118,885.67.
- June records the lowest revenue with $454,756.78.
-
Deal Sizes:
- Medium-sized deals are the most common with 1,384 orders.
- Large deals are the least frequent with only 157 orders.
THE DASHBOARD:
The analysis highlights that Classic Cars are the most popular product category, and the USA is the primary market for sales. The year 2004 and the month of November stand out as the most profitable periods across all years. Medium-sized deals are predominant, suggesting a focus on this segment could be beneficial.
- Customer Segmentation: Analyzing the customer base to identify key demographics and purchasing patterns.
- Product Performance: Investigating the factors contributing to the popularity of Classic Cars and underperformance of Trains.
- Seasonal Trends: Examining monthly sales data in more detail to understand seasonal influences.
- Deal Size Optimization: Exploring strategies to increase the number of large deals.
These insights can help tailor marketing strategies, optimize inventory management, and enhance customer targeting efforts.