This repository contains Python code for analyzing sales data during the Diwali season. The dataset encompasses various customer demographics, purchase details, and product categories.
Ensure you have the necessary libraries installed:
- NumPy
- pandas
- Matplotlib
- Seaborn
The dataset (Diwali Sales Data.csv) comprises 11251 entries and 15 columns, including:
User_ID: Unique customer identifierCust_name: Customer's nameProduct_ID: Unique product identifierGender: Customer's genderAge Group: Age group of the customerAge: Customer's ageMarital_Status: Marital status of the customerState: Customer's stateZone: State's zoneOccupation: Customer's occupationProduct_Category: Category of the purchased productOrders: Number of ordersAmount: Amount spent
- Female customers dominate the sales, with a notably higher purchasing power compared to males.
- The age group between 26-35 years represents the majority of buyers, predominantly comprising females.
- Uttar Pradesh, Maharashtra, and Karnataka emerge as the top states both in terms of total orders and sales revenue.
- Married females exhibit a stronger purchasing propensity, reflecting higher spending patterns.
- The IT, Healthcare, and Aviation sectors display the highest customer engagement levels.
- Food, Clothing, and Electronics emerge as the most sought-after product categories.
The analysis indicates that married females aged 26-35, hailing from Uttar Pradesh, Maharashtra, and Karnataka, and working in the IT, Healthcare, and Aviation sectors, are the primary contributors to sales during the Diwali season. Moreover, these customers show a distinct preference for products falling under Food, Clothing, and Electronics categories.
To execute the analysis, ensure the dataset (Diwali Sales Data.csv) resides within the same directory as the Python script. Execute the script in any Python environment supporting the aforementioned libraries.
For any inquiries or suggestions, please reach out.