This repository contains code and analysis for sales data. The analysis is divided into several sections, and each section is explained below:
- Getting Started
- Exploratory Data Analysis (EDA)
In this section, we import the required Python libraries to perform data analysis and visualization.
Data loading is a crucial step in any data analysis project. Here, we load the dataset that we'll be working with.The Dataset The dataset contains 12 CSV files containing sales details for the 12 months of the year 2019. Each file contains anywhere from around 9000 to 26000 rows and 6 columns. The columns are as follows: Order ID, Product, Quantity Ordered, Price Each, Order Date, Purchase Address
Data processing involves cleaning and preparing the data for analysis. This section includes various data preprocessing steps.
EDA is the heart of this project, where we explore and analyze the data to gain insights.
We determine the best month for sales and present our findings.
We identify the day of the week with the highest sales and provide insights.
We visualize the timeline of day of the week versus revenue to spot trends.
We analyze sales per hour and present the results.
We identify the product that sold the most and share our findings.
We determine the top-selling products for each city and provide insights.
We present the top 5 products with the highest revenue for each city.
We analyze product associations to identify which products are most frequently sold together.
We calculate the percentage of orders that include multiple products.
We identify the highest value for a single order.
We determine the city with the highest revenue and provide insights.
We identify the city that sold the most products.
We analyze the distribution of states in the dataset.
In conclusion, our data analysis of sales data has provided valuable insights that can guide decision-making across various aspects of the business. These insights include the identification of peak sales months, best-selling products, top revenue-generating cities, and more. the business can make informed decisions to optimize operations, enhance marketing efforts, and maximize revenue. Continued analysis and monitoring of sales data will be crucial for adapting to changing market dynamics and maintaining a competitive edge.