The goal of the project is to group consumers into clusters using the elbow approach. The project also includes scatter plots to show the relationships between the variables and dataset's columns. Customer Segmentation is the subdivision of a marketplace into discrete client agencies that proportion comparable characteristics. Customer Segmentation may be a effective method to perceive unhappy client needs. Using the above statistics groups can then outperform the opposition through growing uniquely attractive merchandise and services. K method clustering is one of the maximum famous clustering algorithms and normally the primary component practitioners practice while fixing clustering duties to get an concept of the shape of the dataset. The purpose of K method is to organization statistics factors into wonderful non-overlapping subgroups. One of the important software of K method clustering is segmentation of clients to get a higher information of them which in flip may be used to growth the sales of the company.
K-means clustering Elbow method Libraries- Pandas, Matplotlib, Numpy, Seaborn, Skleran libraries
The package can be installed using pip
pip install customer_segmentation
The most common ways companies segment their customer base are:
- Demographic information such as gender, age, marital and marital status, income, education and occupation;
- Geographic Information. It depends on the scale of your business. For localized businesses, this information may relate to a specific city or county. For large companies, this may mean the customer's city, state or even country of residence;
- Psychographic data such as social class, lifestyle, and personality traits;
- Behavioral data such as spending and consumption habits, product/service usage and desired benefits.
This package can be used for the -Determine the right price for the product. -Create individualized marketing initiatives. -Create the best distribution plan possible. -Select particular product features for implementation. -Set new product development activities as a priority.
The data was taken from Kaggle competition datasets