Repository containing the full process of building clustering models on a Kaggle dataset. Several KMeans, DBSCAN, and AHC models were created during the process.

JosePadillaMtnz/CustomerDataClustering_KaggleDataset

Kaggle Dataset on Customer Data Clustering

A study of the Kaggle customer data clustering dataset (https://www.kaggle.com/datasets/imakash3011/customer-personality-analysis), looking for patterns, behaviors, and other potentially useful information. The document is organized into sections, the main ones being the exploratory data analysis (EDA) and the creation and evaluation of clustering models.

The exploratory data analysis looks for patterns in how the variables vary, examines the relationships between them, and performs extensive feature engineering. The modeling part builds several candidate models (KMeans, DBSCAN, and AHC), each both with and without PCA, then compares the results and draws conclusions.
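
As a rough illustration of the modeling comparison described above, the following is a minimal sketch using scikit-learn, not the code from the repository's notebook. Everything specific here is an assumption: the data is a synthetic stand-in generated with `make_blobs` rather than the Kaggle dataset, the hyperparameters (`n_clusters=3`, `eps=0.8`, two PCA components) are placeholders, AHC is taken to mean agglomerative hierarchical clustering, and the silhouette score is only one possible evaluation metric.

```python
import numpy as np
from sklearn.cluster import AgglomerativeClustering, DBSCAN, KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the engineered customer features (assumption:
# the real study would load and preprocess the Kaggle dataset instead).
X, _ = make_blobs(n_samples=300, centers=3, n_features=6, random_state=0)

# Scale the features, then build a PCA-reduced copy for comparison.
X_scaled = StandardScaler().fit_transform(X)
X_pca = PCA(n_components=2, random_state=0).fit_transform(X_scaled)


def make_models():
    """Return fresh, unfitted models; all hyperparameters are illustrative."""
    return {
        "KMeans": KMeans(n_clusters=3, n_init=10, random_state=0),
        "DBSCAN": DBSCAN(eps=0.8, min_samples=5),
        "AHC": AgglomerativeClustering(n_clusters=3, linkage="ward"),
    }


def evaluate(features):
    """Fit each model and return its silhouette score (None if undefined)."""
    scores = {}
    for name, model in make_models().items():
        labels = model.fit_predict(features)
        mask = labels != -1  # DBSCAN labels noise points as -1
        n_clusters = len(set(labels[mask]))
        if n_clusters >= 2:
            scores[name] = float(silhouette_score(features[mask], labels[mask]))
        else:
            # The silhouette score needs at least two clusters.
            scores[name] = None
    return scores


results = {
    "without PCA": evaluate(X_scaled),
    "with PCA": evaluate(X_pca),
}

for setting, scores in results.items():
    for name, score in scores.items():
        shown = "n/a" if score is None else f"{score:.3f}"
        print(f"{setting:12s} {name:7s} silhouette = {shown}")
```

Excluding DBSCAN noise points before scoring is a design choice in this sketch; it makes DBSCAN scores look better than they would if noise counted against the model, so the three algorithms are not compared on exactly the same points.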
