Fund Allocation for Countries in Need

Overview

This project focuses on using unsupervised learning techniques to cluster countries based on their socioeconomic indicators. The goal is to provide insights to policymakers, NGOs, and governments to allocate funds to countries most in need.

Dataset

The dataset used in this project contains socioeconomic indicators for countries around the world. It includes features such as GDP per capita, infant mortality rate, life expectancy, etc. The data was obtained from Kaggle and was cleaned/preprocessed before analysis.

Methodology

Exploratory Data Analysis (EDA)

Conducted comprehensive EDA to understand the distribution and relationships between variables.
Visualized key features and explored correlations using plots and charts.

Preprocessing

Performed data preprocessing steps such as handling missing values, feature scaling, and encoding categorical variables.

Feature Engineering

Scale the features to ensure uniformity.
Perform PCA (Principal Component Analysis) to reduce dimensionality.

Clustering Techniques

Applied various clustering algorithms including K-means, DBSCAN, and hierarchical clustering.
Evaluated clustering performance using metrics such as Silhouette Score, Davies-Bouldin score, and Calinski-Harabasz score.

Results

K-means Performance:
- Silhouette Score: 0.332
- Davies-Bouldin Score: 1.133
- Calinski-Harabasz Score: 85.015

Findings

Cluster 2 comprises countries primarily located in Africa and Asia, characterized by the higher child mortality rates and the lower levels of economic development.
Cluster 1 consists of countries distributed across South America, parts of Africa, Europe, and Asia. These countries exhibit average values across all features compared to other clusters.
Cluster 0 includes countries mainly located in North America, Europe, Oceania, and a few in Asia. These countries demonstrate strong or positive indicators such as robust economic development, higher life expectancy, and lower child mortality rates.
Obs: Blank spaces, such as Mexico, indicate countries with no available data.

Cluster Map - K-means

Cluster Map - DBSCAN

Conclusion

The clustering analysis revealed distinct clusters of countries based on their socioeconomic characteristics. These insights can inform decision-makers on where to allocate resources effectively.

Usage

Clone the repository to your local machine.
Install the necessary dependencies (Python libraries, Jupyter Notebook).
Run the Jupyter Notebook files in sequential order to reproduce the analysis.
Explore the code and visualizations to gain insights into the clustering results.
Refer to the documentation for detailed explanations of each step.

Contributors

Eryclis Rodrigues Bezerra Silva

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
archive		archive
EDA_and_Feature_Engineering.ipynb		EDA_and_Feature_Engineering.ipynb
LICENSE		LICENSE
Modeling - DBSCAN.ipynb		Modeling - DBSCAN.ipynb
Modeling - K-Means.ipynb		Modeling - K-Means.ipynb
Needed Help per Country (DBSCAN).png		Needed Help per Country (DBSCAN).png
Needed Help per Country (k-means).png		Needed Help per Country (k-means).png
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fund Allocation for Countries in Need

Overview

Dataset

Methodology

Exploratory Data Analysis (EDA)

Preprocessing

Feature Engineering

Clustering Techniques

Results

Findings

Cluster Map - K-means

Cluster Map - DBSCAN

Conclusion

Usage

Contributors

About

Releases

Packages

Languages

License

Eryclis/Fund-Allocation-for-Countries-in-Need

Folders and files

Latest commit

History

Repository files navigation

Fund Allocation for Countries in Need

Overview

Dataset

Methodology

Exploratory Data Analysis (EDA)

Preprocessing

Feature Engineering

Clustering Techniques

Results

Findings

Cluster Map - K-means

Cluster Map - DBSCAN

Conclusion

Usage

Contributors

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages