Credit Card Approval Prediction (Classification Project)

Authors:

About the Dataset:

Credit score cards are a common risk control method in the financial industry. They use personal information and data submitted by credit card applicants to predict the probability of future defaults and credit card borrowings. The bank uses this to decide whether to issue a credit card to the applicant. Credit scores objectively quantify the magnitude of risk.

Goal: The goal of this project was to predict the credit card score status based on the applicant’s information, where the scores are mapped to either "Approved" or "Not Approved" based on certain criteria.

Credit Scores to Predict:

'C': "Credit Approved" status
'X': "Accepted" or "Approved" status
'0', '1', '2', '3', '4', '5': Levels of rejection, from minor to severe issues like insufficient credit history, high debt-to-income ratio, or bankruptcy.

Mapping to Status:

'C': Approved
'X': Approved
'0', '1', '2', '3', '4', '5': Not Approved

The two datasets, credit_record and application_record, were merged based on the common column ID to combine applicant details with credit scores.

Project Deliverables:

A Python file with the complete implemented pipeline.
A report explaining each step and decision in detail.

Key Steps in the Project:

Data Exploration and Statistical Analysis:
- We performed an initial exploration of both credit_record and application_record datasets.
- Conducted sanity checks and determined the appropriate preprocessing techniques to ensure data quality.
- Performed exploratory data analysis (EDA) to understand the distribution of features and identify any outliers, missing values, or discrepancies.
Preprocessing:
- Checked for and handled any missing/null values in the datasets.
- Applied feature engineering where necessary, including creating new features or transforming existing ones to improve model performance.
- Dropped unnecessary columns that did not contribute to the prediction task.
- Checked for class imbalance and addressed it through techniques like oversampling or undersampling.
- Removed any duplicate records.
- Merged the two datasets (credit_record and application_record) based on the ID column.
- Applied label encoding to categorical features and performed feature scaling to ensure all features were on the same scale.
Feature Selection Using Genetic Algorithms:
- Implemented Genetic Algorithms for feature selection to choose the most relevant features for model training.
- The algorithm helped to determine which features contributed most to predicting the credit score status.
Splitting the Data:
- Split the data into training (70%), validation (15%), and testing (15%) sets, ensuring that the data was well-distributed across the subsets for model training and evaluation.
Model Training:
- We trained three classification models:
  - K-Nearest Neighbors (KNN)
  - Decision Trees
  - Multi-Layer Perceptron (MLP)
- The models were trained using the selected features, and performance was evaluated using cross-validation on the validation set.
Hyperparameter Tuning:
- Applied grid search to find the best hyperparameters for each model, tuning parameters such as the number of neighbors for KNN, the depth of decision trees, and the number of hidden layers for MLP.
- This allowed us to optimize the models for better performance.
Model Evaluation:
- Once hyperparameter tuning was completed, the models were evaluated using accuracy as the primary evaluation metric.
- We found that the Decision Trees model performed the best in terms of accuracy, followed closely by KNN.

Results:

Best Model: Decision Trees
Evaluation Metric: Accuracy

Screenshots:

Conclusion:

The project was successful in predicting the credit card approval status by leveraging classification models like KNN, Decision Trees, and MLP. Feature selection using Genetic Algorithms helped in improving model performance, and hyperparameter tuning ensured the optimal settings for each classifier. The Decision Trees model emerged as the best performer, achieving the highest accuracy in predicting whether a credit card application would be approved or not.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
Credit Card Approval Prediction		Credit Card Approval Prediction
Project Requirements-20241221/AI_FALL24_Project/AI_FALL24_Project		Project Requirements-20241221/AI_FALL24_Project/AI_FALL24_Project
__pycache__		__pycache__
AI_Project.pdf		AI_Project.pdf
LICENSE		LICENSE
README.md		README.md
decision_tree_model.joblib		decision_tree_model.joblib
knn_model.joblib		knn_model.joblib
updatedAI.ipynb		updatedAI.ipynb
updatedAI.py		updatedAI.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Credit Card Approval Prediction (Classification Project)

About the Dataset:

Mapping to Status:

Project Deliverables:

Key Steps in the Project:

Results:

Screenshots:

Conclusion:

About

Releases

Packages

Languages

License

Nouran246/Credit-Card-Approval-Prediction-Classification

Folders and files

Latest commit

History

Repository files navigation

Credit Card Approval Prediction (Classification Project)

About the Dataset:

Mapping to Status:

Project Deliverables:

Key Steps in the Project:

Results:

Screenshots:

Conclusion:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages