Credit Card Fraud Detection using Random Forest Algorithm
The challenge is to recognize fraudulent credit card transactions so that customers of credit card companies are not charged for items they did not purchase.
- Enormous volumes of data are processed every day, so the model must be fast enough to respond to a scam in time.
- Imbalanced data, i.e. most of the transactions (99.8%) are not fraudulent, which makes the fraudulent ones very hard to detect.
- Data availability, as transaction data is mostly private.
- Misclassified Data can be another major issue, as not every fraudulent transaction is caught and reported.
- Scammers adapt their techniques to evade the model.
• Pandas
• Matplotlib
• Seaborn
• scikit-learn (sklearn)
• Random Forest Algorithm
• creditcard.csv (https://www.kaggle.com/datasets/mlg-ulb/creditcardfraud)
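The tools listed above could be combined roughly as follows. This is a minimal sketch, not the project's actual code: the function name `train_random_forest`, the 80/20 stratified split, and the hyperparameters are assumptions made for illustration. The only detail taken from the dataset itself is the `Class` label column (1 for fraud, 0 otherwise).

```python
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split


def train_random_forest(df, label_col="Class", test_size=0.2, seed=42):
    """Fit a Random Forest on every column except the label column.

    Hypothetical helper: the split ratio and hyperparameters are
    illustrative assumptions, not values taken from the project.
    """
    X = df.drop(columns=[label_col])
    y = df[label_col]
    # Stratify so the rare fraud class is present in both splits.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=test_size, stratify=y, random_state=seed
    )
    model = RandomForestClassifier(n_estimators=100, random_state=seed)
    model.fit(X_train, y_train)
    # Return the held-out split so the caller can evaluate the model.
    return model, X_test, y_test
```

With the Kaggle file downloaded next to the script, a call such as `train_random_forest(pd.read_csv("creditcard.csv"))` would return the fitted model together with the held-out test split.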
- Only 0.17% of all transactions are fraudulent.
- The data is highly imbalanced.
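The imbalance noted above can be checked with a few lines of pandas. The sketch below uses a tiny made-up frame rather than the real file, and `class_share` is a hypothetical helper name. On the real data, the same function would be applied to the frame loaded from `creditcard.csv`.

```python
import pandas as pd


def class_share(df, label_col="Class"):
    """Return the percentage of rows belonging to each class label."""
    return df[label_col].value_counts(normalize=True).mul(100).round(2)


# Made-up toy data, not the real dataset: 998 legitimate rows, 2 fraud rows.
toy = pd.DataFrame({"Class": [0] * 998 + [1] * 2})
shares = class_share(toy)
print(shares)
```

In this toy frame class 1 makes up 0.2% of the rows, which is the same order of magnitude as the 0.17% reported for the real dataset.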
• The accuracy score is 100%.
• The precision score is 100% for class 0 (legitimate) and 92% for class 1 (fraud).
• The recall score is 100% for class 0 and 80% for class 1.
• The f1-score is 100% for class 0 and 85% for class 1.
• In a comparison with other algorithms, with no treatment of the class imbalance, the Random Forest model gives the better result.
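Because the data is so imbalanced, the per-class precision and recall listed above are more informative than the accuracy score. The sketch below illustrates why, using made-up labels (not the project's predictions) and scikit-learn's `accuracy_score` and `precision_recall_fscore_support`: a "model" that never flags fraud still reaches very high accuracy.

```python
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

# Made-up labels for illustration: 1000 transactions, 2 of them fraudulent.
y_true = [0] * 998 + [1] * 2
# A useless baseline that labels every transaction as legitimate.
y_pred = [0] * 1000

accuracy = accuracy_score(y_true, y_pred)
# Per-class scores; zero_division=0 avoids a warning for the class
# that is never predicted.
precision, recall, f1, support = precision_recall_fscore_support(
    y_true, y_pred, labels=[0, 1], zero_division=0
)

print(f"accuracy:     {accuracy:.3f}")
print(f"fraud recall: {recall[1]:.3f}")
```

Here the accuracy is 0.998 while the recall for the fraud class is 0, so the class 1 scores are the ones to watch when comparing models.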