This project requires Python 3 and the following Python libraries installed:
The PaySim dataset consists of over 6 million data points, each with 9 features, generated by the PaySim Retail Simulation Software. The dataset is available on Kaggle.
Features
1. 'type': CASH-IN, CASH-OUT, DEBIT, PAYMENT and TRANSFER.
2. 'amount': amount of the transaction in local currency.
3. 'nameOrig': customer who started the transaction.
4. 'oldbalanceOrg': initial balance before the transaction.
5. 'newbalanceOrig': new balance after the transaction.
6. 'nameDest': customer who is the recipient of the transaction.
7. 'oldbalanceDest': initial balance of the recipient before the transaction. Note that there is no information for recipients whose name starts with M (Merchants).
8. 'newbalanceDest': new balance of the recipient after the transaction. Note that there is no information for recipients whose name starts with M (Merchants).
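As a rough illustration of the merchant note above, the sketch below parses two made-up rows with Python's standard `csv` module and flags recipients whose name starts with `M`. Only the column names come from the feature list. The sample rows, the helpers `is_merchant` and `load_rows`, and the extra `destIsMerchant` key are hypothetical and not part of the dataset.

```python
import csv
import io

# Made-up sample rows for illustration only; not taken from the real dataset.
SAMPLE_CSV = """type,amount,nameOrig,oldbalanceOrg,newbalanceOrig,nameDest,oldbalanceDest,newbalanceDest
PAYMENT,100.0,C1001,500.0,400.0,M2001,0.0,0.0
TRANSFER,250.0,C1002,250.0,0.0,C3001,10.0,260.0
"""

NUMERIC_COLUMNS = ("amount", "oldbalanceOrg", "newbalanceOrig",
                   "oldbalanceDest", "newbalanceDest")


def is_merchant(name):
    """Hypothetical helper: names starting with 'M' denote merchants."""
    return name.startswith("M")


def load_rows(text):
    """Parse CSV text, convert numeric columns to float, flag merchants."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        for col in NUMERIC_COLUMNS:
            row[col] = float(row[col])
        # Destination balances carry no information for merchant recipients,
        # so downstream code may want to ignore them when this flag is True.
        row["destIsMerchant"] = is_merchant(row["nameDest"])
        rows.append(row)
    return rows


rows = load_rows(SAMPLE_CSV)
for r in rows:
    print(r["nameDest"], r["destIsMerchant"])
```

For the full dataset, the same check could be applied to the 'nameDest' column after loading the Kaggle CSV with whichever library the project uses.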
Target Variable
9. 'isFraud': marks the transactions made by the fraudulent agents inside the simulation. In this dataset, the fraudulent agents aim to profit by taking control of customers' accounts and trying to empty them by transferring the funds to another account and then cashing out of the system.
One more feature, 'Step', indicates the fictional time at which the transaction occurred. This may or may not be useful to us.
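Because the fraud scenario described for 'isFraud' involves transferring funds and then cashing out, a natural first check is the share of fraudulent transactions per 'type'. The following is a minimal sketch on made-up records, using only the standard library. The sample values and the function name `fraud_rate_by_type` are hypothetical, and the `CASH_OUT` spelling is an assumption about how the CSV encodes that type.

```python
from collections import Counter

# Made-up (type, isFraud) pairs for illustration only; not real results.
sample = [
    ("TRANSFER", 1),
    ("CASH_OUT", 1),
    ("PAYMENT", 0),
    ("CASH_OUT", 0),
    ("TRANSFER", 0),
    ("DEBIT", 0),
]


def fraud_rate_by_type(records):
    """Return {type: fraction of records with isFraud == 1}."""
    totals = Counter()
    frauds = Counter()
    for tx_type, is_fraud in records:
        totals[tx_type] += 1
        frauds[tx_type] += is_fraud
    return {t: frauds[t] / totals[t] for t in totals}


rates = fraud_rate_by_type(sample)
for tx_type, rate in sorted(rates.items()):
    print(f"{tx_type}: {rate:.2f}")
```

On the real data, the same aggregation would show whether any transaction types never contain fraud and could be dropped before modelling.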