The primary objective of this project is to develop a robust fraud detection system for financial transactions using Autoencoder Neural Networks, leveraging data akin to real-time SAP ERP system data. The autoencoder model will be trained to accurately reconstruct regular transaction data, resulting in a low reconstruction loss (below a certain threshold) for these typical patterns. However, when faced with anomalous transactions that deviate from the norm, the model is expected to exhibit a higher reconstruction loss (above a certain threshold), thereby signaling potential fraudulent activity. A key aspect of this project involves visualizing the reconstruction loss across various transactions to clearly distinguish between normal and anomalous behavior.
The encoder consists of a dense layer with a Leaky ReLU activation function, taking an input with a shape of 618 and producing an output with a shape of 3. Similarly, the decoder is another dense layer with a Leaky ReLU activation, which reconstructs the encoder's output froTm a shape of 3 back to the original shape of 618. After the autoencoder processes the input data, the difference between the original input and its reconstructed version (output) is calculated. This difference is referred to as the reconstruction loss, any data point with a reconstruction loss higher than the threshold could be considered anomalous.
The AENN is trained for 25 epochs, using binary cross-entropy as the loss function and the ADAM optimizer. The training statistics are presented below.
The encoder and decoder weights for the 25th epoch are available at "encoder_model_epoch_25.pth", "decoder_model_epoch_25.pth" files.
The 95th percitle of the reconstruction loss is set as threshold.
threshold = np.percentile(reconstruction_loss_transaction, 95)
The figure below shows the performance of AENN for anomaly detection.
We already have the labels in our dataset indicating regular, local and global anomalies. The figure below is the plot of reconstruction loss along with already available labels.
This figure is almost similar to the one above hence we can infer that the decoder will be able to reconstruct the original data for regular transactions hence the loss is very low (below threshold) where as the loss is high (above threshold) for some cases which indicates these are anomalous one's.
- Sai Harsha Vardhan Reddy, Kolan- skolan@horizon.csueastbay.edu, harsha62334@gmail.com
Thanks for reading!