Anomaly Detection for Financial Fraud with Autoencoder Neural Networks

Objective

The primary objective of this project is to develop a robust fraud detection system for financial transactions using Autoencoder Neural Networks, leveraging data akin to real-time SAP ERP system data. The autoencoder model will be trained to accurately reconstruct regular transaction data, resulting in a low reconstruction loss (below a certain threshold) for these typical patterns. However, when faced with anomalous transactions that deviate from the norm, the model is expected to exhibit a higher reconstruction loss (above a certain threshold), thereby signaling potential fraudulent activity. A key aspect of this project involves visualizing the reconstruction loss across various transactions to clearly distinguish between normal and anomalous behavior.

Softwares & Frameworks

Model Architecture

Model Training

The encoder consists of a dense layer with a Leaky ReLU activation function, taking an input with a shape of 618 and producing an output with a shape of 3. Similarly, the decoder is another dense layer with a Leaky ReLU activation, which reconstructs the encoder's output froTm a shape of 3 back to the original shape of 618. After the autoencoder processes the input data, the difference between the original input and its reconstructed version (output) is calculated. This difference is referred to as the reconstruction loss, any data point with a reconstruction loss higher than the threshold could be considered anomalous.

The AENN is trained for 25 epochs, using binary cross-entropy as the loss function and the ADAM optimizer. The training statistics are presented below.

The encoder and decoder weights for the 25th epoch are available at "encoder_model_epoch_25.pth", "decoder_model_epoch_25.pth" files.

Model Evaluation

The 95th percitle of the reconstruction loss is set as threshold.

threshold = np.percentile(reconstruction_loss_transaction, 95)

The figure below shows the performance of AENN for anomaly detection.

Inference

We already have the labels in our dataset indicating regular, local and global anomalies. The figure below is the plot of reconstruction loss along with already available labels.

This figure is almost similar to the one above hence we can infer that the decoder will be able to reconstruct the original data for regular transactions hence the loss is very low (below threshold) where as the loss is high (above threshold) for some cases which indicates these are anomalous one's.

Contributors

Sai Harsha Vardhan Reddy, Kolan- skolan@horizon.csueastbay.edu, harsha62334@gmail.com

Thanks for reading!

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Dataset		Dataset
README.md		README.md
decoder_model_epoch_25.pth		decoder_model_epoch_25.pth
encoder_model_epoch_25.pth		encoder_model_epoch_25.pth
finanomaly-main.ipynb		finanomaly-main.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Anomaly Detection for Financial Fraud with Autoencoder Neural Networks

Objective

Softwares & Frameworks

Model Architecture

Model Training

Model Evaluation

Inference

Contributors

About

Releases

Packages

Languages

KolanHarsha/Anomaly_Detection_for_Financial_Fraud_with_Autoencoder_Neural_Networks

Folders and files

Latest commit

History

Repository files navigation

Anomaly Detection for Financial Fraud with Autoencoder Neural Networks

Objective

Softwares & Frameworks

Model Architecture

Model Training

Model Evaluation

Inference

Contributors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages