Optimizing Fraud Detection in Financial Services with Graph Neural Networks and NVIDIA GPUs

Originally published at: https://developer.nvidia.com/blog/optimizing-fraud-detection-in-financial-services-with-graph-neural-networks-and-nvidia-gpus/

Learn an end-to-end workflow showcasing best practices for detecting financial services fraud using GNNs and GPUs.

What batch sizes were used while scaling from 1 to 8 GPUs on the MAG240M dataset?

We used a batch size of 8192, which gave us the best classification accuracy. We see similar speedups with lower batch sizes as well.

Hi!
Can you share the full end-to-end code for fraud detection (including R-GCN building, training and downstream XGBoost applying)?


Hi, I know I am a bit late to the topic, but I have some questions I was hoping you could answer.

So far, I have conducted preprocessing, and the dataset now contains 20 numerical features. As the article suggests, I have saved the bulk of the data on the edges between nodes, leaving the nodes featureless apart from their distinct IDs. Moreover, from my perspective, it would seem that these transactions have only one relationship, which is “Credit card purchases from Merchant”. Now, I have some questions regarding the article’s suggestions:

  1. The more I look into R-GCN (and GCN, for that matter), the more it seems that these models do not use edge features, but node features instead. As such, wouldn’t it be ineffective to conduct node embedding and node classification as the article suggests, since there is no information on the nodes themselves, and their IDs provide no information to detect fraudulence?
  2. Does R-GCN provide any significant advantage over GCN in this instance, given that there is only one type of relationship?
  3. I have also seen the article suggest using Link Prediction as part of the approach, but I do not understand how it helps with detecting fraudulent transactions.

I am having a pretty hard time understanding this article and its methods, and I would really appreciate some clarification.

Hi there nthqhai2002!

Thanks for reading this blog and for your awesome questions!

it would seem that these transactions only have one relationship, which is “Credit card purchases from Merchant”

In order to make the edge undirected and to allow message propagation between both node classes of the graph, we also add a second, reverse edge type, in your case “Merchant has purchase from Credit Card”.
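For illustration, here is a minimal sketch of that in DGL (the node/edge type names and tensors are hypothetical, not from the blog’s code):

```python
import torch
import dgl

# Hypothetical transaction edges: credit card i purchased from merchant j.
card_ids = torch.tensor([0, 1, 1, 2])
merchant_ids = torch.tensor([0, 0, 1, 1])

# Adding a reverse edge type lets messages flow merchant -> card as well,
# making the bipartite graph effectively undirected.
graph = dgl.heterograph({
    ("card", "purchases_from", "merchant"): (card_ids, merchant_ids),
    ("merchant", "has_purchase_from", "card"): (merchant_ids, card_ids),
})
```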

As such, wouldn’t it be ineffective to conduct node embeddings?

The IDs themselves are valuable as well in a transductive setting, as a learned user embedding encodes the generalized structural-behavioral profile of the user. You can also aggregate adjacent edge features per node to use as a node feature.
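As a rough sketch of that aggregation idea in DGL (feature names and sizes are made up for illustration):

```python
import torch
import dgl
import dgl.function as fn

# Hypothetical bipartite graph: card -> merchant transaction edges.
graph = dgl.heterograph({
    ("card", "purchases_from", "merchant"): (
        torch.tensor([0, 1, 1, 2]),
        torch.tensor([0, 0, 1, 1]),
    ),
})

# Made-up 20-dimensional feature vector on each transaction edge.
graph.edges["purchases_from"].data["feat"] = torch.randn(
    graph.num_edges("purchases_from"), 20
)

# Mean-pool each merchant's incident edge features into a node feature;
# fn.copy_e copies the edge data as the message.
graph.update_all(
    fn.copy_e("feat", "m"), fn.mean("m", "h"), etype="purchases_from"
)
merchant_feats = graph.nodes["merchant"].data["h"]  # (num_merchants, 20)
```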

You are correct that an architecture that propagates edge information would likely be useful here. The purpose of the blog post was mainly to show baseline usefulness, but if you’re interested in edge-inclusive papers, you can refer to “Exploiting Edge Features for Graph Neural Networks” (IEEE Xplore).

I have also seen the article suggest using Link Prediction as part of the approach, but I do not understand how it helps with detecting fraudulent transactions.

Link Prediction in this case is used to generate robust representations of nodes, which can be used downstream in the direct prediction of the fraud label. Often in the fraud detection domain, labels are noisy and generally weak. Training representations on non-noisy labels (transaction presence) often has more consistent convergence properties.
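To make that concrete, here is one common way to set up link-prediction pretraining (a generic sketch in plain PyTorch, assuming a single homogeneous embedding table; not the blog’s exact code): score node-embedding pairs for observed edges against randomly corrupted negatives.

```python
import torch
import torch.nn.functional as F

def link_prediction_loss(embeddings, pos_src, pos_dst, num_nodes):
    # embeddings: (num_nodes, dim) node embeddings from the GNN encoder.
    # pos_src, pos_dst: endpoints of observed (positive) edges.

    # Score observed edges with a dot product of endpoint embeddings.
    pos_score = (embeddings[pos_src] * embeddings[pos_dst]).sum(dim=-1)

    # Negative sampling: corrupt the destination of each positive edge.
    neg_dst = torch.randint(0, num_nodes, pos_dst.shape)
    neg_score = (embeddings[pos_src] * embeddings[neg_dst]).sum(dim=-1)

    # "Edge exists" is the clean, non-noisy label the representations
    # are trained on, in contrast to weak fraud labels.
    scores = torch.cat([pos_score, neg_score])
    labels = torch.cat(
        [torch.ones_like(pos_score), torch.zeros_like(neg_score)]
    )
    return F.binary_cross_entropy_with_logits(scores, labels)
```

The resulting embeddings can then be attached to the tabular data and passed to a downstream classifier such as XGBoost for the actual fraud label.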


Hi kkranen!
Thanks a lot for the response; your answers have helped me a lot!
I have another question regarding this workflow, if you do not mind. Suppose I have trained the model on data from 2015 until 2021, and a new influx of data from 2022 comes in. At this stage, I would follow the workflow to generate robust node embeddings, attach them to the 2022 tabular data according to the unique node IDs, and then make predictions. My question is: should the node embeddings for 2022 be generated by submitting the entire dataset from 2015 to 2022 to the workflow, or do I only need to use the 2022 data?


Hello,
Were you able to find the reproducible source code?