I. Introduction
The widespread adoption of digital payments and online credit has enabled financial services to scale and operate in real time. It has also made fraud more covert and more organized. Traditional credit fraud detection often relies on static rules or anomalous signals from a single transaction. In internet platforms, cross-channel payments, and multi-party ecosystems, fraudsters can coordinate across multiple accounts, devices, merchants, and interaction paths [
1]. They dilute single-point risk signals and create behavior traces that appear normal. At the same time, rapid business iteration continuously reshapes transaction patterns and customer profiles. Fraud strategies evolve accordingly. Methods built only on historical patterns can therefore lag and become ineffective. Reliable risk identification under high concurrency, strict timeliness, and complex associations remains a central challenge in credit risk control.
In terms of data structure, credit fraud identification is not simply a tabular classification problem. It is a network-oriented risk perception task with strong dependencies. Transactions, accounts, devices, addresses, merchants, and behavioral sequences are naturally linked through multiple entities and multiple relation types. Risk does not always appear as isolated samples. It often emerges as local clusters, chain-like propagation, or implicit interactions across groups. In practice, many fraud activities depend on relationships. Examples include shared devices, reused collection routes, circular fund transfers, or intermediary accounts that conceal the final beneficiary. These behaviors form interaction graphs with clear structural characteristics. If such a structure is ignored and features are extracted only from individual records, the model will struggle to capture key signals such as group coordination and path dependence. This limits the ability to detect complex fraud patterns [
2].
Against this background, graph neural networks offer a modeling paradigm that matches the nature of the problem. These methods aggregate and propagate contextual information over graph structures. The model can use both individual attributes and relational signals. It can learn more discriminative risk representations from neighborhood relations, interaction intensity, and connection patterns [
3]. Graph neural networks can also capture higher-order relations. Multi-hop message passing enables the model to sense latent long-range associations. It can therefore identify risk behaviors that look normal at the surface but are structurally abnormal. In financial applications, heterogeneous graphs are common. Entity types differ, and relation semantics are diverse. Graph neural networks can jointly model multiple node types and multiple edge types within a unified framework. This provides flexibility for representing complex risk control scenarios [
4]. Overall, reframing credit fraud detection as relationship-driven graph learning supports a more complete understanding of risk formation mechanisms. It also lays theoretical and practical foundations for building more robust and reliable intelligent risk control systems.
III. Proposed Framework
In terms of overall modeling, this paper represents the credit fraud identification problem as a node risk discrimination task on a graph structure. First, a graph representation is constructed containing multiple types of entities and their interactions, where each node corresponds to a risk subject, and each edge characterizes the interaction or association between subjects. Let the graph be:
where
represents the set of nodes and
represents the set of edges. Each node
is associated with an initial feature vector, denoted as:
This representation serves as the basis for the fundamental attribute information of the entity and as the starting point for subsequent relationship modeling and information dissemination. By explicitly introducing a graph structure, the model can incorporate the connections between entities into a unified risk representation space, beyond the feature level. This paper also presents the overall model architecture, as shown in
Figure 1.
During the representation learning stage, the model applies a graph neural network framework that updates node features via neighborhood aggregation. Each node’s new representation is derived by applying a function that fuses its own attributes with aggregated signals from its neighbors. This approach has been successfully applied in the context of anomaly detection for financial data, where attention-based neighborhood aggregation enhances the model’s ability to distinguish between normal and anomalous patterns [
19]. To further strengthen relational modeling, our method applies attention mechanisms during the aggregation process, allowing the model to adaptively weight neighboring nodes—a strategy that has proven effective in previous deep learning frameworks for anomaly detection [
20].
In handling heterogeneous graphs characteristic of transaction networks, the method further incorporates advanced message passing techniques, as applied by Xie & Chang [
21], enabling the integration of both node-level and relational context for more expressive representations. Accordingly, the update rule in our model is designed to jointly encode local features and multi-type relational information, and is formally expressed as:
where
is the normalized adjacency matrix,
is the learnable parameter matrix of the l-th layer, and
is the nonlinear mapping function. This process enables risk information to propagate layer by layer in the graph structure, allowing nodes to perceive the behavioral patterns of their directly and indirectly related entities, thereby forming a potential representation of the fused structural context.
To enhance the model's ability to characterize complex relational structures, a multi-layer propagation mechanism is further introduced, capturing higher-order dependencies by stacking several graph convolutional layers. After L layers of updates, the final representation of the nodes is obtained:
This representation integrates information from multi-hop neighborhoods, which helps identify fraud patterns that rely on cooperative relationships or path structures. Compared to shallow modeling methods that only use local neighborhood information, multi-layer propagation can mitigate the impact of single-interaction noise, making risk features more stable and coherent in the structural space.
In the risk discrimination stage, the model constructs a probability output based on the final representation of the nodes to characterize the likelihood of the subject engaging in fraudulent behavior. Specifically, a linear mapping and normalization function are used to map the representation to a risk score:
where
represents the output layer parameters. This design enables the model to complete the end-to-end reasoning process from relationship modeling to risk scoring within a unified framework. By jointly encoding structural information and node attributes into the representation space, the method can theoretically more fully reflect the correlation and systemic characteristics of credit fraud, providing a consistent modeling foundation for subsequent risk control and decision support.
IV. Experimental Analysis
A. Dataset
The dataset used in this study is the public Credit Card Fraud Detection dataset, which is widely adopted in credit fraud identification research. It consists of credit card transaction records from European cardholders. It is available from open source platforms for machine learning and anomaly detection tasks. The dataset contains a large volume of transaction behavior information generated in real payment environments. It includes several hundred thousand records. The fields include transaction time, transaction amount, and anonymized transaction features. To protect privacy, the transaction information has been preprocessed, such as by principal component analysis. The original attributes are transformed into numerical features. Key information relevant to fraud pattern analysis is retained. The labels clearly distinguish legitimate transactions from fraudulent behavior. This provides supervised semantic annotations for credit fraud identification.
A prominent characteristic of this dataset is severe class imbalance. Fraudulent samples account for only a very small proportion of all transactions. This setting is representative of real-world credit fraud scenarios. Since fraud is inherently rare in the overall transaction population, the minority class constitutes only a tiny fraction of the total volume. This imbalanced distribution imposes stricter requirements on learning strategies. In addition, fields such as transaction time and amount support studies of temporal dynamics and monetary scale. These attributes are directly useful for building graph structures and defining node features in graph neural networks. Therefore, this open dataset is not only a standard benchmark for classical credit fraud detection. It also provides a rich basis for mining complex relational patterns.
Although the dataset is originally provided in a flat tabular format, graph neural network modeling typically requires graph construction based on relations among entities. Entities such as transaction participants, for example, accounts and merchants, device identifiers, and time windows can be treated as nodes. Transaction activities can be defined as edges. In this way, raw transaction logs can be transformed into a multi-entity relational graph. This representation reflects the relational nature of financial activities. It also provides richer contextual information for deep graph models. It helps capture risk propagation patterns across nodes. The relational graph built from this dataset contains both attribute information and behavioral links. This supports the modeling and discrimination of complex fraudulent behaviors with graph neural networks.
B. Experimental Results
This article first presents the results of the comparative experiments, as shown in
Table 1.
Results show clear stratification across methods on credit fraud detection, indicating that modeling paradigm strongly determines risk identification quality: traditional or weakly structured models struggle to capture coordinated and concealed fraud within complex transaction relations and are more sensitive to local noise, while relational/graph-aware methods are more stable because independent features alone cannot represent the underlying risk structure. Performance generally improves as models leverage more transaction connectivity, reflecting that fraud is networked and often propagates along account interactions and transaction paths, so explicitly modeling structure helps capture anomalous signals and improves robustness. Our method achieves consistent advantages across metrics by jointly modeling node attributes and relational structure to learn more discriminative risk representations under multi-account coordination and path obfuscation, reinforcing the view that fraud detection is fundamentally a structured relational learning problem with strong practical potential in large financial networks. Finally, because GNN depth controls the neighborhood range for aggregation and thus local vs. higher-order relationship modeling, we conduct a sensitivity analysis across layer settings, with results reported in
Figure 2.
The depth sensitivity curves indicate that the model exhibits a clear and controllable change in how it captures structural fraud patterns under different propagation depths. With a shallow depth, node representations rely more on local neighborhoods. The model can capture short-range anomaly cues induced by direct interactions. However, coverage is limited for cross-account paths and covert coordinated behavior. As the depth increases, the aggregation range expands. The model can incorporate risk signals from multi-hop relations into a shared representation space. This makes it more sensitive to graph-structured risks such as group-based fraud and path-based transfers. The trend supports the need to formulate fraud detection as a relational learning task.
The accuracy trend suggests that a moderate propagation depth better balances local details and global context. The model can maintain stable discrimination under the diverse patterns of legitimate transactions. If the depth is too shallow, fraud patterns that depend on higher-order relations are more likely to be missed. This reflects insufficient modeling of complex structures. When the depth continues to increase, accuracy shows signs of decline. This suggests effects from over-smoothing or noise diffusion. In credit transaction graphs, legitimate behavior is denser and more frequent. Excessive depth can mix large amounts of non-risk neighborhood information into node representations. This weakens the decision boundary for key anomalous structures.
The recall curve shows a sharp increase followed by a plateau. This matches the dependence of fraud detection on relational diffusion. Multi-hop aggregation brought by deeper propagation makes hidden fraud associations along paths more observable. This is especially important when fraud samples are rare and behavioral camouflage is strong. Cross-node propagation helps compensate for the limitations of single-point features. After the depth reaches a certain level, the gain from adding more layers diminishes. This indicates that most critical fraud relations have already entered the effective receptive field. Further depth mainly introduces redundant neighborhood information rather than additional evidence.
The variations in AUC and F1 are more pronounced. This indicates that depth changes affect not only whether fraud can be captured, but also the trade-off between ranking ability and precise coverage. AUC drops markedly at a deeper setting. This suggests that overly deep propagation may dilute risk representations with non-risk neighborhoods. It reduces overall separability and ranking stability. AUC then rises again, which implies a nonlinear relationship between structural signal absorption and noise accumulation across depths. The fluctuation of F1 shows that, under class imbalance, depth adjustment changes the structural distribution of false positives and false negatives. Moderate depth is more likely to yield a relatively balanced decision state. Depth that is too shallow or too deep can bias the model toward one type of error. This provides direct modeling evidence for selecting an appropriate propagation depth in credit fraud scenarios.
V. Conclusion
This study addresses real-world characteristics of credit fraud detection, including complex relational structures, strong behavioral concealment, and highly coordinated risk patterns. It conducts a systematic investigation of graph neural network-based modeling. It elevates the conventional risk identification problem from an independent sample-centered setting to a structure-driven graph learning task. By representing transaction entities and their interactions as a unified graph, the model can exploit relational information among multiple parties beyond feature-level signals. This design aligns more closely with the intrinsic mechanisms of risk formation and diffusion in real financial systems. The analysis indicates that relational modeling plays an irreplaceable role in improving the completeness of risk characterization and the consistency of discrimination. It offers a new methodological perspective for intelligent risk control in complex credit scenarios.
From a methodological perspective, graph neural networks propagate and aggregate neighborhood information layer by layer. This enables the model to perceive local interactions and higher-order relations at multiple scales. It shows stronger adaptability to typical risk patterns, including organized group fraud, chain-like transfers, and implicit coordination. The model no longer relies only on explicit anomalous features. It learns latent risk representations from structural context. This shifts fraud detection from static judgment to relation-aware perception and structural reasoning. The shift supports more stable risk identification. It also provides a theoretical basis for building more robust risk control systems.
From an application perspective, graph neural network-based credit fraud detection is important for risk management in financial institutions. On the one hand, relation-aware models can capture cross-account and cross-channel risk associations earlier. This provides richer support for real-time interception and early warning. On the other hand, structured risk representations can improve interpretability. They make risk decisions easier to align with business rules, compliance requirements, and audit processes. The approach also has strong transfer potential across related domains, including payment risk control, credit approval, and anti-money laundering.
Looking ahead, credit fraud detection will remain in environments with continuous distribution shifts and evolving attack strategies. Several questions require further study. One is how to achieve continual learning under dynamic graph structures. Another is how to maintain stable structural representations under noisy and incomplete relations. A third is how to support cross-scenario risk modeling while satisfying privacy and compliance constraints. As graph learning methods become more integrated with financial business systems, graph neural network-based risk identification is expected to play a key role in larger-scale and more complex financial networks. It can provide long-term support for building secure, robust, and intelligent financial infrastructure.
References
- Xiang, S.; Zhu, M.; Cheng, D. Semi-Supervised Credit Card Fraud Detection via Attribute-Driven Graph Representation. Proceedings of the AAAI Conference on Artificial Intelligence 2023, vol. 37(no. 12), 14557–14565. [Google Scholar] [CrossRef]
- Duan, Y.; Zhang, G.; Wang, S. CAT-GNN: Enhancing Credit Card Fraud Detection via Causal Temporal Graph Neural Networks. arXiv 2024, arXiv:2402.14708. [Google Scholar]
- Ali; Abd Razak, S.; Othman, S. H. Financial Fraud Detection Based on Machine Learning: A Systematic Literature Review. Applied Sciences 2022, vol. 12(no. 19), 9637. [Google Scholar] [CrossRef]
- Zou, Y.; Cheng, D. Effective High-Order Graph Representation Learning for Credit Card Fraud Detection. arXiv 2025, arXiv:2503.01556. [Google Scholar] [CrossRef]
- Fan, H.; Yi, Y.; Xu, W.; Wu, Y.; Long, S.; Wang, Y. Intelligent Credit Fraud Detection with Meta-Learning: Addressing Sample Scarcity and Evolving Patterns. 2025. [Google Scholar]
- Lai, J.; Xie, A.; Feng, H.; Wang, Y.; Fang, R. Self-Supervised Learning for Financial Statement Fraud Detection with Limited and Imbalanced Data. 2025. [Google Scholar]
- Xu, Q.; Xu, W.; Su, X.; Ma, K.; Sun, W.; Qin, Y. Enhancing Systemic Risk Forecasting with Deep Attention Models in Financial Time Series. In Proceedings of the 2025 2nd International Conference on Digital Economy, Blockchain and Artificial Intelligence, 2025; pp. 340–344. [Google Scholar]
- Wu, Y.; Qin, Y.; Su, X.; Lin, Y. Transformer-Based Risk Monitoring for Anti-Money Laundering with Transaction Graph Integration. In Proceedings of the 2025 2nd International Conference on Digital Economy, Blockchain and Artificial Intelligence, 2025; pp. 388–393. [Google Scholar]
- Gan, Q.; Ying, R.; Li, D.; Wang, Y.; Liu, Q.; Li, J. Dynamic Spatiotemporal Causal Graph Neural Networks for Corporate Revenue Forecasting. 2025. [Google Scholar]
- Zhang, Q.; Wang, Y.; Hua, C.; Huang, Y.; Lyu, N. Knowledge-Augmented Large Language Model Agents for Explainable Financial Decision-Making. arXiv 2025, arXiv:2512.09440. [Google Scholar] [CrossRef]
- Xing, Y.; Wang, M.; Deng, Y.; Liu, H.; Zi, Y. Explainable Representation Learning in Large Language Models for Fine-Grained Sentiment and Opinion Classification. 2025. [Google Scholar]
- Yan, L.; Wang, Q.; Liu, C. Semantic Knowledge Graph Framework for Intelligent Threat Identification in IoT. 2025. [Google Scholar]
- Pan, S.; Wu, D. Modular Task Decomposition and Dynamic Collaboration in Multi-Agent Systems Driven by Large Language Models. arXiv 2025, arXiv:2511.01149. [Google Scholar] [CrossRef]
- Wang, Y.; Wu, D.; Liu, F.; Qiu, Z.; Hu, C. Structural Priors and Modular Adapters in the Composable Fine-Tuning Algorithm of Large-Scale Models. arXiv 2025, arXiv:2511.03981. [Google Scholar]
- Zhang, H.; Zhu, L.; Peng, C.; Zheng, J.; Lin, J.; Bao, R. Intelligent Recommendation Systems Using Multi-Scale LoRA Fine-Tuning and Large Language Models. 2025. [Google Scholar]
- Zheng, J.; Zhang, H.; Yan, X.; Hao, R.; Peng, C. Contrastive Knowledge Transfer and Robust Optimization for Secure Alignment of Large Language Models. arXiv 2025, arXiv:2510.27077. [Google Scholar] [CrossRef]
- Gao, K.; Hu, Y.; Nie, C.; Li, W. Deep Q-Learning-Based Intelligent Scheduling for ETL Optimization in Heterogeneous Data Environments. arXiv 2025, arXiv:2512.13060. [Google Scholar]
- Liu, R.; Zhang, R.; Wang, S. Graph Neural Networks for User Satisfaction Classification in Human-Computer Interaction. arXiv 2025, arXiv:2511.04166. [Google Scholar] [CrossRef]
- Li, J.; Gan, Q.; Liu, Z.; Chiang, C.; Ying, R.; Chen, C. An Improved Attention-Based LSTM Neural Network for Intelligent Anomaly Detection in Financial Statements. 2025. [Google Scholar]
- Wang, H.; Nie, C.; Chiang, C. Attention-Driven Deep Learning Framework for Intelligent Anomaly Detection in ETL Processes. 2025. [Google Scholar]
- Xie; Chang, W. C. Deep Learning Approach for Clinical Risk Identification Using Transformer Modeling of Heterogeneous EHR Data. arXiv 2025, arXiv:2511.04158. [Google Scholar] [CrossRef]
- Van Belle, R.; Baesens, B.; De Weerdt, J. CATCHM: A Novel Network-Based Credit Card Fraud Detection Method Using Node Representation Learning. Decision Support Systems 2023, vol. 164, 113866. [Google Scholar] [CrossRef]
- Zhu, H.; Zhou, M. C.; Liu, G. NUS: Noisy-Sample-Removed Undersampling Scheme for Imbalanced Classification and Application to Credit Card Fraud Detection. IEEE Transactions on Computational Social Systems 2023, vol. 11(no. 2), 1793–1804. [Google Scholar] [CrossRef]
- Padhi, B. K.; Chakravarty, S.; Naik, B. RHSOFS: Feature Selection Using the Rock Hyrax Swarm Optimization Algorithm for Credit Card Fraud Detection System. Sensors 2022, vol. 22(no. 23), 9321. [Google Scholar] [CrossRef] [PubMed]
- Ghosh Dastidar, K.; Jurgovsky, J.; Siblini, W. NAG: Neural Feature Aggregation Framework for Credit Card Fraud Detection. Knowledge and Information Systems 2022, vol. 64(no. 3), 831–858. [Google Scholar] [CrossRef]
- Zhao, X.; Guan, S. CTCN: A Novel Credit Card Fraud Detection Method Based on Conditional Tabular Generative Adversarial Networks and Temporal Convolutional Network. PeerJ Computer Science 2023, vol. 9, e1634. [Google Scholar] [CrossRef] [PubMed]
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).