I. Introduction
In the context of ongoing digitalization and intelligent transformation, the complexity and risk of financial systems are becoming increasingly prominent. As enterprises expand in scale and adopt cross-regional and diversified business models, the volume, dimensions, and interconnections of financial data have grown explosively. Traditional auditing methods rely mainly on static sampling and professional judgment. They often fail to capture potential anomalies and risks in vast and complex data. In this environment, the urgent question is how to achieve precise risk identification in financial auditing through intelligent methods[
1]. Under the frequent occurrence of financial fraud, fraudulent transactions, and internal control failures, building intelligent auditing approaches that can uncover causal relationships is not only critical to corporate compliance and governance but also essential to capital market stability and investor confidence.
In recent years, causal representation learning has been recognized as an important theory and method for identifying risks in complex systems. Unlike traditional correlation analysis, causal representation learning emphasizes the causal mechanisms among variables. It helps models separate spurious correlations from true causal effects. In financial systems, many risk events are not caused by a single abnormal indicator but by the interaction of multiple factors. For example, abnormal capital flows may be linked to macroeconomic fluctuations, corporate governance weaknesses, and the behavior of external trading partners. If traditional methods rely only on surface data patterns, they are prone to fall into the trap of correlation without causation, leading to misjudgments. By introducing causal reasoning and structural constraints in feature modeling, causal representation learning can better approximate the true mechanisms of risk generation and provide more interpretable and reliable evidence for audit risk identification[
2,
3].
The identification of risks in financial systems is not only an issue of corporate governance but also closely related to national economic operations and the sound development of social credit systems. Globally, corporate financial fraud, false disclosures, and high-risk investments frequently trigger market turmoil and even systemic financial crises. Traditional auditing methods suffer from delays, limited coverage, and weak adaptability in the face of rapidly evolving financial environments. Applying causal representation learning to financial auditing offers a new technical pathway for risk identification. By constructing causal-driven representations, auditing can move beyond detecting anomalies to explaining the causes and evolutionary logic of risks. This provides regulators and corporate management with decision support that is both forward-looking and actionable[
4].
Moreover, the introduction of causal representation learning carries significant social and practical value. It enhances the transparency and interpretability of audit results, addressing the limitations of “black-box” models. Risk identification is no longer limited to outcome judgments but can present the underlying causal chains. This is especially important in financial auditing, where conclusions must be accurate and withstand regulatory and legal scrutiny. At the same time, causal representation learning supports the shift from a results-oriented to a mechanism-oriented approach in intelligent auditing. It enables enterprises to build early-warning mechanisms before risks occur and to quickly locate root causes after risks emerge, thereby improving governance efficiency and risk control capacity[
5].
In conclusion, applying causal representation learning to risk identification in financial auditing aligns with the trend toward intelligent auditing and data-driven decision-making. It provides a new research paradigm for financial governance and regulatory practice. This direction addresses long-standing pain points in auditing practice and offers a solid foundation for theoretical innovation in auditing methods[
6,
7]. As the scale and complexity of financial data continue to increase, the interpretability and robustness of causal representation learning will play an increasingly important role in future risk identification and control. This lays the groundwork for building a more transparent, efficient, and resilient financial auditing system.
III. Method
In the method design, the proposed framework takes causal representation learning as the central modeling principle and characterizes the risk generation mechanism in financial systems by constructing a causal-driven latent space. Specifically, we employ a nonlinear encoder to map high-dimensional, heterogeneous audit-related financial variables into compact latent factors that preserve risk-relevant structure while reducing sensitivity to superficial correlations. To ensure the learned representation focuses on informative signals in complex processing pipelines, we incorporate attention-guided feature weighting at the representation stage, drawing on the practical effectiveness of attention-driven anomaly detection frameworks for identifying salient patterns under noisy operational data [
19]. In addition, because financial risk patterns may evolve over time and positive risk samples can be scarce, the framework is designed to maintain stable identification under distribution shifts by leveraging training strategies that improve adaptability across changing environments, consistent with meta-learning-based fraud detection settings that explicitly address sample scarcity and evolving patterns [
20]. On this basis, causal constraints are imposed on the latent space so that downstream risk scoring can be grounded in mechanism-relevant factors rather than spurious associations, improving interpretability and robustness for auditing decisions. The overall architecture and information flow are illustrated in
Figure 1, and the formal objective and optimization procedure are defined as follows:
First, let the original financial data be
, where n represents the number of samples and d represents the feature dimension. In the basic representation stage, the input data is mapped to the latent representation space through the nonlinear transformation function
:
Where is a trainable parameter and represents the resulting low-dimensional causal potential representation. The goal of this stage is to compress and structure high-dimensional financial indicators to provide a foundation for subsequent causal modeling.
In the causal structure modeling stage, the method introduces the structural equation model to describe the relationship between variables. Let the set of latent variables be
, and its causal relationship can be characterized by the following form:
Where represents the set of parent nodes pointing to in the causal graph, and is the noise term. This causal structure can distinguish the direct and indirect effects of different risk factors, thereby establishing interpretable dependencies between variables at the representation level.
To improve the causal consistency of representations, a causal constraint representation learning objective is introduced. By minimizing the joint loss of prediction error and causal structure deviation, the learned representation is guaranteed to have both predictive power and conform to causal laws. The joint optimization objective function can be expressed as:
Where is the prediction-related loss function is used to constrain the consistency of the latent representation and the causal graph G, and C is a trade-off factor. This design enables the model to maintain the effectiveness of representation while avoiding the interference of false correlation on risk identification.
In the reasoning and decision-making stage, causal representation is used to calculate the risk score. Let the causal representation vector be
, and the final risk score is obtained through the linear discriminant function:
Where
is the activation function, and W and b are the parameters to be learned. To further quantify the causal effect, the model also calculates the expected difference in results under potential interventions during inference, that is, the average treatment effect:
Where T represents the intervention variable and represents the expected outcome under external intervention. By combining risk scores with causal effects, the model can not only provide risk identification results but also explain the driving mechanisms behind them.
IV. Experimental Results
A. Dataset
In this study, the dataset used is the Enron Email Dataset. This dataset is publicly available and is widely cited in research on financial auditing and risk identification. It contains authentic email communications within a large enterprise, covering a wide range of information such as daily business exchanges, financial correspondence, contract negotiations, and potential risk events. The dataset is large in scale, including hundreds of thousands of email texts. It also provides basic metadata such as timestamps and sender-receiver information, which allows it to reflect the dynamic characteristics of financial systems in real business contexts.
The value of this dataset lies in its rich textual information as well as the embedded organizational structures and potential risk clues. For example, email subjects, content, communication relationships, and time sequences reveal internal communication patterns and possible abnormal behaviors. This information offers a natural setting for causal representation learning, enabling research to explore how causal relationships can support risk identification in real contexts. Compared with single financial indicators, email data is more unstructured and semantically diverse, which increases both the applicability and the challenges of identifying risks in complex systems.
Furthermore, the dataset provides a realistic and multi-layered environment for auditing research. On this basis, studies can not only test the feasibility of causal representation methods for risk identification but also examine how cross-dimensional feature fusion supports risk modeling. By modeling semantic, network, and temporal information within the emails, the research setting becomes closer to the multi-source information integration required in actual auditing tasks. This lays a solid data foundation for building an audit risk identification framework that is more interpretable and robust.
B. Experimental Results
This paper also gives the comparative experimental results, as shown in
Table 1.
On the alignment robustness benchmark, performance improves from MLP to 1D-CNN to LSTM to Transformer as model capacity increases: MLP performs worst across ACC/Precision/Recall/F1 due to weak semantic modeling, 1D-CNN gains by capturing local patterns but remains limited for cross-scale robustness, LSTM improves Recall/F1 via temporal dependency learning yet is constrained by efficiency and optimization issues, and Transformer further boosts ACC/Precision through self-attention over global and local context. Our proposed method achieves the best results on all metrics—especially ACC and F1—because causal representation learning reduces spurious correlations while preserving causal signals for risk identification, improving robustness and generalization under noise and semantic inconsistency; we additionally analyze how the causal regularization coefficient
λ affects Precision (
Figure 2).
Precision is highly sensitive to the causal regularization coefficient
λ with small
λ, Precision stays low because weak causal constraints leave the model vulnerable to spurious correlations; as
λ increases, Precision rises, indicating better noise suppression and stronger causal signal extraction. Precision peaks around
λ≈0.5, suggesting an optimal balance between causal constraints and model flexibility that yields more stable and interpretable risk identification. When
λ exceeds about 0.7, Precision declines because overly strong regularization can over-constrain representations and suppress useful (though not strictly causal) features, reducing adaptability and overall detection quality. Overall, the results show causal regularization improves robustness and interpretability within a proper range, so
λ should be tuned to the task and data distribution; we also evaluate sensitivity of the label-noise ratio on AUC in
Figure 3.
From the experimental results, it can be observed that as the proportion of label noise increases, the AUC metric shows a gradual downward trend. When the noise ratio is 0, the model achieves the highest AUC. This indicates that under clean annotations, causal representation learning can fully exploit its modeling advantages and accurately capture potential risk relationships in financial systems. As the noise ratio rises, the AUC decreases slowly from 0.917 to 0.831, showing that the decline in label quality directly affects model performance.
In the low-noise range (0.1–0.2), AUC decreases slightly but remains at a relatively high level. This suggests that the causal representation learning framework has some ability to handle imperfect annotations. By applying causal constraints, it reduces the influence of spurious correlations and maintains effective recognition of risk patterns. This property is especially important for financial auditing scenarios where annotation is costly and accuracy cannot always be guaranteed.
However, when the noise ratio increases beyond 0.3, the decline of AUC becomes faster. This shows that too many incorrect labels significantly weaken the discriminative power of the model. At this stage, even with causal constraints, it is difficult to fully offset the information shift caused by distorted labels. As a result, risk identification results are more heavily affected. This indicates that while causal representation learning provides some robustness, its performance still depends on the basic reliability of labeled data.
Overall, these experimental results reveal the sensitivity of financial auditing risk identification to the proportion of label noise. In practical applications, ensuring the quality of labeled data combined with causal-driven representation mechanisms is essential to improve reliability in complex environments. With this mechanism, the model can remain robust under moderate noise and also provide credibility and traceability for audit results.
V. Conclusions
This study conducts a systematic exploration of the application of causal representation learning in financial auditing risk identification. By introducing a causal-driven latent space modeling approach, it effectively overcomes the limitations caused by spurious correlations and noise in traditional correlation analysis. Risk identification is no longer limited to outcome-level judgments but can be traced back to underlying causal mechanisms. The results show that this method can extract more interpretable and robust feature representations from multidimensional financial data, providing a new technical path for building a highly reliable audit risk identification system.
Compared with existing correlation-based methods that rely on deep neural networks, the proposed framework emphasizes the organic combination of causality and representation learning. It shows clear advantages in transparency and practicality for risk modeling. Causal representation not only enhances the model’s sensitivity to anomalies and risks but also, through structured causal dependencies, helps auditors better understand potential sources of risk. This ability is of great value in key scenarios such as financial compliance supervision, internal control assessment, and fraud detection. It supports the transformation of intelligent auditing from simple prediction to mechanism-based explanation. At the methodological level, this study achieves a dual improvement in prediction accuracy and causal consistency through joint optimization of the loss function. The model maintains stable performance under multi-source heterogeneous data. In conclusion, this study demonstrates both the theoretical innovation and practical value of causal representation learning in financial auditing risk identification. It improves audit transparency, enhances risk identification accuracy, and contributes to the development of intelligent auditing systems. These findings expand the research frontier at the intersection of financial auditing and artificial intelligence. They also lay the foundation for future applications in complex financial and corporate governance tasks.
References
- Z. Xu, K. Ma, Y. Liu et al., "Causal Representation Learning for Robust Anomaly Detection in Complex Environments," unpublished manuscript, 2025.
- Y. Duan, G. Zhang, S. Wang et al., "CAT-GNN: Enhancing Credit Card Fraud Detection via Causal Temporal Graph Neural Networks," arXiv preprint, arXiv:2402.14708, 2024.
- M. Yang, F. Liu, Z. Chen et al., "CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models," Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9593–9602, 2021.
- A. M. Karimi Mamaghan, A. Dittadi, S. Bauer et al., "Diffusion-Based Causal Representation Learning," Entropy, vol. 26, no. 7, p. 556, 2024. [CrossRef]
- J. Carvalho, M. Zhang, R. Geyer et al., "Invariant Anomaly Detection under Distribution Shifts: A Causal Perspective," Advances in Neural Information Processing Systems, vol. 36, pp. 56310–56337, 2023.
- Y. Liu, Z. Xia, M. Zhao et al., "Learning Causality-Inspired Representation Consistency for Video Anomaly Detection," Proceedings of the 31st ACM International Conference on Multimedia, pp. 203–212, 2023.
- L. Jiao, Y. Wang, X. Liu et al., "Causal Inference Meets Deep Learning: A Comprehensive Survey," Research, vol. 7, p. 0467, 2024. [CrossRef]
- Q. Xu, W. Xu, X. Su, K. Ma, W. Sun and Y. Qin, "Enhancing Systemic Risk Forecasting with Deep Attention Models in Financial Time Series," Proceedings of the 2025 2nd International Conference on Digital Economy, Blockchain and Artificial Intelligence, pp. 340–344, 2025.
- Q. Xu, "Unsupervised Temporal Encoding for Stock Price Prediction through Dual-Phase Learning," 2025.
- Z. Liu and Z. Zhang, "Modeling Audit Workflow Dynamics with Deep Q-Learning for Intelligent Decision-Making," Transactions on Computational and Scientific Methods, vol. 4, no. 12, 2024.
- R. Liu, Y. Zhuang and R. Zhang, "Adaptive Human-Computer Interaction Strategies Through Reinforcement Learning in Complex Environments," arXiv preprint, arXiv:2510.27058, 2025.
- S. Lyu, M. Wang, H. Zhang, J. Zheng, J. Lin and X. Sun, "Integrating Structure-Aware Attention and Knowledge Graphs in Explainable Recommendation Systems," arXiv preprint, arXiv:2510.10109, 2025.
- L. Yan, Q. Wang and C. Liu, "Semantic Knowledge Graph Framework for Intelligent Threat Identification in IoT," 2025.
- Z. Qiu, F. Liu, Y. Wang, C. Hu, Z. Cheng and D. Wu, "Spatiotemporal Traffic Prediction in Distributed Backend Systems via Graph Neural Networks," arXiv preprint, arXiv:2510.15215, 2025.
- C. Hua, N. Lyu, C. Wang and T. Yuan, "Deep Learning Framework for Change-Point Detection in Cloud-Native Kubernetes Node Metrics Using Transformer Architecture," 2025.
- D. W. A. S. Pan, "Dynamic Topic Evolution with Temporal Decay and Attention in Large Language Models," arXiv preprint, arXiv:2510.10613, 2025.
- Y. Li, S. Han, S. Wang, M. Wang and R. Meng, "Collaborative Evolution of Intelligent Agents in Large-Scale Microservice Systems," arXiv preprint, arXiv:2508.20508, 2025.
- X. Hu, Y. Kang, G. Yao, T. Kang, M. Wang and H. Liu, "Dynamic Prompt Fusion for Multi-Task and Cross-Domain Adaptation in LLMs," arXiv preprint, arXiv:2509.18113, 2025.
- H. Wang, C. Nie and C. Chiang, "Attention-Driven Deep Learning Framework for Intelligent Anomaly Detection in ETL Processes," 2025.
- H. Fan, Y. Yi, W. Xu, Y. Wu, S. Long and Y. Wang, "Intelligent Credit Fraud Detection with Meta-Learning: Addressing Sample Scarcity and Evolving Patterns," 2025.
- B. Kasasbeh, B. Aldabaybah and H. Ahmad, "Multilayer Perceptron Artificial Neural Networks-Based Model for Credit Card Fraud Detection," Indonesian Journal of Electrical Engineering and Computer Science, vol. 26, no. 1, pp. 362–373, 2022.
- C. Jiang, J. Song, G. Liu et al., "Credit Card Fraud Detection: A Novel Approach Using Aggregation Strategy and Feedback Mechanism," IEEE Internet of Things Journal, vol. 5, no. 5, pp. 3637–3647, 2018. [CrossRef]
- A. Khadka, "Credit Card Fraud Detection via Model Retraining and Fine-Tuning," 2025.
- C. Yu, Y. Xu, J. Cao et al., "Credit Card Fraud Detection Using Advanced Transformer Model," Proceedings of the 2024 IEEE International Conference on Metaverse Computing, Networking, and Applications, pp. 343–350, 2024.
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).