1. Introduction
In the context of increasingly complex and diversified global financial markets, risk management has become a central issue in asset allocation and investment decisions [
1]. The volatility, nonlinearity, and highly dynamic correlations among assets present significant challenges to traditional risk measurement methods. In multi-asset portfolio management, investors are not only concerned with the risk levels of individual assets but also with the impact of asset interdependence on overall portfolio risk [
2]. Therefore, developing a risk prediction model that can effectively capture dynamic dependencies among assets and demonstrate strong temporal modeling capabilities has become a key focus in financial technology research.
Copula functions are widely used tools for modeling dependency structures in finance. They have attracted significant attention due to their ability to capture correlations among variables without relying on marginal distribution assumptions. Compared to traditional linear correlation methods, copula functions offer greater flexibility. They can adapt to inconsistent marginal distributions and exhibit strong performance in capturing tail dependencies in financial data. This is particularly valuable during extreme market conditions, where asymmetric relationships and tail dependencies between assets are more pronounced. However, copula models in static settings fail to reflect the evolving dynamics of financial markets. Thus, integrating Copula functions with temporal modeling techniques is essential for improving the accuracy and foresight of risk predictions [
3]. In recent years, the development of deep learning has led to widespread adoption of Long Short-Term Memory (LSTM) networks for time series forecasting. LSTM has shown strong performance in modeling nonlinear and non-stationary financial time series. Its gated structure mitigates the vanishing gradient problem faced by traditional recurrent neural networks, enabling effective long-term dependency modeling. LSTM has achieved notable results in HCI [
4,
5], market forecasting, asset pricing, and algorithmic trading. However, LSTM alone lacks the ability to model inter-asset dependencies. In multi-asset scenarios, this limitation may result in biased risk estimates. Therefore, combining LSTM with Copula functions offers a theoretically sound and practically feasible approach to integrated risk modeling [
6].
From a macro perspective, accurate risk prediction is vital not only for safeguarding investor capital and optimizing asset allocation but also for maintaining the stability of the financial system. In multi-asset portfolios, ignoring the transmission mechanisms of risk and the temporal evolution of asset relationships may lead to collective losses during systemic risk events, creating a "black swan" effect. Hence, a model capable of capturing both complex inter-asset structures and temporal dynamics is essential. It enhances the scientific basis of portfolio construction and strengthens the resilience of investment strategies. This study also introduces a new paradigm in financial technology by integrating statistical modeling with deep learning, promoting intelligent and refined risk management methods [
7].
2. Method
In recent years, advanced neural architectures have become central to modeling complex financial dynamics. One Transformer-based method [
8] demonstrated improved capability in capturing both temporal dependencies and multi-dimensional features, reinforcing the importance of deep temporal models for financial applications. Complementing this, a multimodal factor modeling approach [
9] incorporated heterogeneous data streams, offering support to this paper’s strategy of fusing marginal and dependency modeling through deep learning and statistical tools.
Efforts in high-frequency financial environments further showcase the relevance of deep learning in complex, sparse datasets. A deep anomaly detection framework [
10] for trading systems exemplifies how robust feature learning can identify irregular patterns, aligning well with the high-dimensional settings addressed in this study. Additionally, a probabilistic approach [
11] leveraging graphical models and variational inference provided strategies to handle class imbalance—paralleling the current work’s focus on tail-risk modeling and rare event prediction using Copula structures. Graph-based methodologies have also informed this paper’s approach to modeling inter-asset dependencies. One study [
12] used graph neural networks to uncover latent relational patterns within user behavior data—methods which inspire the interdependence modeling of assets within a portfolio. Building on this, a graph attention mechanism for heterogeneous data [
13] offers a blueprint for learning risk propagation across structured financial entities. Further enhancement was achieved through a self-supervised GNN approach [
14], which demonstrated efficient representation learning from complex and diverse relational structures—concepts closely tied to joint distribution modeling in Copula functions.
From a broader perspective, developments in deep learning frameworks contribute significantly to the scalability and adaptability of integrated models like the one proposed. A dynamic Transformer-based mining framework [
15] introduced context-awareness into learning systems, a principle adopted in improving interpretability within this hybrid model. Moreover, techniques for efficient model deployment [
16], such as distillation and fine-tuning, address real-world implementation challenges of deep learning systems in finance. Architectural optimization through neural architecture search [
17] further informs future extensions of this model, especially in tailoring hybrid systems for varying market conditions. Collectively, these methodologies not only provide technical insights into modeling temporal sequences and dependencies but also lay the groundwork for scalable and interpretable systems like the LSTM-Copula hybrid proposed in this research.
3. Method
The multi-asset portfolio risk prediction model constructed in this study mainly includes two core modules: a time series modeling module and an asset dependency structure modeling module. Its model architecture is shown in
Figure 1.
Figure 1 presents the comprehensive architecture of the proposed multi-asset portfolio risk prediction model, which integrates a time series modeling module and a dependency structure modeling module. The design builds upon prior work in deep learning and statistical modeling. Inspired by Liang et al. [
18], who emphasize robust feature extraction from complex financial data through neural networks, the left-side module employs Long Short-Term Memory (LSTM) networks to model the historical return sequences of individual assets. These networks dynamically extract crucial temporal features and generate the marginal distribution parameters necessary for dependency modeling. On the right side, the dependency structure module is influenced by insights from Wang [
19] and Feng [
20], particularly regarding the treatment of nonlinear interdependencies and tail risks in financial systems. This module utilizes Copula functions to effectively capture the joint distribution across multiple assets, emphasizing flexibility in modeling tail co-movements. By integrating the asset-specific marginal distributions and Copula-based dependency structure, and subsequently applying asset weights, the model produces an aggregated risk measure for the portfolio. This two-pronged architecture capitalizes on LSTM’s sequence learning capacity and the Copula function’s power in capturing high-dimensional dependencies, thereby enhancing the precision and robustness of portfolio risk estimation under real-world financial complexities.
First, considering that financial asset price series usually exhibit strong nonlinearity, non-stationarity and long-term dependence characteristics, the LSTM network is used to model the yield time series of each asset. Assuming that the yield of asset i at time t is , and the input sequence is a vector of length T, the LSTM network dynamically models the sequence through a nonlinear gating mechanism to obtain a hidden state output , which can be regarded as a risk characterization vector of the asset at the current moment. This process not only retains important patterns in historical information, but can also be used to estimate marginal distribution parameters in subsequent Copula modeling.
To effectively model the intricate dependency structure between financial assets, this study adopts the Copula function to construct the joint distribution across multiple asset returns. Drawing on recent developments in financial data modeling and representation learning [
21], as well as methods that emphasize the importance of capturing temporal and multi-dimensional dependencies [
22], and techniques for handling heterogeneous relationships in complex financial systems [
23], this approach offers a flexible and robust mechanism for modeling nonlinear and tail dependencies—crucial for accurate portfolio risk assessment.According to Sklar's theorem, any multivariate joint distribution function
, where
, can be decomposed into a combination of its marginal distribution function
and a Copula function
, that is:
Considering the importance of tail correlation in financial markets, this paper uses the t-Copula function with tail dependency modeling capability, which is in the form of:
represents the cumulative distribution function of the multivariate t distribution with v degrees of freedom and covariance matrix , and is the inverse function of the t distribution. The marginal distribution can be estimated by the prediction residual sequence output by the LSTM network, and further distribution fitting is performed to make the Copula function modeling more in line with the structural characteristics of the actual data.
Combining the modeling results of LSTM and Copula functions, this paper finally conducts a quantitative assessment of the risk level of multi-asset portfolios based on the joint distribution function. Assuming the portfolio weight vector is
, the portfolio return is:
Based on the joint distribution , the value-at-risk and conditional value-at-risk (VaR) of the portfolio return at a given confidence level can be calculated, thereby achieving dynamic prediction and control of potential losses. This method combines the advantages of LSTM in modeling temporal dependencies and the ability of Copula in capturing complex correlation structures between variables, providing a more flexible and theoretically supported solution for modeling high-dimensional asset portfolio risks.
4. Experiment
4.1. Datasets
This study utilizes a dataset sourced from Yahoo Finance, comprising daily closing prices of ten representative U.S. financial assets—including stocks, bonds, gold, and crude oil—from January 1, 2013, to December 31, 2023 (approximately 2,600 trading days). The selected assets ensure broad market and industry coverage, supporting a realistic multi-asset portfolio construction. Raw prices were converted to log returns and standardized to address scale differences. Missing values due to market suspensions were filled using linear interpolation, and extreme outliers were removed to ensure input stability. The data were split chronologically into training (80%) and testing (20%) sets to prevent future data leakage. The dataset reflects typical financial time series properties such as volatility, non-stationarity, and nonlinear dependencies, posing challenges for risk modeling.
4.2. Experimental Results
This paper first gives a performance comparison of different models in multi-asset portfolio risk prediction, and the experimental results are shown in
Table 1.
As shown in
Table 1, the LSTM-Copula model achieves the best performance in multi-asset risk prediction, with a VaR coverage of 96.2% and CVaR accuracy of 94.8%, demonstrating its strength in capturing temporal dynamics and inter-asset dependencies. The GRU + VAE-Copula model performs closely in CVaR accuracy, benefiting from VAE’s marginal distribution learning and GRU’s efficiency, while TCN-Copula captures local patterns but lacks global modeling capacity. In contrast, Transformer-VaR shows high KLD due to poor joint distribution modeling, and Diffusion-RiskNet suffers from training instability.
Figure 2 further illustrates model stability across varying portfolio sizes.
Figure 2 illustrates that the LSTM-Copula model maintains strong risk prediction performance across varying portfolio sizes, with the best results achieved when the asset count is seven, where both VaR coverage and CVaR accuracy reach peak levels. This indicates that a medium-sized portfolio offers sufficient informational richness without introducing excessive noise, allowing the model to capture market risk structures more effectively. When the number of assets increases to 10 or 15, a slight decline in CVaR accuracy and distribution fitting (KLD) is observed, suggesting increased modeling difficulty due to more complex inter-asset dependencies. In contrast, smaller portfolios with 3 or 5 assets provide limited input diversity, weakening the model’s ability to capture latent linkage risks. These results highlight the sensitivity of the model to portfolio size and emphasize the need for balanced asset selection.
Figure 3 further reports the model’s risk coverage ability based on prediction confidence intervals.
Figure 3 shows the actual VaR coverage rates of different models across multiple confidence levels. Compared to the ideal coverage line, the LSTM-Copula model maintains a high level of consistency across all intervals. At confidence levels above 0.95, its coverage rate closely approaches or slightly exceeds the theoretical value. This demonstrates the model’s refined and stable ability to define risk boundaries.
The GRU + VAE-Copula model exhibits a similar trend to LSTM-Copula and shows a certain level of stability. However, slight deviations are observed at some intervals, particularly at lower confidence levels. This suggests a mildly conservative bias in its estimation of risk intervals. Nevertheless, its performance at higher confidence levels remains within a reasonable range. The model demonstrates good tail risk identification ability and is suitable for supporting risk management decisions.
The Transformer-VaR model shows lower coverage rates than the other two models at all confidence levels. The deviation is especially notable at high confidence levels. This reflects a tendency to underestimate risk when constructing risk intervals. The result indicates that without incorporating explicit dependency modeling, Transformer-VaR struggles to capture complex inter-asset structures. This limits its effectiveness in high-precision risk forecasting tasks.
5. Conclusions
This study proposes a multi-asset portfolio risk prediction model that integrates Long Short-Term Memory (LSTM) networks with Copula functions. The goal is to capture both the temporal dependencies in asset return series and the complex inter-asset correlation structure. LSTM is used for marginal distribution modeling, while Copula functions are employed to construct the joint distribution. This enables dynamic prediction and evaluation of key risk metrics such as Value-at-Risk (VaR) and Conditional Value-at-Risk (CVaR). The model shows superior stability and predictive accuracy across multiple experiments.
The experimental design includes model comparison, sensitivity analysis with respect to the number of assets, and evaluation of risk coverage under different confidence intervals. These tests comprehensively verify the model's adaptability to high-dimensional and sparse financial data. Results show that the LSTM-Copula model outperforms other advanced models in boundary control, tail risk identification, and distribution fitting. This highlights the importance and potential of combining deep learning with statistical dependency modeling in financial risk analysis. In addition, the experiments involving changes in the number of assets indicate that the model performs best with moderately sized portfolios. Its fitting capacity shows certain limitations in high-dimensional settings. This suggests that future work may focus on enhancing model scalability and optimizing its structure to better handle complex market conditions. The confidence interval experiments also demonstrate the consistency of the LSTM-Copula model in risk control across different levels, offering practical support for tolerance-based risk regulation. This study not only extends the theoretical landscape of risk modeling but also provides a scalable and interpretable framework for practical financial risk management.
References
- Z. Hu, et al., “Research on financial multi-asset portfolio risk prediction model based on convolutional neural networks and image processing,” International Journal of Innovative Research in Engineering and Management, vol. 11, no. 6, 2024.
- H. Chaudhari, et al., “Multi-Asset Portfolio Management System: Integrating Diverse Investments for Optimal Returns and Risk Mitigation”, Proceedings of the 2024 ASU International Conference in Emerging Technologies for Sustainability and Intelligent Systems (ICETSIS), pp. –, 2024.
- F. J. Fabozzi, “Editor’s Introduction for the 2024 Special Issue on Multi-Asset Strategies and Asset Allocation,” Journal of Portfolio Management, vol. 50, no. 5, 2024.
- Q. Sun, “A Visual Communication Optimization Method for Human-Computer Interaction Interfaces Using Fuzzy Logic and Wavelet Transform”, Proceedings of the 2024 4th International Conference on Communication Technology and Information Technology (ICCTIT), pp. 140-144, 2024.
- S. Duan, “Human-Computer Interaction in Smart Devices: Leveraging Sentiment Analysis and Knowledge Graphs for Personalized User Experiences”, Proceedings of the 2024 4th International Conference on Electronic Information Engineering and Computer Communication (EIECC), pp. 1294-1298, 2024.
- J. Kim, et al., “HADAPS: Hierarchical Adaptive Multi-Asset Portfolio Selection,” IEEE Access, vol. 11, pp. 73394-73402, 2023.
- H. Li, et al., “Multi-asset portfolio model optimization based on mean multifractal detrended cross correlation analysis,” Mathematical and Computer Modelling of Dynamical Systems, vol. 30, no. 1, pp. 736-757, 2024.
- Y. Wang, “Time-Series Premium Risk Prediction via Bidirectional Transformer,” Transactions on Computational and Scientific Methods, vol. 5, no. 2, 2025.
- J. Liu, “Multimodal Data-Driven Factor Models for Stock Market Forecasting,” Journal of Computer Technology and Software, vol. 4, no. 2, 2025.
- Q. Bao, J. Wang, H. Gong, Y. Zhang, X. Guo and H. Feng, “A Deep Learning Approach to Anomaly Detection in High-Frequency Trading Data,” arXiv preprint arXiv:2504.00287, 2025.
- Y. Lou, J. Liu, Y. Sheng, J. Wang, Y. Zhang and Y. Ren, “Addressing Class Imbalance with Probabilistic Graphical Models and Variational Inference,” arXiv preprint arXiv:2504.05758, 2025.
- Y. Zhang, “Social Network User Profiling for Anomaly Detection Based on Graph Neural Networks,” arXiv preprint arXiv:2503.19380, 2025.
- X. Du, “Audit Fraud Detection via EfficiencyNet with Separable Convolution and Self-Attention,” Transactions on Computational and Scientific Methods, vol. 5, no. 2, 2025.
- J. Wei, Y. Liu, X. Huang, X. Zhang, W. Liu and X. Yan, “Self-Supervised Graph Neural Networks for Enhanced Feature Extraction in Heterogeneous Information Networks,” Proceedings of the 2024 5th International Conference on Machine Learning and Computer Application (ICMLCA), pp. 272-276, 2024.
- J. Liu, Y. Zhang, Y. Sheng, Y. Lou, H. Wang and B. Yang, “Context-Aware Rule Mining Using a Dynamic Transformer-Based Framework,” arXiv preprint arXiv:2503.11125, 2025.
- Kai, L. Zhu and J. Gong, “Efficient Compression of Large Language Models with Distillation and Fine-Tuning,” Journal of Computer Science and Software Applications, vol. 3, no. 4, pp. 30-38, 2023.
- X. Yan, J. X. Yan, J. Du, L. Wang, Y. Liang, J. Hu and B. Wang, “The Synergistic Role of Deep Learning and Neural Architecture Search in Advancing Artificial Intelligence,” Proceedings of the 2024 International Conference on Electronics and Devices, Computational Science (ICEDCS), pp. 452-456, Sep. 2024.
- Y. Liang, L. Dai, S. Shi, M. Dai, J. Du and H. Wang, “Contrastive and Variational Approaches in Self-Supervised Learning for Complex Data Mining,” arXiv preprint arXiv:2504.04032, 2025.
- Y. Wang, “A Data Balancing and Ensemble Learning Approach for Credit Card Fraud Detection,” arXiv preprint arXiv:2503.21160, 2025.
- P. Feng, “Hybrid BiLSTM-Transformer Model for Identifying Fraudulent Transactions in Financial Systems,” Journal of Computer Science and Software Applications, vol. 5, no. 3, 2025.
- Q. Sha, T. Tang, X. Du, J. Liu, Y. Wang and Y. Sheng, “Detecting Credit Card Fraud via Heterogeneous Graph Neural Networks with Graph Attention,” arXiv preprint arXiv:2504.08183, 2025.
- Y. Yao, “Stock Price Prediction Using an Improved Transformer Model: Capturing Temporal Dependencies and Multi-Dimensional Features,” Journal of Computer Science and Software Applications, vol. 5, no. 2, 2024.
- J. Wang, “Credit Card Fraud Detection via Hierarchical Multi-Source Data Fusion and Dropout Regularization,” Transactions on Computational and Scientific Methods, vol. 5, no. 1, 2025.
- X. Li, et al., “Short-term Wind Speed Prediction Based on Time-varying Filtered Empirical Modal Decomposition with Spatio-temporal Correlation Error Correction”, Proceedings of the 2024 36th Chinese Control and Decision Conference (CCDC), pp. –, 2024.
- W. Sun, et al., “Advanced Risk Prediction and Stability Assessment of Banks Using Time Series Transformer Models,” arXiv preprint arXiv:2412.03606, 2024.
- X. Ruan, et al., “Real-time risk prediction of colorectal surgery-related post-surgical complications using GRU-D model,” Journal of Biomedical Informatics, vol. 135, 104202, 2022.
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).