Submitted:
08 August 2025
Posted:
13 August 2025
Read the latest preprint version here
Abstract
Keywords:
1. Introduction
2. Related Work
2.1. Classical Neural Network Approaches
2.2. Hybrid Classical Models
2.3. Quantum Approaches
3. Design of Quantum LSTM Fusion Model
3.1. Quantum Encoding and Hybrid Computing Layer
3.1.1. Quantum Convolution Module (QCNN)
3.1.2. Quantify LSTM Unit
3.1.3. Quantum Attention Module
3.2. Key Technology Implementation
3.2.1. Quantum Activation Function
3.2.2. Parameter Optimization Strategy

3.3.3. Regularization Mechanism
- Quantum convolutional layer: Randomly skip 30% of CNOT entanglement gates.
- Quantum LSTM: Dropout is applied to the rotation gates of the forget gate and input gate.
- Quantum Attention: Controlled Phase Gates in Fidelity Computing.
4. Experiments and Results
4.1. Experimental Setup
- Quantum encoding qubits n=8: chosen to fully encode the 138-node road-network features while remaining within the 7-qubit-plus-1-ancilla capacity of IBM’s ibm_perth backend.
- QGCN depth K=3: a grid search over {2,3,4} revealed that two layers under-capture long-range spatial dependencies and four layers incur>2% fidelity loss under realistic gate error rates (≈1×10-3); three layers give the lowest validation MAE.
- QNG learning-rate η=0.01: selected from a log-linear grid {0.005,0.01,0.02}; 0.01 yields fastest convergence without overshoot.
- Quantum Dropout probability p=0.1: fine-tuned in {0.05,0.1,0.2,0.3}; 0.1 minimizes validation loss while keeping fidelity drop<3%.
- Gate-pruning threshold θ=0.05: derived from the 5th percentile of Pauli-gradient magnitudes; removing gates below this value reduces circuit depth by 22 % with<1% accuracy loss.
- Classical LSTM hidden size 64: tuned within {32,64,128}; 64 units balance model capacity and Jetson AGX Orin memory budget (<400 MB).
4.2. Spatiotemporal Prediction
4.2.1. Performance Comparison of Various Models
- Establishing entanglement between nodes through CNOT gates:
- When node A fails, the quantum state of its associated node B still contains information about A:
- Experimental measured information retention rate:
4.2.2. Statistical Significance Analysis
- Data split: the same chronological train/validation/test partition (70%/15%/15%) was kept across all runs.
- Runs: each model (QGCN-LSTM and 7 baselines) was independently trained 5 times with different random seeds (2023-2027).
- Metrics: at every run we recorded MAE and sMAPE on the morning-rush subset (7500 detectors, 138 nodes).
- Sample size: 5 paired observations per model pair (df=4 for the t-test).
| Model | ΔMAE(mean±SD) | t(4) | p-value | 95% CI ΔsMAPE | ΔMAE(mean±SD) | t(4) | p-value | 95% CI ΔsMAPE |
|---|---|---|---|---|---|---|---|---|
| HA | -18.2±0.31 | -131.4 | <0.001 | [-18.6,-17.8] | -12.6±0.22 | -128.2 | <0.001 | [-13.1,-12.1] |
| GCN-LSTM | -7.4±0.18 | -92.0 | <0.001 | [-7.7,-7.1] | -6.2±0.16 | -87.0 | <0.001 | [-6.6,-5.8] |
| GraphWaveNet | -3.8±0.15 | -56.7 | <0.001 | [-4.0,-3.6] | -3.5±0.13 | -60.3 | <0.001 | [-3.8,-3.2] |
| DCRNN | -4.3±0.17 | -56.6 | <0.001 | [-4.6,-4.0] | -4.0±0.14 | -64.0 | <0.001 | [-4.3,-3.7] |
| ST-Transformer | -3.6±0.16 | -50.0 | <0.001 | [-3.9,-3.3] | -3.7±0.15 | -55.3 | <0.001 | [-4.0,-3.4] |
| QSTMixer | -2.9±0.13 | -50 | <0.001 | [-3.1,-2.7] | -2.8±0.12 | -52.3 | <0.001 | [-3.0,-2.6] |
| QG-TCN | -2.2±0.12 | -41.0 | <0.001 | [-2.4,-2.0] | -2.1±0.11 | -43.0 | <0.001 | [-2.4,-1.8] |
4.3. Dynamic Response Analysis of Sudden Congestion Events
| Incident type | n | Mean lead-time (min) | 95%CI (min) | Median (min) | Range (min) |
|---|---|---|---|---|---|
| Accident | 5 | 33.8 | [30.1, 37.5] | 34 | 29–38 |
| Weather | 4 | 31.2 | [27.4, 35.0] | 31 | 27–36 |
| Event | 3 | 35.7 | [32.8, 38.6] | 36 | 33–39 |
| Overall | 12 | 33.5 | [31.8, 35.2] | 34 | 27–39 |
4.4. Edge Computing Efficiency Analysis
| Model | Inference latency (ms) | Peak memory (MB) | Training energy consumption (W·h) |
|---|---|---|---|
| ST-Transformer | 142±12 | 1103±85 | 5.4 |
| GCN-LSTM | 89±8 | 682±42 | 3.9 |
| GraphWaveNet | 121±11 | 918±70 | 4.9 |
| DCRNN | 115±10 | 865±65 | 4.7 |
| QGNN | 76±6 | 521±38 | 2.4 |
| QGCN-LSTM | 48±4 | 327±25 | 1.8 |
4.5. Ablation Experiment and Attribution
4.6. Sensitivity Analysis of Quantum Hyper-Parameters
- n-qubits ∈ {6, 8, 10}
- QGCN depth K ∈ {2, 3, 4}
- Quantum-Dropout retain probability p ∈ {0.7, 0.8, 0.9, 1.0}
5. Conclusions and Future Work
5.1. Conclusions
5.2. Future Work
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Wang, X.H.; Zhang, S.; Chen, Y.; He, L.Y.; Ren, Y.M.; Zhang, Z.; Li, J.; Zhang, S.Q. Air quality forecasting using a spatiotemporal hybrid deep learning model based on VMD–GAT–BiLSTM. Scientifc Reports. 2024, 14, 17841. [Google Scholar] [CrossRef]
- Barra, S; Carta, S.M.; Corriga, A; Podda, A.S.; Recupero, D.R. Deep learning and time series-to-image encoding for financial forecasting. IEEE/CAA Journal of Automatica Sinica. 2020, 7, 683–692. [Google Scholar] [CrossRef]
- ALijoyo, F.A.; Gongada, T.N.; Kaur, C.; Mageswari, N.; Sekhar, J.C.; Ramesh, J.V.N.; El-Ebiary, Y.A.B.; Ulmas, Z. Advanced hybrid CNN-Bi-LSTM model augmented with GA and FFO for enhanced cyclone intensity forecasting. Alexandria Engineering Journal. 2024, 92, 346–357. [Google Scholar] [CrossRef]
- Li, J.Y.; Wang, X.D.; He, Q.X. Application and performance optimization of CNN enhanced Informer model in industrial time series prediction. Journal of Computer Applications. 2024, 44, 79–83. [Google Scholar]
- Seabe, P.L.; Moutsinga, C.R.B.; Pindza, E. Sentiment-driven cryptocurrency forecasting: analyzing LSTM, GRU, Bi-LSTM, and temporal attention model (TAM). Social Network Analysis and Mining. 2025, 15, 52–52. [Google Scholar] [CrossRef]
- Ito, K.; YAmamoto, N.; Morino, K. Sequential prediction of hall thruster performance using echo state network models. Transactions of the Japan Society for Aeronautical and Space Sciences. 2024, 67, 1–11. [Google Scholar] [CrossRef]
- Sherstinsky, A. Fundamentals of recurrent neural network (RNN) and long short-term memory (LSTM) network. Physica D: Nonlinear Phenomena. 2020, 404, 132306. [Google Scholar] [CrossRef]
- Krishna, M.V.; Swaroopa, K.; SwarnaLatha, G.; Yasaswani, V. Crop yield prediction in India based on mayfly optimization empowered attention-bi-directional long short-term memory (LSTM). Multimedia Tools & Applications, 2024, 83, 29841. [Google Scholar] [CrossRef]
- Yang, G.; Chao, S.Y.; Nie, M.; Liu, Y.H.; Zhang, M.L. Construction method of hybrid quantum long-short term memory neural network for image classification. Acta Phys. Sin. 2023, 72, 058901. [Google Scholar] [CrossRef]
- Zhang, H.; Wang, W.G. Neural network ensemble models for financial time series forecasting. Journal of Beijing University of Posts and Telecommunications. 2025, 48, 127–132. [Google Scholar]
- Zhu, X.G.; Zou, F.F.; Li, S.H. Enhancing air quality prediction with an adaptive PSO-Optimized CNN-Bi-LSTM model. Applied Sciences. 2024, 14, 5787–5787. [Google Scholar] [CrossRef]
- Caro, M.C.; Huang, H.Y.; Cerezo, M.; Sharma, K.; Sornborger, A.; Cincio, L.; Coles, P.J. Generalization in quantum machine learning from few training data. Nature Communications. 2022, 13, 4919–4919. [Google Scholar] [CrossRef]
- Sharma, K.; Cerezo, M.; Cincio, L.; Coles, P.J. Trainability of dissipative perceptron-based quantum neural networks. Physical review letters. 2022, 128, 180505–180505. [Google Scholar] [CrossRef] [PubMed]
- Maxwell, T.W.; Martin, S.; Muhammad, U. Reflection equivariant quantum neural networks for enhanced image classification. Learn.: Sci. Technol. 2023, 4, 035027. [Google Scholar] [CrossRef]
- Kulkarni, V.; Pawale, S.; Kharat, A. A classical-quantum convolutional neural network for detecting pneumonia from chest radiographs. Neural Comput & Applic. 2023, 35, 15503–15510. [Google Scholar] [CrossRef]
- Xin, J.; Wei, Z.Y.; Dong, Y.J.; Ni, W. LSTM-RNN-FNN model for load forecasting based on deleuze’s assemblage perspective. Frontiers in Energy Research. 2022, 10. [Google Scholar] [CrossRef]
- Chumakova, E.V.; Korneev, D.G.; Chernova, T.A.; Gasparian, M.S.; Ponomarev, A.A. Comparison of the application of FNN and LSTM based on the use of modules of artificial neural networks in generating an individual knowledge testing trajectory. Journal Européen des Systèmes Automatisés. 2023, 56, 213–220. [Google Scholar] [CrossRef]
- Zhang, F.Y.; Yin, J.L.; Wu, N.; Hu, X.Y.; Sun, S.K.; Wang, Y.B. A dual-path model merging CNN and RNN with attention mechanism for crop classification. European Journal of Agronomy. 2024, 159, 127273. [Google Scholar] [CrossRef]
- Ghatage, N.B.; Patil, P.D.; Shinde, S. Lightweight RNN-Based Model for Adaptive Time Series Forecasting with Concept Drift Detection in Smart Homes. Journal Européen des Systèmes Automatisés. 2023, 56, 981–991. [Google Scholar] [CrossRef]
- Hanen, B.; Ali, B.A.; Riadh, F.I. A Bi-GRU-based encoder–decoder framework for multivariate time series forecasting. Soft Computing. 2024, 28, 6775–6786. [Google Scholar] [CrossRef]
- Agarwal, H.; Mahajan, G.; Shrotriya, A.; Shekhawat, D. Predictive data analysis: leveraging RNN and LSTM techniques for time series dataset. Procedia Computer Science. 2024, 235, 979–989. [Google Scholar] [CrossRef]
- Tian, G.; Zhao, J.; Qu, H.B. A novel CNN-LSTM model with attention mechanism for online monitoring of moisture content in fluidized bed granulation process based on near-infrared spectroscopy. Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy. 2025, 340, 126361. [Google Scholar] [CrossRef] [PubMed]
- Das, P.P.; Wiese, L.; Mast, M.; Böhnke, J.; Wulff, A.; Marschollek, M.; et al. An attention-based bidirectional LSTM-CNN architecture for the early prediction of sepsis. Int J Data Sci Anal. 2024, 1–15. [Google Scholar] [CrossRef]
- Piperno, S.; Ceschini, A.; Chang, S.Y.; Grossi, M.; Vallecorsa, S.; Panella, M. A study on quantum graph neural networks applied to molecular physics. Physica Scripta. 2025, 100, 065126. [Google Scholar] [CrossRef]
- Ghorpade, S.V.S.; Pardeshi, S.A. LSTM-QDCNN: long short-term memory and quantum dilated convolutional neural network enabled occlusion percentage prediction. Australian Journal of Electrical and Electronics Engineering. 2025, 22, 1–14. [Google Scholar] [CrossRef]
- Li, Y.N.; Wang, Z.M.; Xing, R.P.; Shao, C.H.; Shi, S.S.; Li, J.X.; Zhong, G.Q.; Gu, Y.J. Quantum gated recurrent neural networks. IEEE transactions on pattern analysis and machine intelligence. 2025, 47, 2493–2504. [Google Scholar] [CrossRef]
- Pesah, A.; Cerezo, M.; Wang, S.; Volkoff, T.; Sornborger, A.T.; Coles, P.J. Absence of barren plateaus in quantum convolutional neural networks. . Physical Review X. 2021, 11, 041011. [Google Scholar] [CrossRef]
- Li, J.F.; Xin, Z.X.; Hu, J.R.; He, D.S. Quantum optimal control for Pauli operators based on spin-1/2 system. International Journal of Theoretical Physics. 2022, 61, 268. [Google Scholar] [CrossRef]
- Atif, T.A.; Chukwu, U.; Berwald, J.; Dridi, R. Quantum Natural Gradient with Efficient Backtracking Line Search. arXiv arXiv:2211.00615. [CrossRef]
- Marco, M. Fubini-Study metrics and Levi-Civita connections on quantum projective spaces. Advances in Mathematics. 2021, 393. [Google Scholar] [CrossRef]
- Naikoo, J.; Chhajlany, R.W.; Miranowicz, A. Enhanced quantum sensing with hybrid exceptional-diabolic singularities. New Journal of Physics. 2025, 27, 064505–064505. [Google Scholar] [CrossRef]











| Model | MAE | sMAPE (%) | SCI | Fault robustness (ΔMAE) |
|---|---|---|---|---|
| HA | 32.5 | 24.7 | 0.38 | +9.8 |
| GCN-LSTM | 21.7 | 18.3 | 0.72 | +6.2 |
| GraphWaveNet | 18.6 | 16.1 | 0.79 | +5.9 |
| DCRNN | 17.9 | 15.8 | 0.80 | +5.5 |
| ST-Transformer | 18.1 | 15.6 | 0.81 | +5.4 |
| QSTMixer | 17.2 | 14.9 | 0.83 | +4.1 |
| QG-TCN | 16.5 | 14.2 | 0.85 | +3.3 |
| QGCN-LSTM | 14.3 | 12.1 | 0.89 | +2.7 |
| Config | Dropout p | Prune θ | Avg. Fidelity | ΔMAE | #2-q gates |
|---|---|---|---|---|---|
| Baseline | 1.0 (off) | 0.00 (off) | 0.964±0.004 | - | 62 |
| Light | 0.9 | 0.05 | 0.952±0.006 | +0.8 veh/5min | 48(-23%) |
| Aggressive | 0.7 | 0.10 | 0.937±0.009 | +2.1 veh/5min | 34(-45%) |
| Variant model | MAE | sMAPE (%) | SCI | Congestion warning lead time (min) |
|---|---|---|---|---|
| QGCN-LSTM (complete) | 14.3 | 12.1 | 0.89 | 35 |
| w/o QGCN (remove quantum graph convolution) | 18.6 | 16.9 | 0.71 | 28 |
| w/o QA (remove quantum attention) | 16.2 | 14.3 | 0.85 | 18 |
| w/o VQC (classic gate control) | 19.4 | 17.8 | 0.83 | 22 |
| n | K | p | Range | MAE | SCI | Optimal |
|---|---|---|---|---|---|---|
| 6 | 2 | 0.9 | 16.1 | 0.84 | 32 | 0.973 |
| 6 | 3 | 0.9 | 15.4 | 0.86 | 48 | 0.965 |
| 8 | 2 | 0.9 | 15.0 | 0.87 | 42 | 0.968 |
| 8 | 3 | 0.9 | 14.3 | 0.89 | 62 | 0.964 |
| 8 | 4 | 0.9 | 14.1 | 0.90 | 82 | 0.952 |
| 8 | 3 | 0.7 | 14.9 | 0.88 | 62 | 0.962 |
| 10 | 3 | 0.9 | 14.0 | 0.90 | 74 | 0.949 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).