Submitted:
10 January 2025
Posted:
11 January 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
- Dynamic Spatiotemporal Attention Mechanism: This mechanism captures essential spatiotemporal dependencies between moving vehicles by integrating dynamic temporal and spatial attention. The model selectively identifies critical moments within historical traffic data and dynamically adjusts weight distributions according to real-time traffic conditions, enhancing its adaptability. This capability significantly improves prediction accuracy, especially in rapidly changing traffic environments, demonstrating the model’s effectiveness in handling complex, evolving scenarios.
- Driver Behavior Analysis Module: This module segments time-series driving data into behavioral units, enhancing prediction accuracy by incorporating historical trajectories. Driving primitives are extracted through a Sparse Inverse Covariance Clustering (SICC-SC) method, clustering trajectory data to reveal distinct behaviors without predefined labels, achieving robust adaptability. Temporal consistency is maintained via a dynamic programming-based Expectation-Maximization (EM) framework, while a Toeplitz graphical Lasso with ADMM refines model sparsity, enabling precise and interpretable risk analysis and demonstrating effectiveness in complex, multi-vehicle scenarios.
- Real-World Model Validation: Extensive data collection experiments were conducted across diverse road conditions in Wuhan, China, to assess the model’s generalization and effectiveness. Data from 20 drivers were collected over 12 kilometers of urban roads, 34 kilometers of expressways, and 45 kilometers of highways. After preprocessing, this dataset validated the HTSA-LSTM model's performance under real-world driving conditions, demonstrating its robustness and applicability in varied traffic environments.
2. Model Development
2.1. Traditional Models
2.2. Attention Mechanism
2.2.1. Temporal Attention Mechanism
2.2.2. Spatial Attention Mechanism
2.3. Driving Style Analysis Module
2.4. Coupled Spatiotemporal Attention Model Considering Driving Styles
3. Experiments and Evaluation
3.1. Formatting of Mathematical Components
3.2. Dataset Segmentation and Model Training
3.3. Training Setup and Evaluation Metrics
3.4. Results and Comparison
- Constant Velocity (CV) [31]: An earlier trajectory prediction method based on physical principles, primarily using the vehicle's dynamic model and Kalman filter equations. Although the dataset lacks factors like road surface friction, making complex physics-based comparisons impractical, a simple physics-based model is constructed to predict the vehicle's trajectory under constant velocity.
- Simple-LSTM (S-LSTM) [32]: This model uses LSTM units within a recurrent neural network to analyze continuous time-series data for temporal vehicle trajectory predictions.
- Maneuver-LSTM (M-LSTM) [33]: Based on an encoder-decoder framework, this model uses historical trajectory data and lane structure, assigning confidence values to six maneuver categories to predict future motion through a multimodal distribution.
- Attention-LSTM (A-LSTM) [34]: This model uses an LSTM encoder with shared weights to generate vector encodings of each vehicle’s motion. An attention module models interactions based on the importance of adjacent vehicles, and the decoder uses these interaction vectors to predict trajectory distribution.
- Convolutional Social Pooling LSTM (CS-LSTM) [27]: This model, based on an LSTM encoder-decoder framework, employs a social convolutional network to accurately capture spatial interactions among agents, producing multimodal trajectory predictions.
- Spatial-Temporal Attention-LSTM(STA-LSTM) [1]: This model employs an LSTM encoder-decoder framework with integrated spatial and temporal attention mechanisms. It assigns weights to surrounding vehicles and historical time steps, capturing critical interactions for trajectory prediction. STA-LSTM enhances the accuracy of multimodal trajectory predictions by effectively modeling dynamic relationships over time and space.
3.5. Analysis of Temporal and Spatial Attention Allocation
3.5.1. Temporal Attention Allocation
3.5.2. Spatial Attention Allocation
3.6. Analysis of Driving Primitives
3.7. Vehicle Speed, Acceleration, and Trajectory Prediction
4. Trajectory Prediction Based on Real-World Driving Data
4.1. Analysis of HTSA-LSTM Prediction Results Based on Real-World Driving Data
4.2. Vehicle Speed and Acceleration Prediction
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Y. Xing et al., Driver lane change intention inference for intelligent vehicles: Framework, survey, and challenges,IEEE Trans. Veh, May 2019, Tech- nol., vol. 68, no. 5, pp. 4377–4390. [CrossRef]
- Messaoud, K., Yahiaoui, I., Verroust-Blondet, A., & Nashashibi, F. Relational recurrent neural networks for vehicle trajectory prediction. In Proceedings of the IEEE International Conference on Intelligent Transportation Systems (ITSC), Auckland, New Zealand, 27-30 October 2019; pp. 1813–1818.
- Messaoud, K., Yahiaoui, I., Verroust-Blondet, A., & Nashashibi, F. Attention based vehicle trajectory prediction. IEEE Transactions on Intelligent Vehicles 2020, 6(1), 1–10. [CrossRef]
- Cao, D. P., Wang, X., & Li, L. X. Future Directions of Intelligent Vehicles: Potentials, Possibilities, and Perspectives. IEEE Transactions on Intelligent Vehicles 2022, 7(1), 7-10. [CrossRef]
- Wang, L. L. , Chen, Z. G., & Wu, J. Vehicle trajectory prediction algorithm in vehicular network. Wireless Networks 2019, 25(4), 2143–2156. [Google Scholar] [CrossRef]
- Cui, H., Radosavljevic, V., Chou, F. C., Lin, T. H., Thi, N., & Huang, T. K. Multimodal trajectory predictions for autonomous driving using deep convolutional networks. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, 20-24 May 2019; pp. 2090–2096.
- Choi, S. , Kim, J., & Yeo, H. Attention-based recurrent neural network for urban vehicle trajectory prediction. Procedia Computer Science 2019, 151, 327–334. [Google Scholar] [CrossRef]
- Xue, H., Huynh, D. Q., & Reynolds, M. PoPPL: Pedestrian trajectory prediction by LSTM with automatic route class clustering. IEEE Transactions on Neural Networks and Learning Systems 2021, 32(1), 77–90. [CrossRef]
- Qian, L. P., Feng, A., Yu, N., Xu, W., & Wu, Y. Vehicular networking-enabled vehicle state prediction via two-level quantized adaptive Kalman filtering. IEEE Internet of Things Journal 2020, 7(8), 7181–7193. [CrossRef]
- Huang, Y. J., Du, J. T., Yang, Z. R., et al. A Survey on Trajectory-Prediction Methods for Autonomous Driving. IEEE Transactions on Intelligent Vehicles 2022, 7(3), 652-674. [CrossRef]
- Polychronopoulos, A., Tsogas, M., Amditis, A. J., et al. Sensor Fusion for Predicting Vehicles' Path for Collision Avoidance Systems. IEEE Transactions on Intelligent Transportation Systems 2007, 8(3), 549-562. [CrossRef]
- Jin, B., Jiu, B., Su, T., et al. Switched Kalman filter-interacting multiple model algorithm based on optimal autoregressive model for manoeuvring target tracking. IET Radar, Sonar & Navigation 2015, 9(2), 199-209. [CrossRef]
- Matthias, A., & Alexander, M. Comparison of Markov Chain Abstraction and Monte Carlo Simulation for the Safety Assessment of Autonomous Cars. IEEE Transactions on Intelligent Transportation Systems 2011, 12(4), 1237-1247. [CrossRef]
- Bahram, M., Hubmann, C., Lawitzky, A., et al. A combined model-and learning-based framework for interaction-aware maneuver prediction. IEEE Transactions on Intelligent Transportation Systems 2016, 17(6), 1538-1550. [CrossRef]
- Zhu, X. L., Hu, W., Deng, Z. J., et al. Interaction-aware cut-in trajectory prediction and risk assessment in mixed traffic. IEEE/CAA Journal of Automatica Sinica 2022, 9(10), 1752-1762. [CrossRef]
- Ho, J., & Ermon, S. Generative Adversarial Imitation Learning. In Proceedings of the 30th Annual Conference on Neural In-formation Processing Systems (NIPS), Barcelona, Spain, 5-10 December 2016; p. 19.
- Choi, S., Kim, J., & Yeo, H. TrajGAIL: Generating urban vehicle trajectories using generative adversarial imitation learning. Transportation Research Part C: Emerging Technologies 2021, 128, 103091. [CrossRef]
- Tang, C., & Salakhutdinov, R. R. Multiple futures prediction. Advances in Neural Information Processing Systems 2019, 32, 15424-15434.
- Liu, Y. C., Zhang, J. H., Fang, L. J., et al. Multimodal Motion Prediction with Stacked Transformers. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 19-25 June 2021; pp. 7577-7586.
- Li, M., Tong, P., & Li, M. Traffic flow prediction with vehicle trajectories. In Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada, 2-9 February 2021; 35(1), 294-302.
- Jo, E., Sunwoo, M., & Lee, M. Vehicle trajectory prediction using hierarchical graph neural network for considering interaction among multimodal maneuvers. Sensors 2021, 21(16), 5354. [CrossRef]
- Li, R. N., Qin, Y., Wang, J. B., et al. AMGB: Trajectory prediction using attention-based mechanism GCN-BiLSTM in IOV. Pattern Recognition Letters 2023, 169, 17-27 AMGB: Trajectory prediction using attention-based mechanism GCN-BiLSTM in IOV. [CrossRef]
- Mo, X. Y., Huang, Z. Y., Yang, X., et al. Multi-agent trajectory prediction with heterogeneous edge-enhanced graph attention network. IEEE Transactions on Intelligent Transportation Systems 2022, 23(7), 9554-9567. [CrossRef]
- Shi, J.; Sun, D.; Guo, B. Vehicle Trajectory Prediction Based on Graph Convolutional Networks in Connected Vehicle Environment. Appl. Sci. 2023, 13, 13192. [CrossRef]
- Li, H.; Ren, Y.; Li, K.; Chao, W. Trajectory Prediction with Attention-Based Spatial–Temporal Graph Convolutional Networks for Autonomous Driving. Appl. Sci. 2023, 13, 12580. [CrossRef]
- Ni, Q.; Peng, W.; Zhu, Y.; Ye, R. A Novel Trajectory Feature-Boosting Network for Trajectory Prediction. Entropy 2023, 25, 1100. [CrossRef]
- L. Lin, W. Li, H. Bi and L. Qin, "Vehicle Trajectory Prediction Using LSTMs With Spatial–Temporal Attention Mechanisms, IEEE Intelligent Transportation Systems Magazine, March-April 2022, vol. 14, no. 2, pp. 197-208. [CrossRef]
- Nachiket, D., & Mohan, M. T. Convolutional Social Pooling for Vehicle Trajectory Prediction. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Salt Lake City, UT, USA, 18-22 June 2018; pp. 1468-1476.
- Robert, C. M. On Log-Likelihood-Ratios and the Significance of Rare Events. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP), Barcelona, Spain, 25-26 July 2004; pp. 333-340.
- Wei, Y. , Jia, B. , Xiao, X. , & Yao, B. . Driving style analysis in car-following scenarios based on semantic analysis method. 2023 7th CAA International Conference on Vehicular Control and Intelligence (CVCI).
- Samer, A., & Fawzi, N. Real time trajectory prediction for collision risk estimation between vehicles. In Proceedings of the 2009 IEEE 5th International Conference on Intelligent Computer Communication and Processing (ICCP), Cluj-Napoca, Romania, 27-29 August 2009; pp. 417-422.
- Alex, Z., Stewart, W., James, W., et al. Long Short Term Memory for Driver Intent Prediction. In Proceedings of the 2017 IEEE Intelligent Vehicles Symposium (IV), Los Angeles, CA, USA, 11-14 June 2017; pp. 1484-1489.
- Nachiket, D., & Mohan, M. T. Multi-Modal Trajectory Prediction of Surrounding Vehicles with Maneuver based LSTMs. In Proceedings of the 2018 IEEE Intelligent Vehicles Symposium (IV), Changshu, China, 26-30 June 2018; pp. 1179-1184.
- Kaouther M, IthKaouther M, Itheri Y, Anne V B, et al. Attention Based Vehicle Trajectory Prediction[J]. IEEE Transactions on Intelligent Vehicles, 2021, 6(1): 175-185eri Y, Anne V B, et al. Attention Based Vehicle Trajectory Prediction[J]. IEEE Transactions on Intelligent Vehicles, 2021, 6(1): 175-185. [CrossRef]
- Qi, G. Q., Du, Y. M., Wu, J. P., et al. Leveraging longitudinal driving behaviour data with data mining techniques for driving style analysis. IET Intelligent Transport Systems 2015, 9(8), 792-801. [CrossRef]


















| Driving primitives | Explanation |
| #0 | Stable High Speed |
| #1 | Stopping |
| #2 | Accelerating |
| #3 | Low Speed |
| #4 | Turning |
| Parameter | Value |
| Input layer dimension of LSTM encoder | 32 |
| Dimension of LSTM hidden vectors | 64 |
| Feedforward network layer dimension | 128 |
| Input data (grid_size: 3×13 road network structure) | 13,3 |
| Batch size | 128 |
| Learning rate (Lr) | 0.001 |
| Optimizer | Adam |
| Training epochs | 200 |
| 2 | |
| 0.11 | |
| 15 | |
| 35 | |
| 5 |
| Evaluation Metric | Prediction Duration [s] | CV | S-LSTM | M-LSTM | A-LSTM | CS-LSTM | STA-LSTM | HTSA-LSTM |
| RMSE | 1 | 0.953 | 0.683 | 0.584 | 0.648 | 0.623 | 0.615 | 0.499 |
| 2 | 1.826 | 1.538 | 1.263 | 1.325 | 1.268 | 1.256 | 0.898 | |
| 3 | 3.352 | 2.466 | 2.128 | 2.163 | 2.065 | 2.047 | 1.314 | |
| 4 | 5.885 | 3.847 | 3.245 | 3.258 | 3.137 | 3.044 | 2.225 | |
| 5 | 8.943 | 5.236 | 4.661 | 4.562 | 4.366 | 4.281 | 3.461 | |
| 6 | 11.127 | 6.542 | 6.157 | 6.037 | 5.879 | 5.761 | 4.897 | |
| NLL | 1 | 3.718 | 2.023 | 1.173 | 1.012 | 0.576 | 0.630 | 0.603 |
| 2 | 5.372 | 3.634 | 2.851 | 2.487 | 2.142 | 1.088 | 1.115 | |
| 3 | 7.389 | 4.628 | 3.806 | 3.362 | 3.028 | 1.718 | 2.056 | |
| 4 | 9.164 | 5.352 | 4.479 | 4.158 | 3.692 | 2.735 | 2.769 | |
| 5 | 10.895 | 6.295 | 5.095 | 5.036 | 4.351 | 4.077 | 3.264 | |
| 6 | 12.026 | 7.036 | 6.432 | 6.347 | 6.028 | 5.852 | 4.318 |
| Road Type | Prediction Horizon [s] | Evaluation Metrics | ||
| RMSE | MAE | R2 | ||
| CRS | 1 | 0.092 | 0.026 | 0.959 |
| 2 | 0.086 | 0.025 | 0.938 | |
| 3 | 0.103 | 0.037 | 0.927 | |
| 4 | 0.126 | 0.039 | 0.912 | |
| 5 | 0.114 | 0.054 | 0.895 | |
| 6 | 0.119 | 0.062 | 0.888 | |
| 7 | 0.132 | 0.067 | 0.872 | |
| 8 | 0.151 | 0.078 | 0.863 | |
| 9 | 0.172 | 0.093 | 0.859 | |
| 10 | 0.188 | 0.098 | 0.851 | |
| UES | 1 | 0.009 | 0.007 | 0.997 |
| 2 | 0.012 | 0.009 | 0.994 | |
| 3 | 0.017 | 0.013 | 0.991 | |
| 4 | 0.019 | 0.015 | 0.989 | |
| 5 | 0.021 | 0.016 | 0.980 | |
| 6 | 0.025 | 0.020 | 0.971 | |
| 7 | 0.031 | 0.024 | 0.962 | |
| 8 | 0.038 | 0.029 | 0.955 | |
| 9 | 0.042 | 0.031 | 0.947 | |
| 10 | 0.047 | 0.036 | 0.941 | |
| HS | 1 | 0.014 | 0.011 | 0.996 |
| 2 | 0.017 | 0.014 | 0.994 | |
| 3 | 0.021 | 0.017 | 0.991 | |
| 4 | 0.023 | 0.018 | 0.986 | |
| 5 | 0.032 | 0.026 | 0.979 | |
| 6 | 0.039 | 0.030 | 0.969 | |
| 7 | 0.042 | 0.033 | 0.957 | |
| 8 | 0.049 | 0.040 | 0.949 | |
| 9 | 0.057 | 0.048 | 0.944 | |
| 10 | 0.068 | 0.054 | 0.936 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).