Submitted:
25 September 2025
Posted:
26 September 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
3. Methodology
4. Algorithm and Model
4.1. STELLAR Framework Overview
4.2. Context-Aware LLM Semantic Enhancement
4.3. Spatio-Temporal Fusion Network
4.4. Hierarchical Multi-Task Learning
4.5. Cross-Modal Attention and Feature Fusion
4.6. Training Strategy
5. Data Preprocessing
5.1. Multi-Scale Spatial Encoding
5.2. Temporal Pattern Extraction
6. Evaluation Metrics
7. Experiment Results
8. Conclusion
References
- Zhou, X.; Wang, J.; Liu, Y.; Wu, X.; Shen, Z.; Leung, C. Inductive graph transformer for delivery time estimation. In Proceedings of the Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023, pp. 679–687.
- Mashurov, V.; Chopuryan, V.; Porvatov, V.; Ivanov, A.; Semenova, N. Gct-TTE: graph convolutional transformer for travel time estimation. Journal of Big Data 2024, 11, 15. [CrossRef]
- Zhang, L.; Wang, M.; Zhou, X.; Wu, X.; Cao, Y.; Xu, Y.; Cui, L.; Shen, Z. Dual graph multitask framework for imbalanced delivery time estimation. In Proceedings of the International Conference on Database Systems for Advanced Applications. Springer, 2023, pp. 606–618.
- Wang, X.; Wang, L.; Dong, C.; Ren, H.; Xing, K. An online deep reinforcement learning-based order recommendation framework for rider-centered food delivery system. IEEE Transactions on Intelligent Transportation Systems 2023, 24, 5640–5654. [CrossRef]
- Wen, H.; Lin, Y.; Wu, L.; Mao, X.; Cai, T.; Hou, Y.; Guo, S.; Liang, Y.; Jin, G.; Zhao, Y.; et al. A survey on service route and time prediction in instant delivery: Taxonomy, progress, and prospects. IEEE Transactions on Knowledge and Data Engineering 2024. [CrossRef]
- Yalçinkaya, E.; Hızıroğlu, O.A. A comparative analysis of machine learning models for time prediction in food delivery operations. Artificial Intelligence Theory and Applications 2024, 4, 43–56.
- Yu, X.; Lan, A.; Mao, H. Short-term demand prediction for on-demand food delivery with attention-based convolutional lstm. Systems 2023, 11, 485. [CrossRef]
- Zhu, Y.; Liu, Y. LLM-NER: Advancing Named Entity Recognition with LoRA+ Fine-Tuned Large Language Models. In Proceedings of the 2025 11th International Conference on Computing and Artificial Intelligence (ICCAI), 2025, pp. 364–368. [CrossRef]
- Luo, X. Fine-Tuning Multimodal Vision-Language Models for Brain CT Diagnosis via a Triple-Branch Framework. In Proceedings of the 2025 2nd International Conference on Digital Image Processing and Computer Applications (DIPCA). IEEE, 2025, pp. 270–274.
- Guan, S. Predicting Medical Claim Denial Using Logistic Regression and Decision Tree Algorithm. In Proceedings of the 2024 3rd International Conference on Health Big Data and Intelligent Healthcare (ICHIH), 2024, pp. 7–10. [CrossRef]




| Model | WAcc (%) | MAPE (%) | PAE@90 (min) | BOS |
|---|---|---|---|---|
| DeepFM | 75.3 | 15.7 | 18.2 | 0.642 |
| Wide&Deep | 77.1 | 14.3 | 16.8 | 0.671 |
| TransGNN | 80.4 | 12.1 | 14.3 | 0.723 |
| HierMTL | 82.2 | 10.8 | 12.7 | 0.751 |
| STGCN-MT | 83.6 | 9.9 | 11.5 | 0.774 |
| BERT4Rec | 84.1 | 9.5 | 11.1 | 0.782 |
| T5-Delivery | 85.3 | 9.0 | 10.4 | 0.798 |
| STELLAR | 87.3 | 8.2 | 9.2 | 0.826 |
| Configuration | WAcc (%) | MAPE (%) | WAcc | MAPE |
|---|---|---|---|---|
| STELLAR (Full) | 87.3 | 8.2 | – | – |
| w/o LLM Enhancement | 83.1 | 9.3 | -4.2 | +1.1 |
| w/o Spatial-Temporal Fusion | 82.5 | 9.8 | -4.8 | +1.6 |
| w/o Multi-Task Learning | 85.2 | 8.9 | -2.1 | +0.7 |
| w/o Cross-Modal Attention | 84.7 | 8.8 | -2.6 | +0.6 |
| w/o Gradient Surgery | 86.1 | 8.5 | -1.2 | +0.3 |
| w/o Curriculum Learning | 86.4 | 8.4 | -0.9 | +0.2 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).