Submitted:
08 January 2026
Posted:
12 January 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
Contributions
2. Related Work
2.1. AI-Driven Logistics Optimization and Sustainability
2.2. Multi-Agent Systems for Manufacturing and Logistics
2.3. Human-AI Collaboration in Distributed Operations
2.4. Research Gaps and Future Directions
2.5. Preliminary Concepts
3. Method
3.1. Adaptive Logistics Feature Engineering
3.2. Multi-Agent Route Negotiation
3.3. Human-AI Coordination Optimization
3.4. Integrated Algorithm
| Algorithm 1: Adaptive Multi-Agent Logistics Optimization with Human-AI Coordination |
|
3.5. Theoretical Analysis
4. Experimental Evaluation
4.1. Experimental Settings
4.1.1. Benchmarks
4.1.2. Implementation Details
4.2. Main Results
4.2.1. Performance on COCO-Logistics and Solomon VRPTW Benchmarks















4.2.2. Performance on Real-world Delivery Operations
4.2.3. Decision Latency and Coordination Efficiency
4.2.4. Human-AI Coordination and Adaptation Quality
4.3. Case Study Analysis
4.3.1. Dynamic Traffic Scenario Analysis
4.3.2. Multi-Agent Negotiation Pattern Analysis
4.3.3. Human-AI Coordination Learning Analysis
4.4. Ablation Study
4.4.1. Impact of Adaptive Feature Engineering Removal
4.4.2. Analysis of Decision Caching and Coordination Learning Components
4.4.3. Learning Rate Sensitivity Analysis
4.4.4. Architectural Design Choice Evaluation
5. Conclusions
References
- Yu, Z. Ai for science: A comprehensive review on innovations, challenges, and future directions. International Journal of Artificial Intelligence for Science (IJAI4S) 2025, 1. [Google Scholar] [CrossRef]
- Lin, S. LLM-Driven Adaptive Source-Sink Identification and False Positive Mitigation for Static Analysis. arXiv 2025, arXiv:cs. [Google Scholar]
- Ren, J.; Liu, L.; Wu, Y.; Ouyang, L.; Yu, Z. Estimating forest carbon stock using enhanced resnet and sentinel-2 imagery. Forests 2025, 16, 1198. [Google Scholar] [CrossRef]
- Xin, Y.; Du, J.; Wang, Q.; Lin, Z.; Yan, K. Vmt-adapter: Parameter-efficient transfer learning for multi-task dense scene understanding. Proceedings of the Proceedings of the AAAI conference on artificial intelligence 2024, Vol. 38, 16085–16093. [Google Scholar] [CrossRef]
- Cao, Z.; He, Y.; Liu, A.; Xie, J.; Wang, Z.; Chen, F. CoFi-Dec: Hallucination-Resistant Decoding via Coarse-to-Fine Generative Feedback in Large Vision-Language Models. In Proceedings of the Proceedings of the 33rd ACM International Conference on Multimedia, 2025; pp. 10709–10718. [Google Scholar]
- Yu, Z.; Idris, M.Y.I.; Wang, H.; Wang, P.; Chen, J.; Wang, K. From physics to foundation models: A review of ai-driven quantitative remote sensing inversion. arXiv 2025, arXiv:2507.09081. [Google Scholar] [CrossRef]
- Cao, Z.; He, Y.; Liu, A.; Xie, J.; Wang, Z.; Chen, F. PurifyGen: A Risk-Discrimination and Semantic-Purification Model for Safe Text-to-Image Generation. In Proceedings of the Proceedings of the 33rd ACM International Conference on Multimedia, 2025; pp. 816–825. [Google Scholar]
- He, Y.; Li, S.; Li, K.; Wang, J.; Li, B.; Shi, T.; Xin, Y.; Li, K.; Yin, J.; Zhang, M.; et al. GE-Adapter: A General and Efficient Adapter for Enhanced Video Editing with Pretrained Text-to-Image Diffusion Models. Expert Systems with Applications 2025, 129649. [Google Scholar] [CrossRef]
- Gao, B.; Wang, J.; Song, X.; He, Y.; Xing, F.; Shi, T. Free-Mask: A Novel Paradigm of Integration Between the Segmentation Diffusion Model and Image Editing. In Proceedings of the Proceedings of the 33rd ACM International Conference on Multimedia, 2025; pp. 9881–9890. [Google Scholar]
- Wu, X.; Zhang, Y.; Shi, M.; Li, P.; Li, R.; Xiong, N.N. An adaptive federated learning scheme with differential privacy preserving. Future Generation Computer Systems 2022, 127, 362–372. [Google Scholar] [CrossRef]
- Tian, Y.; Yang, Z.; Liu, C.; Su, Y.; Hong, Z.; Gong, Z.; Xu, J. CenterMamba-SAM: Center-Prioritized Scanning and Temporal Prototypes for Brain Lesion Segmentation. arXiv 2025, arXiv:cs. [Google Scholar]
- Wu, X.; Wang, H.; Zhang, Y.; Zou, B.; Hong, H. A tutorial-generating method for autonomous online learning. IEEE Transactions on Learning Technologies 2024, 17, 1532–1541. [Google Scholar] [CrossRef]
- Sarkar, A.; Idris, M.Y.I.; Yu, Z. Reasoning in computer vision: Taxonomy, models, tasks, and methodologies. arXiv arXiv:2508.10523. [CrossRef]
- Liang, C.X.; Tian, P.; Yin, C.H.; Yua, Y.; An-Hou, W.; Ming, L.; Wang, T.; Bi, Z.; Liu, M. A comprehensive survey and guide to multimodal large language models in vision-language tasks. arXiv 2024, arXiv:2411.06284. [Google Scholar] [CrossRef]
- Wu, X.; Wang, H.; Tan, W.; Wei, D.; Shi, M. Dynamic allocation strategy of VM resources with fuzzy transfer learning method. Peer-to-Peer Networking and Applications 2020, 13, 2201–2213. [Google Scholar] [CrossRef]
- Wu, X.; Zhang, Y.T.; Lai, K.W.; Yang, M.Z.; Yang, G.L.; Wang, H.H. A novel centralized federated deep fuzzy neural network with multi-objectives neural architecture search for epistatic detection. IEEE Transactions on Fuzzy Systems 2024, 33, 94–107. [Google Scholar] [CrossRef]
- Xin, Y.; Yan, J.; Qin, Q.; Li, Z.; Liu, D.; Li, S.; Huang, V.S.J.; Zhou, Y.; Zhang, R.; Zhuo, L.; et al. Lumina-mgpt 2.0: Stand-alone autoregressive image modeling. arXiv arXiv:2507.17801.
- Liang, X.; Tao, M.; Xia, Y.; Wang, J.; Li, K.; Wang, Y.; He, Y.; Yang, J.; Shi, T.; Wang, Y.; et al. SAGE: Self-evolving Agents with Reflective and Memory-augmented Abilities. Neurocomputing 2025, 130470. [Google Scholar] [CrossRef]
- Wang, S.; Wang, Z.; Wang, M.; Han, W. g-Good-neighbor conditional diagnosability of star graph networks under PMC model and MM* model. Frontiers of Mathematics in China 2017, 12, 1221–1234. [Google Scholar] [CrossRef]
- Wang, M.; Xiang, D.; Wang, S. Connectivity and diagnosability of leaf-sort graphs. Parallel Processing Letters 2020, 30, 2040004. [Google Scholar] [CrossRef]
- Wang, M.; Wang, S. Diagnosability of Cayley graph networks generated by transposition trees under the comparison diagnosis model. Annals of Applied Mathematics 2016, 32, 166–173. [Google Scholar]
- Wang, S.; Wang, M. A Note on the Connectivity of m-Ary n-Dimensional Hypercubes. Parallel Processing Letters 2019, 29, 1950017. [Google Scholar] [CrossRef]
- Wang, S.; Wang, M. The Edge Connectivity of Expanded k-Ary n-Cubes. Discrete Dynamics in Nature and Society 2018, 2018, 7867342. [Google Scholar] [CrossRef]
- Wang, M.; Yang, W.; Wang, S. Conditional matching preclusion number for the Cayley graph on the symmetric group. Acta Math. Appl. Sin. (Chinese Series) 2013, 36, 813–820. [Google Scholar]
- Wang, M.; Xu, S.; Jiang, J.; Xiang, D.; Hsieh, S.Y. Global reliable diagnosis of networks based on Self-Comparative Diagnosis Model and g-good-neighbor property. Journal of Computer and System Sciences 2025, 103698. [Google Scholar] [CrossRef]
- Song, X.; Chen, K.; Bi, Z.; Niu, Q.; Liu, J.; Peng, B.; Zhang, S.; Yuan, Z.; Liu, M.; Li, M.; et al. Transformer: A Survey and Application. 2025. [Google Scholar]
- Liang, C.X.; Bi, Z.; Wang, T.; Liu, M.; Song, X.; Zhang, Y.; Song, J.; Niu, Q.; Peng, B.; Chen, K.; et al. Low-Rank Adaptation for Scalable Large Language Models: A Comprehensive Survey. 2025. [Google Scholar] [CrossRef] [PubMed]
- Zhou, Y.; He, Y.; Su, Y.; Han, S.; Jang, J.; Bertasius, G.; Bansal, M.; Yao, H. ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding. arXiv arXiv:2506.01300.
- Lin, S. Hybrid Fuzzing with LLM-Guided Input Mutation and Semantic Feedback. arXiv 2025, arXiv:2511.03995. [Google Scholar] [CrossRef]
- Qi, H.; Hu, Z.; Yang, Z.; Zhang, J.; Wu, J.J.; Cheng, C.; Wang, C.; Zheng, L. Capacitive aptasensor coupled with microfluidic enrichment for real-time detection of trace SARS-CoV-2 nucleocapsid protein. Analytical chemistry 2022, 94, 2812–2819. [Google Scholar] [CrossRef] [PubMed]
- Wang, H.; Zhang, X.; Xia, Y.; Wu, X. An intelligent blockchain-based access control framework with federated learning for genome-wide association studies. Computer Standards & Interfaces 2023, 84, 103694. [Google Scholar]
- Li, M.; Chen, K.; Bi, Z.; Liu, M.; Peng, B.; Niu, Q.; Liu, J.; Wang, J.; Zhang, S.; Pan, X.; et al. Surveying the mllm landscape: A meta-review of current surveys. arXiv 2024. arXiv:2409.18991.
- Wu, X.; Dong, J.; Bao, W.; Zou, B.; Wang, L.; Wang, H. Augmented intelligence of things for emergency vehicle secure trajectory prediction and task offloading. IEEE Internet of Things Journal 2024, 11, 36030–36043. [Google Scholar] [CrossRef]
- Bai, Z.; Ge, E.; Hao, J. Multi-Agent Collaborative Framework for Intelligent IT Operations: An AOI System with Context-Aware Compression and Dynamic Task Scheduling. arXiv arXiv:2512.13956.
- Han, X.; Gao, X.; Qu, X.; Yu, Z. Multi-Agent Medical Decision Consensus Matrix System: An Intelligent Collaborative Framework for Oncology MDT Consultations. arXiv arXiv:2512.14321. [CrossRef]
- Qu, D.; Ma, Y. Magnet-bn: Markov-guided Bayesian neural networks for calibrated long-horizon sequence forecasting and community tracking. Mathematics 2025, 13, 2740. [Google Scholar] [CrossRef]
- Lin, S. Abductive Inference in Retrieval-Augmented Language Models: Generating and Validating Missing Premises. arXiv 2025, arXiv:2511.04020. [Google Scholar] [CrossRef]
- Cao, Z.; He, Y.; Liu, A.; Xie, J.; Chen, F.; Wang, Z. TV-RAG: A Temporal-aware and Semantic Entropy-Weighted Framework for Long Video Retrieval and Understanding. In Proceedings of the Proceedings of the 33rd ACM International Conference on Multimedia, 2025; pp. 9071–9079. [Google Scholar]
- Yu, Z.; Idris, M.Y.I.; Wang, P.; Qureshi, R. CoTextor: Training-Free Modular Multilingual Text Editing via Layered Disentanglement and Depth-Aware Fusion. In Proceedings of the The Thirty-ninth Annual Conference on Neural Information Processing Systems Creative AI Track: Humanity, 2025. [Google Scholar]
- Wang, J.; He, Y.; Zhong, Y.; Song, X.; Su, J.; Feng, Y.; Wang, R.; He, H.; Zhu, W.; Yuan, X.; et al. Twin co-adaptive dialogue for progressive image generation. In Proceedings of the Proceedings of the 33rd ACM International Conference on Multimedia, 2025; pp. 3645–3653. [Google Scholar]
- Xin, Y.; Qin, Q.; Luo, S.; Zhu, K.; Yan, J.; Tai, Y.; Lei, J.; Cao, Y.; Wang, K.; Wang, Y.; et al. Lumina-dimoo: An omni diffusion large language model for multi-modal generation and understanding. arXiv arXiv:2510.06308.
- Yang, C.; He, Y.; Tian, A.X.; Chen, D.; Wang, J.; Shi, T.; Heydarian, A.; Liu, P. Wcdt: World-centric diffusion transformer for traffic scene generation. In Proceedings of the 2025 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 2025; pp. 6566–6572. [Google Scholar]
- Lin, Y.; Wang, M.; Xu, L.; Zhang, F. The maximum forcing number of a polyomino. Australas. J. Combin 2017, 69, 306–314. [Google Scholar]
- Solomon, M.M. Algorithms for the vehicle routing and scheduling problems with time window constraints. Operations Research 1987, 35, 254–265. [Google Scholar] [CrossRef]
- OpenStreetMap. 07 01 2026. Available online: https://www.openstreetmap.org.
- Boeing, G. OSMnx: New methods for acquiring, constructing, analyzing, and visualizing complex street networks. Computers, Environment and Urban Systems 2017, 65, 126–139. [Google Scholar] [CrossRef]
- Brandimarte, P. Routing and scheduling in a flexible job shop by tabu search. Annals of Operations Research 1993, 41, 157–183. [Google Scholar] [CrossRef]
- Perron, L.; Furnon, V. Google OR-Tools: An open-source software suite for optimization. Google AI Blog / Optimization Tools Documentation. 2019. Available online: https://developers.google.com/optimization.
- Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016; pp. 785–794. [Google Scholar]
| Method | Route Accuracy (%) | Fuel Reduction (%) | Decision Time (ms) | Synthetic MAE | Solomon VRPTW Score | Real-world Success (%) |
|---|---|---|---|---|---|---|
| Google OR-Tools [48] | 75.2 | 8.3 | 1200 | 2.45 | 0.823 | 71.4 |
| Traditional Multi-Agent [47] | 72.8 | 6.7 | 2800 | 2.78 | 0.791 | 68.9 |
| XGBoost Baseline [49] | 78.1 | 9.2 | 800 | 2.12 | 0.847 | 74.2 |
| Ours | 87.3 | 15.7 | 220 | 1.34 | 0.912 | 83.6 |
| Method | Decision Time (ms) | Cache Hit Rate (%) | Memory Usage (GB) | Human Acceptance (%) | Adaptation Score | Coordination Success (%) |
|---|---|---|---|---|---|---|
| Google OR-Tools [48] | 1200 | 0.0 | 2.1 | 65.3 | 0.42 | 67.8 |
| Traditional Multi-Agent [47] | 2800 | 15.2 | 3.8 | 60.2 | 0.38 | 63.4 |
| XGBoost Baseline [49] | 800 | 0.0 | 1.9 | 68.7 | 0.51 | 70.1 |
| Ours | 220 | 81.3 | 2.4 | 78.4 | 0.89 | 84.2 |
| Variant | Route Accuracy (%) | Fuel Reduction (%) | Dynamic Adaptation Score |
|---|---|---|---|
| Full Model | 87.3 | 15.7 | 0.89 |
| w/o Adaptive Features (High-level) | 78.1 | 9.2 | 0.34 |
| Variant | Decision Time (ms) | Human Acceptance (%) | Coordination Success (%) |
|---|---|---|---|
| Full Model | 220 | 78.4 | 84.2 |
| w/o Decision Caching (High-level) | 2650 | 76.8 | 81.3 |
| w/o Coordination Learning (High-level) | 235 | 60.2 | 67.9 |
| Variant | Convergence Epochs | Stability Score | Final Performance (%) |
|---|---|---|---|
| Full Model (lr=0.001) | 120 | 0.94 | 87.3 |
| Higher Learning Rate (lr=0.01) | 85 | 0.67 | 83.1 |
| Lower Learning Rate (lr=0.0001) | 180 | 0.96 | 85.9 |
| Variant | Scalability Score | Decision Quality | System Resilience |
|---|---|---|---|
| Full Model (Distributed) | 0.91 | 0.88 | 0.85 |
| Centralized Decision-Making | 0.73 | 0.84 | 0.62 |
| Hybrid Architecture | 0.86 | 0.87 | 0.79 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).