Submitted:
06 June 2025
Posted:
06 June 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Simulation Environment
2.1.1. Controller–Simulator Interaction via TraCI
2.1.2. Decentralized Agent Coordination Through Environmental Feedback
2.1.3. Simulation Scenarios
2.2. Reinforcement Learning Framework
2.2.1. Waiting Time-Based Reward
2.2.2. Queue Length-Based Reward
2.2.3. Weighted Combination Reward

2.3. Reward Function Variations
3. Results
3.1. Performance Under Waiting Time Reward
3.2. Performance Under Queue Length Reward
3.3. Performance Under Combined Reward
3.4. Comparative Analysis
| Reward Function | Mean Waiting Time | Mean Queue Length | Total Waiting Time | Maximum Delay | Maximum Queue Size |
| Waiting Time Only | Low | Moderate | Low | Lowest | Moderate |
| Queue Length Only | High | Very Low | High | High | Lowest |
| Combined Reward | Moderate | Low | Moderate | Lower than Queue Only | Low |
4. Discussion
5. Conclusion
References
- Tan, J.; Yuan, Q.; Guo, W.; Xie, N.; Liu, F.; Wei, J.; Zhang, X. Deep Reinforcement Learning for Traffic Signal Control Model and Adaptation Study. Sensors 2022, 22, 8732. [CrossRef]
- Wu, C.; Kreidieh, A.; Parvate, K.; Vinitsky, E.; Bayen, A.M. Flow: Deep Reinforcement Learning for Control in SUMO. arXiv 2023, arXiv:2311.15920. [CrossRef]
- Müller, U.; Widmer, J.; Djukic, T. Towards Real-World Deployment of Reinforcement Learning for Traffic Signal Control. arXiv 2021, arXiv:2109.15121. [CrossRef]
- Korkmaz, M.; Cakiroglu, M. A Hybrid Traffic Controller System Based on Flower Pollination Algorithm and Fuzzy Logic. Sensors 2024, 24, 1568. [CrossRef]
- Odeh, S.M.; Nawayseh, K. A Hybrid Fuzzy Genetic Algorithm for an Adaptive Traffic Signal System. Jordanian Journal of Mechanical and Industrial Engineering 2015, 9, 73–78.
- Aburas, A.A.A.; Bashi, A.B. Optimising Traffic Flow with Fuzzy Logic and Machine Learning. Zenodo 2024. [CrossRef]
- Zahwa, F.; Cheng, C.-T.; Simic, M. Novel Intelligent Traffic Light Controller Design. Machines 2024, 12, 469. [CrossRef]
- Zahwa, F.; Simic, M.; Cheng, C.-T. Fuzzy-Based Traffic Light Control Strategy: The Good, the Bad, and the Ugly. In Big Data Analytics and Data Science; Bhateja, V., Ed.; Lecture Notes in Networks and Systems, Vol. 1106; Springer: Singapore, 2024; pp. 53–63. [CrossRef]
- Wegener, A.; Piórkowski, M.; Raya, M.; Hellbrück, H.; Fischer, S.; Hubaux, J.-P. TraCI: An Interface for Coupling Road Traffic and Network Simulators. In Proceedings of the 11th Communications and Networking Simulation Symposium (CNS), Ottawa, ON, Canada, 14–17 April 2008; pp. 155–163. [CrossRef]
- Alharbi, S.; Alqahtani, H.; Muthanna, A.; Alsubhi, K.; Alzahrani, B. Multi-Objective Reinforcement Learning for Urban Traffic Signal Control: A Multi-Agent Approach. Appl. Sci. 2024, 14, 3219. [CrossRef]
- Karimipour, H.; Wang, Y.; Zhou, J.; Yu, T. Reward Shaping Techniques in Reinforcement Learning-Based Traffic Signal Control. Appl. Sci. 2023, 13, 8019. [CrossRef]
- Aldakkhelallah, A.A.A.; Shiwakoti, N.; Dabic, T.; Lu, J.; Yii, W.; Simic, M. Development of New Technologies for Real-Time Traffic Monitoring. AIP Conf. Proc. 2022, 2681, 020080. [CrossRef]
- Aldakkhelallah, A.A.A.; Shiwakoti, N.; Dabic, T.; Lu, J.; Yii, W.; Simic, M. Public Opinion Survey on the Development of an Intelligent Transport System: A Case Study in Saudi Arabia. AIP Conf. Proc. 2022, 2681, 020089. [CrossRef]
- Aldakkhelallah, A.A.A.; Simic, M. Autonomous Vehicles in Intelligent Transportation Systems. In Human Centred Intelligent Systems; Zimmermann, A., Howlett, R.J., Jain, L.C., Schmidt, R., Eds.; Springer: Singapore, 2021; pp. 185–198.
- Todorovic, M.; Aldakkhelallah, A.; Simic, M. Managing Transitions to Autonomous and Electric Vehicles: Scientometric and Bibliometric Review. World Electr. Veh. J. 2023, 14, 314. [CrossRef]
- Aldakkhelallah, A.; Alamri, A.S.; Georgiou, S.; Simic, M. Public Perception of the Introduction of Autonomous Vehicles. World Electr. Veh. J. 2023, 14, 345. [CrossRef]
- Zhao, Z.; Simic, M. Transition to Electric and Autonomous Vehicles in China. In Human Centred Intelligent Systems; Zimmermann, A., Howlett, R.J., Jain, L.C., Eds.; Springer: Singapore, 2022; pp. 111–123.
- Liang, X.; Du, X.; Wang, G.; Han, Z. A Deep Reinforcement Learning Network for Traffic Light Cycle Control. IEEE Transactions on Vehicular Technology 2020, 69, 386–395.
- Van der Pol, E.; Oliehoek, F.A. Coordinated Deep Reinforcement Learners for Traffic Light Control. Proceedings of the NIPS Workshop on Learning, Inference and Control of Multi-Agent Systems, 2016.
- Gao, J.; Shen, Y.; Liu, J.; Ito, M. Adaptive Traffic Signal Control: Deep Reinforcement Learning Algorithm with Experience Replay and Target Network. arXiv 2017, arXiv:1705.02755. [CrossRef]
- Li, L.; Lv, Y.; Wang, F.Y. Traffic Signal Timing via Deep Reinforcement Learning. IEEE/CAA Journal of Automatica Sinica 2016, 3, 247–254. [CrossRef]
- El-Tantawy, S.; Abdulhai, B.; Abdelgawad, H. Multiagent Reinforcement Learning for Integrated Network of Adaptive Traffic Signal Controllers (MARLIN-ATSC): Methodology and Large-Scale Application on Downtown Toronto. IEEE Transactions on Intelligent Transportation Systems 2013, 14, 1140–1150. [CrossRef]
- Mannion, P.; Duggan, J.; Howley, E. An Experimental Review of Reinforcement Learning Algorithms for Adaptive Traffic Signal Control. Springer International Publishing, Autonomic Road Transport Support Systems, 2016; pp. 47–66. [CrossRef]
- Genders, W.; Razavi, S. Using a Deep Reinforcement Learning Agent for Traffic Signal Control. arXiv 2016, arXiv:1611.01142. [CrossRef]
- ei, H.; Zheng, G.; Yao, H.; Li, Z. IntelliLight: A Reinforcement Learning Approach for Intelligent Traffic Light Control. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018; pp. 2496–2505. [CrossRef]
- Nishi, T.; Chakraborty, S.; Chakraborty, P. A Reinforcement Learning Approach for Signal Timing of Urban Traffic Network. Transportation Research Record 2020, 2674, 160–172.
- Chu, T.; Wang, J.; Codecà, L.; Li, Z. Multi-Agent Deep Reinforcement Learning for Large-Scale Traffic Signal Control. IEEE Transactions on Intelligent Transportation Systems 2020, 21, 1086–1095. [CrossRef]





Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).