Preprint
Article

This version is not peer-reviewed.

Reinforcement Learning-based Resource Allocation Scheme of NR-V2X Sidelink for Joint Communication and Sensing

A peer-reviewed version of this preprint was published in:
Sensors 2025, 25(2), 302. https://doi.org/10.3390/s25020302

Submitted:

19 November 2024

Posted:

22 November 2024

You are already at the latest version

Abstract

Joint communication and sensing(JCS) is becoming an important trend in 6G, owing to its the efficient utilization of spectrum and hardware resource. Utilizing echoes of the same signal can achieve the object location sensing function, in addition to V2X communication function. There is application potential for JCS systems in the fields of ADAS and unmanned autos. Currently, NR-V2X sidelink has been standardized by 3GPP to support the low-latency high-reliability direct communication. In order to combine benefits from both direct communication and JCS, it is promising to extend existing NR-V2X sidelink communication towards sidelink JCS. However, the conflicting performance requirements arise between radar sensing accuracy and communication reliability with the limited sidelink spectrum. In order to overcome the challenges in the distributed resource allocation of sidelink JCS with a full-duplex, this paper has proposed a novel consecutive-collision mitigation semi-persistent scheduling (CCM-SPS) scheme, including the collision detection and Q-learning training stages to suppress collision probabilities. Theoretical performance analyses on Cramér-Rao lower bounds(CRLB) has been made for the sensing of sidelink JCS. Key performance metrics such as CRLB, PRR and UD have been evaluated. The simulation results indicate the superior performance of CCM-SPS to the counterparts, with promising application prospects.

Keywords: 
;  ;  ;  ;  

1. Introduction

In the future, vehicles on the road will need to frequently exchange information with surrounding vehicles, pedestrians, and road traffic infrastructure, which drives the development of Vehicle-to-Everything (V2X) technology. Today’s vehicles are equipped with numerous sensors and communication systems, having transformed from traditional vehicles into intelligent vehicles. Through V2X network, there is potential to achieve Advanced Driver Assistance System(ADAS), improving driving safety and comfort. Recently, the latest beyond 5G and 6G standards have introduced new requirements for V2X, including enhanced demands for sensing accuracy, precision, and resolution, alongside the existing communication criteria of latency, reliability, capacity, and coverage.[1]. Therefore, the joint communication and sensing(JCS) system that utilizes a signal to simultaneously achieve two functions have attracted much attention.
In previous systems, communication and radar sensing were separate systems using different frequencies and hardware resources. However, with increasingly scarce spectrum resources, there is a need for more efficient utilization of spectrum resources by communication and radar systems. As the bandwidth of commercial communication systems increases , coexistence with various existing radar systems is anticipated, leading to the development of JCS concept[2]. JCS can provide integrated and collaborative gains for future systems[3]. On one hand, sharing spectrum and hardware resources can lead to high resource utilization efficiency. on the other hand, sensing function can assist communication in obtaining more accurate channel estimation models, which beneficial for beamforming and Spectrum resource management.
3GPP Release 16 has established standards for vehicle sidelink communication based on the 5G-NR PC5 air interface, enabling vehicles to communicate directly without the assistance of gNB[4], as illustrated in Figure 1. Sidelink is beneficial for reducing latency and improving communication. In the meanwhile, sidelink signals also can be used for near-field positioning, range sensing, and distance measurement[5], thereby complementing or enhancing positioning systems that may be limited by obstacles or other factors, such as network-based positioning or Global Navigation Satellite System(GNSS). Therefore, the V2X sidelink JCS system has significant development potential.
However, due to limited available bandwidth and without the assistance of base station, there is a conflicting requirement between radar and communication in spectrum resource utilization. Radar accuracy requires large bandwidth occupancy, which reducing available resources in the resource pool, increasing resource collision probability, thereby affecting the performance of communication. The issue of resource collision in sidelink scenario is related to resource allocation scheme. Therefore, a flexible and robust resource allocation scheme is crucial for mitigating resource pool conflicts.
Traditional sidelink resource allocation scheme are divided into dynamic allocation and sensing-based semi-persistent scheduling(SB-SPS). SB-SPS is widely used for sidelink resource allocation due to its better reliability and latency[6]. It allows vehicles to autonomously choose and reserve resources for a given reservation period. However, potential collisions may remain undetected due to the lack of coordination from base stations, resulting in consecutive packet collisions and deteriorating sidelink JCS performance. Numerous studies have modified and improved the sensing and reselection process of SB-SPS[7][8], but have struggled to resolve the issue of consecutive collisions. With the advancements in self-interference (SI) technology in recent years[9], simultaneous transmission and reception on the same frequency band with in-band full-duplex (FD) transceivers have become feasible, offering hope for the implementation of JCS systems in the sidelink. Additionally, the powerful sensing capabilities of full-duplex bring collision detection functionality, creating new opportunities for enhancing sidelink resource allocation scheme.
Inspired by the above, this paper focuses on high-positioning accuracy, low-latency and high-reliability in 5G NR-V2X sidelink JCS system. By studying the comprehensive impact of interference due to consecutive collisions, we propose a reinforcement learning-based collision mitigation resource allocation scheme(CCM-SPS). Specifically, this scheme employs JCS full-duplex collision detection and reinforcement learning to optimize traditional SB-SPS parameters, mitigating performance degradation from consecutive collisions. Furthermore, the impact of varying vehicle density and packet sizes on JCS performance in dynamic vehicular networks is discussed. Finally, the effectiveness of the proposed scheme in enhancing overall performance is validated using a V2X sidelink system-level simulator.
The main contributions of this work can be summarized as follows:
  • In order to address the conflicting requirement between sensing accuracy and communication reliability for sidelink resources, a novel collision mitigation resource allocation scheme is proposed. The algorithm integrates the full-duplex detection capability of JCS with the resource sensing reservation process of the traditional SB-SPS scheme. This allows vehicles to dynamically optimize reservation times based on sensing channel information, effectively reducing consecutive packet collisions and enhancing the overall utilization efficiency of resources in the sidelink JCS system.
  • The performance of the proposed CCM-SPS and traditional SB-SPS is theoretically analyzed in scenarios of variety vehicle density and packet size. Through reinforcement learning, comprehensive optimization of resource utilization for sensing and communication is achieved.
  • Comprehensive evaluations are performed using the Cramér-Rao lower bounds(CRLB), packet reception rate (PRR) and update delay (UD). The novel scheme shows comparative advantages in positioning accuracy, latency, and reliability performance indicators over comparative scheme.
The remaining article is organized as follows. In Section 2, a review of recent literature is conducted, in Section 3, theoretical analysis performance on sidelink JCS system. In Section 4, analyzed the consecutive collision problem using traditional resource allocation scheme. Then, in Section 5, the specific implementation of the improved resource allocation scheme was introduced, including full-duplex collision detection and Q-learning based collisions mitigation scheme. The extensive results are presented in Section 6, which analyzes the performance indicators in various scenarios. In Section 7 presents the concluding remarks.

7. Conclusions

This paper proposes a resource allocation scheme in sidelink JCS system, named consecutive collision mitigation semi-persistent scheduling (CCM-SPS). By employing the collision detection referring to the echo power threshold and Q-learning to train the RC decreasing step size, this scheme can effectively suppress the consecutive collision probability. Compared with traditional SB-SPS and FD-enhanced scheme, CCM-SPS can achieve superior both sensing and communication performance even in the high-density vehicle scenarios. Furthermore, CCM-SPS can support services with large packet sizes to achieve accurate sensing with less cost of communication reliability with the increasing of distance. It is particularly meaningful of CCM-SPS in the perspective of enabling sidelinks to support sensing and communication collaboration in 6G network. In future work, there are interesting studies such as practical full-duplex impacts from interference and cross-layer optimization.

References

  1. YAN, S.; PENG, M.g.; WANG, W.b. Integration of communication, sensing and computing: The vision and key technologies of 6G. Journal of Beijing University of Posts and Telecommunications 2021, 44, 1. [Google Scholar]
  2. Griffiths, H.; Cohen, L.; Watts, S.; Mokole, E.; Baker, C.; Wicks, M.; Blunt, S. Radar spectrum engineering and management: Technical and regulatory issues. Proceedings of the IEEE 2014, 103, 85–102. [Google Scholar] [CrossRef]
  3. Liu, F.; Cui, Y.; Masouros, C.; Xu, J.; Han, T.X.; Eldar, Y.C.; Buzzi, S. Integrated sensing and communications: Toward dual-functional wireless networks for 6G and beyond. IEEE journal on selected areas in communications 2022, 40, 1728–1767. [Google Scholar] [CrossRef]
  4. Garcia, M.H.C.; Molina-Galan, A.; Boban, M.; Gozalvez, J.; Coll-Perales, B.; Şahin, T.; Kousaridas, A. A tutorial on 5G NR V2X communications. IEEE Communications Surveys & Tutorials 2021, 23, 1972–2026. [Google Scholar]
  5. Decarli, N.; Bartoletti, S.; Bazzi, A.; Stirling-Gallacher, R.A.; Masini, B.M. Performance Characterization of Joint Communication and Sensing With Beyond 5 G NR-V2X Sidelink. IEEE Transactions on Vehicular Technology 2024. [Google Scholar] [CrossRef]
  6. Jeon, Y.; Kuk, S.; Kim, H. Reducing message collisions in sensing-based semi-persistent scheduling (SPS) by using reselection lookaheads in cellular V2X. Sensors 2018, 18, 4388. [Google Scholar] [CrossRef] [PubMed]
  7. Gu, B.; Chen, W.; Alazab, M.; Tan, X.; Guizani, M. Multiagent reinforcement learning-based semi-persistent scheduling scheme in c-v2x mode 4. IEEE Transactions on Vehicular Technology 2022, 71, 12044–12056. [Google Scholar] [CrossRef]
  8. Gu, X.; Peng, J.; Cai, L.; Cheng, Y.; Zhang, X.; Liu, W.; Huang, Z. Performance analysis and optimization for semi-persistent scheduling in c-v2x. IEEE Transactions on Vehicular Technology 2022, 72, 4628–4642. [Google Scholar] [CrossRef]
  9. Kolodziej, K.E.; Perry, B.T.; Herd, J.S. In-band full-duplex technology: Techniques and systems survey. IEEE Transactions on Microwave Theory and Techniques 2019, 67, 3025–3041. [Google Scholar] [CrossRef]
  10. Li, Y.; Liu, F.; Du, Z.; Yuan, W.; Shi, Q.; Masouros, C. Frame Structure and Protocol Design for Sensing-Assisted NR-V2X Communications. IEEE Transactions on Mobile Computing 2024. [Google Scholar] [CrossRef]
  11. Ni, Z.; Zhang, J.A.; Yang, K.; Liu, R. Frequency-hopping based joint automotive radar-communication systems using a single device. 2022 IEEE International Conference on Communications Workshops (ICC Workshops). IEEE, 2022, pp. 480–485.
  12. Liu, Y.; Liao, G.; Chen, Y.; Xu, J.; Yin, Y. Super-resolution range and velocity estimations with OFDM integrated radar and communications waveform. IEEE Transactions on Vehicular Technology 2020, 69, 11659–11672. [Google Scholar] [CrossRef]
  13. Gaudio, L.; Kobayashi, M.; Caire, G.; Colavolpe, G. On the effectiveness of OTFS for joint radar parameter estimation and communication. IEEE Transactions on Wireless Communications 2020, 19, 5951–5965. [Google Scholar] [CrossRef]
  14. Zhang, Q.; Li, Z.; Gao, X.; Feng, Z. Performance evaluation of radar and communication integrated system for autonomous driving vehicles. IEEE INFOCOM 2021-IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS). IEEE, 2021, pp. 1–2.
  15. Yang, H.; Wei, Z.; Feng, Z.; Qiu, C.; Fang, Z.; Chen, X.; Zhang, P. Queue-aware dynamic resource allocation for the joint communication-radar system. IEEE Transactions on Vehicular Technology 2020, 70, 754–767. [Google Scholar] [CrossRef]
  16. Liu, F.; Masouros, C. Joint localization and predictive beamforming in vehicular networks: Power allocation beyond water-filling. ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2021, pp. 8393–8397.
  17. Zhong, Y.; Bi, T.; Wang, J.; Zeng, J.; Huang, Y.; Jiang, T.; Wu, Q.; Wu, S. Empowering the V2X network by integrated sensing and communications: Background, design, advances, and opportunities. IEEE Network 2022, 36, 54–60. [Google Scholar] [CrossRef]
  18. Liyanaarachchi, S.D.; Barneto, C.B.; Riihonen, T.; Valkama, M. Experimenting joint vehicular communications and sensing with optimized 5G NR waveform. 2021 IEEE 93rd Vehicular Technology Conference (VTC2021-Spring). IEEE, 2021, pp. 1–5.
  19. Decarli, N.; Guerra, A.; Giovannetti, C.; Guidi, F.; Masini, B.M. V2X sidelink localization of connected automated vehicles. IEEE Journal on Selected Areas in Communications 2023. [Google Scholar] [CrossRef]
  20. Giovannetti, C.; Decarli, N.; Bartoletti, S.; Stirling-Gallacher, R.A.; Masini, B.M. Target Positioning Accuracy of V2X Sidelink Joint Communication and Sensing. IEEE Wireless Communications Letters 2023. [Google Scholar] [CrossRef]
  21. Bartoletti, S.; Decarli, N.; Masini, B.M. Sidelink 5G-V2X for integrated sensing and communication: the impact of resource allocation. 2022 IEEE International Conference on Communications Workshops (ICC Workshops). IEEE, 2022, pp. 79–84.
  22. European Telecommunications Standards Institute (ETSI). LTE; 5G; Overall description of Radio Access Network (RAN) aspects for Vehicle-to-everything (V2X) based on LTE and NR, 1 January 2024.
  23. 3GPP. Technical specification Group services and system Aspects. Release 2018, 15.
  24. Jeon, Y.; Kim, H. An explicit reservation-augmented resource allocation scheme for C-V2X sidelink mode 4. IEEE Access 2020, 8, 147241–147255. [Google Scholar] [CrossRef]
  25. Yin, J.; Hwang, S.H. Reuse Distance-Aided Resource Selection Mechanisms for NR-V2X Sidelink Communication. Sensors 2023, 24, 253. [Google Scholar] [CrossRef]
  26. Jiawei, T.; Pawase, C.J.; Chang, K. Adaptive Sidelink Open Loop Power Control Optimization Strategies for Vehicle-to-Vehicle Communications in 5G-NR-V2X. IEEE Access 2024. [Google Scholar] [CrossRef]
  27. Molina-Galan, A.; Lusvarghi, L.; Coll-Perales, B.; Gozalvez, J.; Merani, M.L. On the Impact of Re-evaluation in 5G NR V2X Mode 2. IEEE Transactions on Vehicular Technology 2023. [Google Scholar] [CrossRef]
  28. Lu, Y.; Wang, P.; Wang, S.; Yao, W. A Q-learning based SPS resource scheduling algorithm for reliable C-V2X communication. Proceedings of the 2021 5th International Conference on Digital Signal Processing, 2021, pp. 201–206.
  29. Saad, M.M.; Tariq, M.A.; Seo, J.; Ajmal, M.; Kim, D. Age-of-information aware intelligent MAC for congestion control in NR-V2X. 2023 Fourteenth International Conference on Ubiquitous and Future Networks (ICUFN). IEEE, 2023, pp. 265–270.
  30. Urmonov, O.; Aliev, H.; Kim, H. Multi-agent deep reinforcement learning for enhancement of distributed resource allocation in vehicular network. IEEE Systems Journal 2022, 17, 491–502. [Google Scholar] [CrossRef]
  31. Campolo, C.; Molinaro, A.; Berthet, A.O.; Vinel, A. Full-duplex radios for vehicular communications. IEEE Communications Magazine 2017, 55, 182–189. [Google Scholar] [CrossRef]
  32. Campolo, C.; Bazzi, A.; Todisco, V.; Bartoletti, S.; Decarli, N.; Molinaro, A.; Berthet, A.O.; Stirling-Gallacher, R.A. Enhancing the 5G-V2X sidelink autonomous mode through full-duplex capabilities. 2022 IEEE 95th Vehicular Technology Conference:(VTC2022-Spring). IEEE, 2022, pp. 1–6.
  33. Campolo, C.; Molinaro, A.; Romeo, F.; Bazzi, A.; Berthet, A.O. Full duplex-aided sensing and scheduling in cellular-V2X mode 4. Proceedings of the 1st ACM MobiHoc workshop on technologies, models, and protocols for cooperative connected cars, 2019, pp. 19–24.
  34. Radios, F.D. Performance Analysis of V2V Beaconing Using LTE in Direct Mode with Full Duplex Radios 2015.
  35. Barneto, C.B.; Liyanaarachchi, S.D.; Heino, M.; Riihonen, T.; Valkama, M. Full duplex radio/radar technology: The enabler for advanced joint communication and sensing. IEEE Wireless Communications 2021, 28, 82–88. [Google Scholar] [CrossRef]
  36. Gaudio, L.; Kobayashi, M.; Bissinger, B.; Caire, G. Performance analysis of joint radar and communication using OFDM and OTFS. 2019 IEEE International Conference on Communications Workshops (ICC Workshops). IEEE, 2019, pp. 1–6.
  37. Keskin, M.F.; Koivunen, V.; Wymeersch, H. Limited feedforward waveform design for OFDM dual-functional radar-communications. IEEE Transactions on Signal Processing 2021, 69, 2955–2970. [Google Scholar] [CrossRef]
  38. Lien, S.Y.; Deng, D.J.; Lin, C.C.; Tsai, H.L.; Chen, T.; Guo, C.; Cheng, S.M. 3GPP NR sidelink transmissions toward 5G V2X. IEEE Access 2020, 8, 35368–35382. [Google Scholar] [CrossRef]
  39. Shuai, W.; Yan, L.; Jie, Z.; Ping, W. A novel collision supervision and avoidance algorithm for scalable MAC of vehicular networks. Chinese Journal of Electronics 2021, 30, 164–170. [Google Scholar] [CrossRef]
  40. Wang, P.; Wang, S. A fairness-enhanced intelligent MAC scheme using Q-learning-based bidirectional backoff for distributed vehicular communication networks. Tsinghua Science and Technology 2022, 28, 258–268. [Google Scholar] [CrossRef]
  41. 3rd Generation Partnership Project (3GPP). Technical Specification Group Radio Access Network; Study on LTE-based V2X Services; Release 14. Technical Report 36.885, 3rd Generation Partnership Project (3GPP), 2016.
Figure 1. NG-RAN architecture supporting the PC5 interface
Figure 1. NG-RAN architecture supporting the PC5 interface
Preprints 140140 g001
Figure 2. Process flow of sensing-based semi-persistent scheduling (SB-SPS).
Figure 2. Process flow of sensing-based semi-persistent scheduling (SB-SPS).
Preprints 140140 g002
Figure 3. Markov chain for state transition of SPS.
Figure 3. Markov chain for state transition of SPS.
Preprints 140140 g003
Figure 4. Reinforcement learning framework.
Figure 4. Reinforcement learning framework.
Preprints 140140 g004
Figure 5. CCM-SPS accelerate reselection
Figure 5. CCM-SPS accelerate reselection
Preprints 140140 g005
Figure 6. Empirical CDF of the root CRLB for range using SB-SPS,FD-enhanced and CCM-SPS.
Figure 6. Empirical CDF of the root CRLB for range using SB-SPS,FD-enhanced and CCM-SPS.
Preprints 140140 g006
Figure 7. bar graph of root CRLB(at CCDF = 95-percentile) for the range using different scheme with vary vehicle density.
Figure 7. bar graph of root CRLB(at CCDF = 95-percentile) for the range using different scheme with vary vehicle density.
Preprints 140140 g007
Figure 8. PRR over distance using SB-SPS, FD-enhanced and CCM-SPS.
Figure 8. PRR over distance using SB-SPS, FD-enhanced and CCM-SPS.
Preprints 140140 g008
Figure 9. The maximum distance allowing PRR larger than 0.95 is evaluated using conventional SB-SPS,FD-enhanced methods and CCM-SPS.
Figure 9. The maximum distance allowing PRR larger than 0.95 is evaluated using conventional SB-SPS,FD-enhanced methods and CCM-SPS.
Preprints 140140 g009
Figure 10. empirical CDF of root CRLB for range estimation
Figure 10. empirical CDF of root CRLB for range estimation
Preprints 140140 g010
Figure 11. PRR vs. distance performance of SB-SPS with different pack sizes in case of density = 50 150 250 veh/km
Figure 11. PRR vs. distance performance of SB-SPS with different pack sizes in case of density = 50 150 250 veh/km
Preprints 140140 g011
Figure 12. CCM-SPS’s range sensing performance evaluation on empirical CDF of root CRLB with different pack sizes in case of density = 50 150 250 veh/km
Figure 12. CCM-SPS’s range sensing performance evaluation on empirical CDF of root CRLB with different pack sizes in case of density = 50 150 250 veh/km
Preprints 140140 g012
Figure 13. PRR vs. distance performance of CCM-SPS with different pack sizes in case of density = 50 150 250 veh/km
Figure 13. PRR vs. distance performance of CCM-SPS with different pack sizes in case of density = 50 150 250 veh/km
Preprints 140140 g013
Figure 14. CCM-SPS’s communication performance evaluation on empirical CDF of updata delay with different pack sizes in case of density = 50 150 250 veh/km
Figure 14. CCM-SPS’s communication performance evaluation on empirical CDF of updata delay with different pack sizes in case of density = 50 150 250 veh/km
Preprints 140140 g014
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated