Abstract
The Third Generation Partnership Project has standardized cellular vehicle-to-everything (C-V2X) sidelink Mode 4 to support direct communication between vehicles. In Mode 4, the sensing-based semipersistent scheduling (SPS) scheme enables vehicles to autonomously reserve and select radio resources. In particular, SPS has three processes to realize the resource scheduling, including continuously sensing resources, probabilistically reselecting resources, and periodically reserving resources. However, vehicles randomly select resources from the available resource lists in the resource reselection process, resulting in frequent packet collisions especially when radio resources are insufficient. Unlike the traditional SPS, this paper proposes a multiagent deep reinforcement learning-based SPS (RL-SPS) algorithm to help vehicles select appropriate radio resources with the aim of reducing packet collisions. Furthermore, a multi-head attention mechanism is adopted to improve the training efficiency by helping vehicles selectively pay attention to the observations and actions of neighbouring vehicles. It is worth noting that the RL-SPS algorithm fits the characteristics of Mode 4, which selects resources without requiring any global information. Simulation results show that RL-SPS outperforms other decentralized approaches and demonstrate the scalability and robustness of RL-SPS in a dynamic vehicular network.
Original language | English |
---|---|
Pages (from-to) | 12044-12056 |
Number of pages | 13 |
Journal | IEEE Transactions on Vehicular Technology |
Volume | 71 |
Issue number | 11 |
Early online date | 2022 |
DOIs | |
Publication status | Published - 1 Nov 2022 |
Bibliographical note
Funding Information:This work was supported in part by the National_Science_Foundation of China (NSFC) under Grant U20A20175 and in part by the National Key RandD Program of China under Grant 2020YFB1713800
Publisher Copyright:
© 1967-2012 IEEE.