Multiagent Reinforcement Learning-Based Semi-Persistent Scheduling Scheme in C-V2X Mode 4

Bo Gu, Weixiang Chen, Mamoun Alazab, Xiaojun Tan, Mohsen Guizani

    Research output: Contribution to journalArticlepeer-review

    21 Citations (Scopus)

    Abstract

    The Third Generation Partnership Project has standardized cellular vehicle-to-everything (C-V2X) sidelink Mode 4 to support direct communication between vehicles. In Mode 4, the sensing-based semipersistent scheduling (SPS) scheme enables vehicles to autonomously reserve and select radio resources. In particular, SPS has three processes to realize the resource scheduling, including continuously sensing resources, probabilistically reselecting resources, and periodically reserving resources. However, vehicles randomly select resources from the available resource lists in the resource reselection process, resulting in frequent packet collisions especially when radio resources are insufficient. Unlike the traditional SPS, this paper proposes a multiagent deep reinforcement learning-based SPS (RL-SPS) algorithm to help vehicles select appropriate radio resources with the aim of reducing packet collisions. Furthermore, a multi-head attention mechanism is adopted to improve the training efficiency by helping vehicles selectively pay attention to the observations and actions of neighbouring vehicles. It is worth noting that the RL-SPS algorithm fits the characteristics of Mode 4, which selects resources without requiring any global information. Simulation results show that RL-SPS outperforms other decentralized approaches and demonstrate the scalability and robustness of RL-SPS in a dynamic vehicular network.

    Original languageEnglish
    Pages (from-to)12044-12056
    Number of pages13
    JournalIEEE Transactions on Vehicular Technology
    Volume71
    Issue number11
    Early online date2022
    DOIs
    Publication statusPublished - 1 Nov 2022

    Bibliographical note

    Funding Information:
    This work was supported in part by the National_Science_Foundation of China (NSFC) under Grant U20A20175 and in part by the National Key RandD Program of China under Grant 2020YFB1713800

    Publisher Copyright:
    © 1967-2012 IEEE.

    Fingerprint

    Dive into the research topics of 'Multiagent Reinforcement Learning-Based Semi-Persistent Scheduling Scheme in C-V2X Mode 4'. Together they form a unique fingerprint.

    Cite this