A Reinforcement Learning Method for Joint Mode Selection and Power Adaptation in the V2V Communication Network in 5G
Author | Zhao, Di |
Author | Qin, Hao |
Author | Song, Bin |
Author | Zhang, Yanli |
Author | Du, Xiaojiang |
Author | Guizani, Mohsen |
Available date | 2022-12-06T17:19:46Z |
Publication Date | 2020-06-01 |
Publication Name | IEEE Transactions on Cognitive Communications and Networking |
Identifier | http://dx.doi.org/10.1109/TCCN.2020.2983170 |
Citation | Zhao, D., Qin, H., Song, B., Zhang, Y., Du, X., & Guizani, M. (2020). A reinforcement learning method for joint mode selection and power adaptation in the V2V communication network in 5G. IEEE Transactions on Cognitive Communications and Networking, 6(2), 452-463. |
Abstract | A 5G network is the key driving factor in the development of vehicle-to-vehicle (V2V) communication technology, and V2V communication in 5G has recently attracted great interest. In the V2V communication network, users can choose different transmission modes and power levels for communication, to guarantee their quality-of-service (QoS), high capacity of vehicle-to-infrastructure (V2I) links and ultra-reliability of V2Vlinks. Aiming atV2V communication mode selection and power adaptation in 5G communication networks, a reinforcement learning (RL) framework based on slow fading parameters and statistical information is proposed. In this paper, our objective is to maximize the total capacity of V2I links while guaranteeing the strict transmission delay and reliability constraints of V2V links. Considering the fast channel variations and the continuous-valued state in a high mobility vehicular environment, we use a multi-agent double deep Q-learning (DDQN) algorithm. Each V2V link is considered as an agent, learning the optimal policy with the updated Q-network by interacting with the environment. Experiments verify the convergence of our algorithm. The simulation results show that the proposed scheme can significantly optimize the total capacity of the V2I links and ensure the latency and reliability requirements of the V2V links. |
Sponsor | This work has been supported by the National Natural Science Foundation of China (Nos. 61772387), the Fundamental Research Funds of Ministry of Education and China Mobile (MCM20170202), the National Natural Science Foundation of Shaanxi Province (Grant No. 2019ZDLGY03-03) and also supported by the ISN State Key Laboratory. |
Language | en |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Subject | 5G mode selection power adaptation reinforcement learning V2V |
Type | Article |
Pagination | 452-463 |
Issue Number | 2 |
Volume Number | 6 |
Files in this item
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this item. |
This item appears in the following Collection(s)
-
Computer Science & Engineering [2402 items ]