A Reinforcement Learning Method for Joint Mode Selection and Power Adaptation in the V2V Communication Network in 5G

Zhao, Di; Qin, Hao; Song, Bin; Zhang, Yanli; Du, Xiaojiang; Guizani, Mohsen

Author	Zhao, Di
Author	Qin, Hao
Author	Song, Bin
Author	Zhang, Yanli
Author	Du, Xiaojiang
Author	Guizani, Mohsen
Available date	2022-12-06T17:19:46Z
Publication Date	2020-06-01
Publication Name	IEEE Transactions on Cognitive Communications and Networking
Identifier	http://dx.doi.org/10.1109/TCCN.2020.2983170
Citation	Zhao, D., Qin, H., Song, B., Zhang, Y., Du, X., & Guizani, M. (2020). A reinforcement learning method for joint mode selection and power adaptation in the V2V communication network in 5G. IEEE Transactions on Cognitive Communications and Networking, 6(2), 452-463.
URI	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85082530665&origin=inward
URI	http://hdl.handle.net/10576/37007
Abstract	A 5G network is the key driving factor in the development of vehicle-to-vehicle (V2V) communication technology, and V2V communication in 5G has recently attracted great interest. In the V2V communication network, users can choose different transmission modes and power levels for communication, to guarantee their quality-of-service (QoS), high capacity of vehicle-to-infrastructure (V2I) links and ultra-reliability of V2V links. Aiming at V2V communication mode selection and power adaptation in 5G communication networks, a reinforcement learning (RL) framework based on slow fading parameters and statistical information is proposed. In this paper, our objective is to maximize the total capacity of V2I links while guaranteeing the strict transmission delay and reliability constraints of V2V links. Considering the fast channel variations and the continuous-valued state in a high mobility vehicular environment, we use a multi-agent double deep Q-learning (DDQN) algorithm. Each V2V link is considered as an agent, learning the optimal policy with the updated Q-network by interacting with the environment. Experiments verify the convergence of our algorithm. The simulation results show that the proposed scheme can significantly optimize the total capacity of the V2I links and ensure the latency and reliability requirements of the V2V links.
Sponsor	This work has been supported by the National Natural Science Foundation of China (No. 61772387), the Fundamental Research Funds of Ministry of Education and China Mobile (MCM20170202), the National Natural Science Foundation of Shaanxi Province (Grant No. 2019ZDLGY03-03) and also supported by the ISN State Key Laboratory.
Language	en
Publisher	Institute of Electrical and Electronics Engineers Inc.
Subject	5G; mode selection; power adaptation; reinforcement learning; V2V
Title	A Reinforcement Learning Method for Joint Mode Selection and Power Adaptation in the V2V Communication Network in 5G
Type	Article
Pagination	452-463
Issue Number	2
Volume Number	6
dc.accessType	Abstract Only

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Computer Science & Engineering [2484 items]
