A Reinforcement Learning Method for Joint Mode Selection and Power Adaptation in the V2V Communication Network in 5G

Zhao, Di; Qin, Hao; Song, Bin; Zhang, Yanli; Du, Xiaojiang; Guizani, Mohsen

Date

2020-06-01

Author

Zhao, Di
Qin, Hao
Song, Bin
Zhang, Yanli
Du, Xiaojiang
...show more authors ...show less authors

Metadata

Show full item record

Abstract

A 5G network is the key driving factor in the development of vehicle-to-vehicle (V2V) communication technology, and V2V communication in 5G has recently attracted great interest. In the V2V communication network, users can choose different transmission modes and power levels for communication, to guarantee their quality-of-service (QoS), high capacity of vehicle-to-infrastructure (V2I) links and ultra-reliability of V2Vlinks. Aiming atV2V communication mode selection and power adaptation in 5G communication networks, a reinforcement learning (RL) framework based on slow fading parameters and statistical information is proposed. In this paper, our objective is to maximize the total capacity of V2I links while guaranteeing the strict transmission delay and reliability constraints of V2V links. Considering the fast channel variations and the continuous-valued state in a high mobility vehicular environment, we use a multi-agent double deep Q-learning (DDQN) algorithm. Each V2V link is considered as an agent, learning the optimal policy with the updated Q-network by interacting with the environment. Experiments verify the convergence of our algorithm. The simulation results show that the proposed scheme can significantly optimize the total capacity of the V2I links and ensure the latency and reliability requirements of the V2V links.