TY - JOUR
T1 - Deep Reinforcement Learning for Cooperative Content Caching in Vehicular Edge Computing and Networks
AU - Qiao, Guanhua
AU - Leng, Supeng
AU - Maharjan, Sabita
AU - Zhang, Yan
AU - Ansari, Nirwan
N1 - Funding Information:
Manuscript received April 27, 2019; revised July 11, 2019 and August 25, 2019; accepted September 21, 2019. Date of publication October 22, 2019; date of current version January 10, 2020. This work was supported in part by the National Key Research and Development Program of China under Grant 2018YFC0807101, in part by the Science and Technology Program of Sichuan Province, China, under Grant 2019YFH0007, in part by the Fundamental Research Funds for the Central Universities, China, under Grant ZYGX2016Z011, and in part by the EU H2020 Project COSAFE under Grant MSCA-RISE-2018-824019. (Corresponding author: Supeng Leng.) G. Qiao and S. Leng are with the School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China (e-mail: qghuestc@126.com; spleng@uestc.edu.cn).
Publisher Copyright:
© 2014 IEEE.
PY - 2020/1
Y1 - 2020/1
N2 - In this article, we propose a cooperative edge caching scheme, a new paradigm to jointly optimize the content placement and content delivery in the vehicular edge computing and networks, with the aid of the flexible trilateral cooperations among a macro-cell station, roadside units, and smart vehicles. We formulate the joint optimization problem as a double time-scale Markov decision process (DTS-MDP), based on the fact that the time-scale of content timeliness changes less frequently as compared to the vehicle mobility and network states during the content delivery process. At the beginning of the large time-scale, the content placement/updating decision can be obtained according to the content popularity, vehicle driving paths, and resource availability. On the small time-scale, the joint vehicle scheduling and bandwidth allocation scheme is designed to minimize the content access cost while satisfying the constraint on content delivery latency. To solve the long-term mixed integer linear programming (LT-MILP) problem, we propose a nature-inspired method based on the deep deterministic policy gradient (DDPG) framework to obtain a suboptimal solution with a low computation complexity. The simulation results demonstrate that the proposed cooperative caching system can reduce the system cost, as well as the content delivery latency, and improve content hit ratio, as compared to the noncooperative and random edge caching schemes.
AB - In this article, we propose a cooperative edge caching scheme, a new paradigm to jointly optimize the content placement and content delivery in the vehicular edge computing and networks, with the aid of the flexible trilateral cooperations among a macro-cell station, roadside units, and smart vehicles. We formulate the joint optimization problem as a double time-scale Markov decision process (DTS-MDP), based on the fact that the time-scale of content timeliness changes less frequently as compared to the vehicle mobility and network states during the content delivery process. At the beginning of the large time-scale, the content placement/updating decision can be obtained according to the content popularity, vehicle driving paths, and resource availability. On the small time-scale, the joint vehicle scheduling and bandwidth allocation scheme is designed to minimize the content access cost while satisfying the constraint on content delivery latency. To solve the long-term mixed integer linear programming (LT-MILP) problem, we propose a nature-inspired method based on the deep deterministic policy gradient (DDPG) framework to obtain a suboptimal solution with a low computation complexity. The simulation results demonstrate that the proposed cooperative caching system can reduce the system cost, as well as the content delivery latency, and improve content hit ratio, as compared to the noncooperative and random edge caching schemes.
KW - Content delivery
KW - content placement
KW - cooperative edge caching
KW - deep deterministic policy gradient (DDPG)
KW - double time-scale Markov decision process (DTS-MDP)
KW - vehicular edge computing and networks
UR - http://www.scopus.com/inward/record.url?scp=85078253824&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85078253824&partnerID=8YFLogxK
U2 - 10.1109/JIOT.2019.2945640
DO - 10.1109/JIOT.2019.2945640
M3 - Article
AN - SCOPUS:85078253824
SN - 2327-4662
VL - 7
SP - 247
EP - 257
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
IS - 1
M1 - 8879573
ER -