TY - GEN
T1 - Deep reinforcement learning for delay-sensitive LTE downlink scheduling
AU - Sharma, Nikhilesh
AU - Zhang, Sen
AU - Somayajula Venkata, Someshwar Rao
AU - Malandra, Filippo
AU - Mastronarde, Nicholas
AU - Chakareski, Jacob
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/8
Y1 - 2020/8
N2 - We consider an LTE downlink scheduling system where a base station allocates resource blocks (RBs) to users running delay-sensitive applications. We aim to find a scheduling policy that minimizes the queuing delay experienced by the users. We formulate this problem as a Markov Decision Process (MDP) that integrates the channel quality indicator (CQI) of each user in each RB, and queue status of each user. To solve this complex problem involving high dimensional state and action spaces, we propose a Deep Reinforcement Learning based scheduling framework that utilizes the Deep Deterministic Policy Gradient (DDPG) algorithm to minimize the queuing delay experienced by the users. Our extensive experiments demonstrate that our approach outperforms state-of-the-art benchmarks in terms of average throughput, queuing delay, and fairness, achieving up to 55% lower queuing delay than the best benchmark.
AB - We consider an LTE downlink scheduling system where a base station allocates resource blocks (RBs) to users running delay-sensitive applications. We aim to find a scheduling policy that minimizes the queuing delay experienced by the users. We formulate this problem as a Markov Decision Process (MDP) that integrates the channel quality indicator (CQI) of each user in each RB, and queue status of each user. To solve this complex problem involving high dimensional state and action spaces, we propose a Deep Reinforcement Learning based scheduling framework that utilizes the Deep Deterministic Policy Gradient (DDPG) algorithm to minimize the queuing delay experienced by the users. Our extensive experiments demonstrate that our approach outperforms state-of-the-art benchmarks in terms of average throughput, queuing delay, and fairness, achieving up to 55% lower queuing delay than the best benchmark.
UR - http://www.scopus.com/inward/record.url?scp=85094110161&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85094110161&partnerID=8YFLogxK
U2 - 10.1109/PIMRC48278.2020.9217110
DO - 10.1109/PIMRC48278.2020.9217110
M3 - Conference contribution
AN - SCOPUS:85094110161
T3 - IEEE International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC
BT - 2020 IEEE 31st Annual International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 31st IEEE Annual International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2020
Y2 - 31 August 2020 through 3 September 2020
ER -