TY - GEN
T1 - Performance Evaluation of 5G Delay-Sensitive Single-Carrier Multi-User Downlink Scheduling
AU - Omer, Anjali
AU - Malandra, Filippo
AU - Chakareski, Jacob
AU - Mastronarde, Nicholas
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - The coexistence of a wide variety of different applications with diverse Quality of Service (QoS) requirements calls for more sophisticated radio resource scheduling (RRS) in 5G networks compared to previous generations. To address this challenge, a growing body of research formulates the RRS problem as a Markov decision process (MDP) and aims to solve it using deep reinforcement learning (DRL). A key consideration when formulating an MDP is the choice of reward function, which determines the goal of the decision agent. Despite the reward function being a critical component of an MDP, there is currently no systematic study comparing how different reward functions affect network performance. To this end, we carry out a comparative study of the delay and overflow performance using several reward functions that aim to minimize packet delays. Through extensive simulations under different traffic and channel conditions, we identify a reward function that can achieve near optimal delay with up to 55 - 67% fewer packet drops than the other investigated options, and does not require any tuning.
AB - The coexistence of a wide variety of different applications with diverse Quality of Service (QoS) requirements calls for more sophisticated radio resource scheduling (RRS) in 5G networks compared to previous generations. To address this challenge, a growing body of research formulates the RRS problem as a Markov decision process (MDP) and aims to solve it using deep reinforcement learning (DRL). A key consideration when formulating an MDP is the choice of reward function, which determines the goal of the decision agent. Despite the reward function being a critical component of an MDP, there is currently no systematic study comparing how different reward functions affect network performance. To this end, we carry out a comparative study of the delay and overflow performance using several reward functions that aim to minimize packet delays. Through extensive simulations under different traffic and channel conditions, we identify a reward function that can achieve near optimal delay with up to 55 - 67% fewer packet drops than the other investigated options, and does not require any tuning.
UR - http://www.scopus.com/inward/record.url?scp=85178257183&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85178257183&partnerID=8YFLogxK
U2 - 10.1109/PIMRC56721.2023.10293980
DO - 10.1109/PIMRC56721.2023.10293980
M3 - Conference contribution
AN - SCOPUS:85178257183
T3 - IEEE International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC
BT - 2023 IEEE 34th Annual International Symposium on Personal, Indoor and Mobile Radio Communications
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 34th IEEE Annual International Symposium on Personal, Indoor and Mobile Radio Communications, PIMRC 2023
Y2 - 5 September 2023 through 8 September 2023
ER -