TY - GEN
T1 - Aerial 360-Degree Video Delivery for Immersive First Person View UAV Navigation
AU - Singh, Simran
AU - Chakareski, Jacob
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Adaptive transmission of conventional video from a UAV to the ground has been researched for various applications, but the research topic of 360° video transmission from a UAV for the specific application of first-person view (FPV) based navigation is still nascent. In this work, we present adaptive 360° video compression and streaming methods to optimize the perceptual quality of experience of a pilot, who navigates the UAV in real time by viewing this immersive FPV feed, which is sent wirelessly from the UAV to the pilot. This adaptation of the 360° FPV feed is performed in response to the wireless channel conditions and the pilot's viewport, wherein each 360° frame is split into two regions of variable size, one meant to be within the pilot's viewport and the other outside. Each region is encoded using different H. 265 quantization parameters (QP) and modulation orders. We model the scenario realistically by generating probability distributions of the variation in frame size and quality with QP, for aerial 360° videos. These models are expressed using a two-term exponential function, whose parameters are also provided. This model achieves lower prediction errors than the single-term exponential and power law functions. Simulations on a set of aerial 360-degree videos demonstrate that the adaptive approach achieves 9.73 dB (21.77 %) greater QoE than a baseline approach that utilizes throughput-based adaptive bit rate algorithm (ABR) to tune QP per GoP, and a 5G new radio adaptive modulation scheme (AMS) to tune modulation order: Additionally, we present a deep reinforcement learning approach to adapt FPV, which achieves an expected pilot QoE just 2.07 dB lower than the adaptive approach, while being significantly faster and requiring no prior knowledge of the environment.
AB - Adaptive transmission of conventional video from a UAV to the ground has been researched for various applications, but the research topic of 360° video transmission from a UAV for the specific application of first-person view (FPV) based navigation is still nascent. In this work, we present adaptive 360° video compression and streaming methods to optimize the perceptual quality of experience of a pilot, who navigates the UAV in real time by viewing this immersive FPV feed, which is sent wirelessly from the UAV to the pilot. This adaptation of the 360° FPV feed is performed in response to the wireless channel conditions and the pilot's viewport, wherein each 360° frame is split into two regions of variable size, one meant to be within the pilot's viewport and the other outside. Each region is encoded using different H. 265 quantization parameters (QP) and modulation orders. We model the scenario realistically by generating probability distributions of the variation in frame size and quality with QP, for aerial 360° videos. These models are expressed using a two-term exponential function, whose parameters are also provided. This model achieves lower prediction errors than the single-term exponential and power law functions. Simulations on a set of aerial 360-degree videos demonstrate that the adaptive approach achieves 9.73 dB (21.77 %) greater QoE than a baseline approach that utilizes throughput-based adaptive bit rate algorithm (ABR) to tune QP per GoP, and a 5G new radio adaptive modulation scheme (AMS) to tune modulation order: Additionally, we present a deep reinforcement learning approach to adapt FPV, which achieves an expected pilot QoE just 2.07 dB lower than the adaptive approach, while being significantly faster and requiring no prior knowledge of the environment.
KW - 360-degree video
KW - UAVs
KW - first-person view
KW - quality of experience
KW - wireless communications
UR - http://www.scopus.com/inward/record.url?scp=85190264432&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85190264432&partnerID=8YFLogxK
U2 - 10.1109/ISM59092.2023.00026
DO - 10.1109/ISM59092.2023.00026
M3 - Conference contribution
AN - SCOPUS:85190264432
T3 - Proceedings - 2023 IEEE International Symposium on Multimedia, ISM 2023
SP - 139
EP - 146
BT - Proceedings - 2023 IEEE International Symposium on Multimedia, ISM 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 IEEE International Symposium on Multimedia, ISM 2023
Y2 - 11 December 2023 through 13 December 2023
ER -