Low-Complexity Physics-Informed Reinforcement Learning Using Post-Decision States with Stochastic Sampling

Andrew Corra, Nicholas Mastronarde, Jacob Chakareski

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Delay-sensitive Internet of Things (IoT) applications continue to grow in prevalence as new wireless technologies are adopted. Since these applications often operate in unknown dynamic environments, reinforcement learning (RL) has emerged as an effective method to learn optimal decision policies that improve their overall performance. However, typical data-driven RL techniques that have been adopted to solve these problems do not exploit available knowledge of system dynamics. Consequently, they must 'learn' some information about the system that may already be known to the system's designer. Post-decision state (PDS) learning, on the other hand, leverages known system information (i.e., it is 'physics-informed') to simplify the learning task and improve learning performance. However, this comes at the cost of increased computational complexity, and makes it impractical to implement on resource constrained devices. This work introduces stochastic PDS learning, a novel RL algorithm that combines traditional PDS learning with stochastic sampling to produce a physics-informed RL agent that can leverage known system information even with limited computational resources. Performance of stochastic PDS learning is compared against numerous traditional RL algorithms in the context of a delaysensitive energy-efficient scheduling problem simulated as an environment in Gymnasium.

Original languageEnglish (US)
Title of host publicationICC 2025 - IEEE International Conference on Communications
EditorsMatthew Valenti, David Reed, Melissa Torres
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages4559-4564
Number of pages6
ISBN (Electronic)9798331505219
DOIs
StatePublished - 2025
Event2025 IEEE International Conference on Communications, ICC 2025 - Montreal, Canada
Duration: Jun 8 2025Jun 12 2025

Publication series

NameIEEE International Conference on Communications
ISSN (Print)1550-3607

Conference

Conference2025 IEEE International Conference on Communications, ICC 2025
Country/TerritoryCanada
CityMontreal
Period6/8/256/12/25

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Low-Complexity Physics-Informed Reinforcement Learning Using Post-Decision States with Stochastic Sampling'. Together they form a unique fingerprint.

Cite this