TY - GEN
T1 - On a Hybrid Scheduling Approach Combining Whale Optimization and Deep Reinforcement Learning for DAG-Structured Machine Learning Workloads
AU - Du, Nana
AU - Nie, Weike
AU - Wu, Chase
AU - Song, Ruiqi
AU - Hou, Aiqin
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/12/2
Y1 - 2025/12/2
N2 - In RF computing systems, devices with limited resources work together to run Machine Learning (ML) training and inference. In these systems, efficient task scheduling is key to meeting time and energy limits. The workloads are often represented as Directed Acyclic Graphs (DAGs), where each task depends on others. Scheduling DAG tasks across different accelerators and wireless links is challenging because of limits in RF channel bandwidth, device differences, and task data dependencies. This problem can be modeled as a Nonlinear Integer Programming (NIP) problem and is NP-complete. Based on our earlier work showing a clear relationship between Scheduling Plan Distance (SPD) and Finish Time Gap (FTG), we design WORL-RTGS, a hybrid method. It combines the global search ability of the Whale Optimization Algorithm (WOA) with the adaptive learning of Double Deep Q-Networks (DDQN). Experiments with real Alibaba ML workload traces in RF-based GPU systems show that WORL-RTGS improves WOA’s stability and reduces completion time by up to 66.56% compared with five state-of-the-art baselines.
AB - In RF computing systems, devices with limited resources work together to run Machine Learning (ML) training and inference. In these systems, efficient task scheduling is key to meeting time and energy limits. The workloads are often represented as Directed Acyclic Graphs (DAGs), where each task depends on others. Scheduling DAG tasks across different accelerators and wireless links is challenging because of limits in RF channel bandwidth, device differences, and task data dependencies. This problem can be modeled as a Nonlinear Integer Programming (NIP) problem and is NP-complete. Based on our earlier work showing a clear relationship between Scheduling Plan Distance (SPD) and Finish Time Gap (FTG), we design WORL-RTGS, a hybrid method. It combines the global search ability of the Whale Optimization Algorithm (WOA) with the adaptive learning of Double Deep Q-Networks (DDQN). Experiments with real Alibaba ML workload traces in RF-based GPU systems show that WORL-RTGS improves WOA’s stability and reduces completion time by up to 66.56% compared with five state-of-the-art baselines.
KW - Radio frequency
KW - double deep Q-network
KW - machine learning workloads
KW - whale optimization algorithm
KW - workload scheduling
UR - https://www.scopus.com/pages/publications/105025471106
UR - https://www.scopus.com/pages/publications/105025471106#tab=citedBy
U2 - 10.1145/3737906.3767105
DO - 10.1145/3737906.3767105
M3 - Conference contribution
AN - SCOPUS:105025471106
T3 - RFCom 2025 - Proceedings of the 2025 the 2nd International Workshop on Radio Frequency (RF) Computing
SP - 41
EP - 46
BT - RFCom 2025 - Proceedings of the 2025 the 2nd International Workshop on Radio Frequency (RF) Computing
PB - Association for Computing Machinery, Inc
T2 - 2nd International Workshop on Radio Frequency (RF) Computing, RFCom 2025
Y2 - 4 November 2025 through 8 November 2025
ER -