Skip to main navigation Skip to search Skip to main content

On a Hybrid Scheduling Approach Combining Whale Optimization and Deep Reinforcement Learning for DAG-Structured Machine Learning Workloads

  • Nana Du
  • , Weike Nie
  • , Chase Wu
  • , Ruiqi Song
  • , Aiqin Hou

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

In RF computing systems, devices with limited resources work together to run Machine Learning (ML) training and inference. In these systems, efficient task scheduling is key to meeting time and energy limits. The workloads are often represented as Directed Acyclic Graphs (DAGs), where each task depends on others. Scheduling DAG tasks across different accelerators and wireless links is challenging because of limits in RF channel bandwidth, device differences, and task data dependencies. This problem can be modeled as a Nonlinear Integer Programming (NIP) problem and is NP-complete. Based on our earlier work showing a clear relationship between Scheduling Plan Distance (SPD) and Finish Time Gap (FTG), we design WORL-RTGS, a hybrid method. It combines the global search ability of the Whale Optimization Algorithm (WOA) with the adaptive learning of Double Deep Q-Networks (DDQN). Experiments with real Alibaba ML workload traces in RF-based GPU systems show that WORL-RTGS improves WOA’s stability and reduces completion time by up to 66.56% compared with five state-of-the-art baselines.

Original languageEnglish (US)
Title of host publicationRFCom 2025 - Proceedings of the 2025 the 2nd International Workshop on Radio Frequency (RF) Computing
PublisherAssociation for Computing Machinery, Inc
Pages41-46
Number of pages6
ISBN (Electronic)9798400719837
DOIs
StatePublished - Dec 2 2025
Externally publishedYes
Event2nd International Workshop on Radio Frequency (RF) Computing, RFCom 2025 - Hong Kong, China
Duration: Nov 4 2025Nov 8 2025

Publication series

NameRFCom 2025 - Proceedings of the 2025 the 2nd International Workshop on Radio Frequency (RF) Computing

Conference

Conference2nd International Workshop on Radio Frequency (RF) Computing, RFCom 2025
Country/TerritoryChina
CityHong Kong
Period11/4/2511/8/25

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Control and Systems Engineering
  • Electrical and Electronic Engineering

Keywords

  • Radio frequency
  • double deep Q-network
  • machine learning workloads
  • whale optimization algorithm
  • workload scheduling

Fingerprint

Dive into the research topics of 'On a Hybrid Scheduling Approach Combining Whale Optimization and Deep Reinforcement Learning for DAG-Structured Machine Learning Workloads'. Together they form a unique fingerprint.

Cite this