AI Agents 相关度: 6/10

A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling

Ruisong Zhou, Haijun Zou, Li Zhou, Chumin Sun, Zaiwen Wen
arXiv: 2603.23249v1 发布: 2026-03-24 更新: 2026-03-24

AI 摘要

提出WeCAN框架,利用强化学习解决异构DAG调度问题,优化makespan并加速调度。

主要贡献

  • 提出WeCAN端到端强化学习框架,解决异构DAG调度问题
  • 通过订单空间分析解决生成诱导的最优性差距
  • 设计skip-extended realization以扩大可达订单集合

方法论

基于强化学习,采用双阶段单次前向传递,结合加权交叉注意力编码器和跳跃扩展实现。

原文摘要

Efficient scheduling of directed acyclic graphs (DAGs) in heterogeneous environments is challenging due to resource capacities and dependencies. In practice, the need for adaptability across environments with varying resource pools and task types, alongside rapid schedule generation, complicates these challenges. We propose WeCAN, an end-to-end reinforcement learning framework for heterogeneous DAG scheduling that addresses task--pool compatibility coefficients and generation-induced optimality gaps. It adopts a two-stage single-pass design: a single forward pass produces task--pool scores and global parameters, followed by a generation map that constructs schedules without repeated network calls. Its weighted cross-attention encoder models task--pool interactions gated by compatibility coefficients, and is size-agnostic to environment fluctuations. Moreover, widely used list-scheduling maps can incur generation-induced optimality gaps from restricted reachability. We introduce an order-space analysis that characterizes the reachable set of generation maps via feasible schedule orders, explains the mechanism behind generation-induced gaps, and yields sufficient conditions for gap elimination. Guided by these conditions, we design a skip-extended realization with an analytically parameterized decreasing skip rule, which enlarges the reachable order set while preserving single-pass efficiency. Experiments on computation graphs and real-world TPC-H DAGs demonstrate improved makespan over strong baselines, with inference time comparable to classical heuristics and faster than multi-round neural schedulers.

标签

强化学习 DAG调度 异构环境 订单空间分析

arXiv 分类

cs.LG cs.AI math.OC