AI Agents 相关度: 9/10

When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making

Jun Liu, Pu Zhao, Zhenglun Kong, Xuan Shen, Peiyan Dong, Fan Yang, Lin Cui, Hao Tang, Geng Yuan, Wei Niu, Wenbin Zhang, Xue Lin, Gaowen Liu, Yanzhi Wang, Dong Huang
arXiv: 2603.16673v1 发布: 2026-03-17 更新: 2026-03-17

AI 摘要

RARRL框架通过强化学习自适应控制机器人何时进行推理,优化资源使用并提升任务成功率。

主要贡献

  • 提出了RARRL框架,用于资源感知的机器人推理决策
  • 利用强化学习学习高层编排策略,自适应决定何时推理和使用何种推理角色
  • 实验证明RARRL在任务成功率、执行延迟和鲁棒性方面优于固定或启发式策略

方法论

使用强化学习训练一个高层策略,该策略根据观察、历史和资源决定是否调用LLM进行推理以及如何分配计算预算。

原文摘要

Embodied robotic systems increasingly rely on large language model (LLM)-based agents to support high-level reasoning, planning, and decision-making during interactions with the environment. However, invoking LLM reasoning introduces substantial computational latency and resource overhead, which can interrupt action execution and reduce system reliability. Excessive reasoning may delay actions, while insufficient reasoning often leads to incorrect decisions and task failures. This raises a fundamental question for embodied agents: when should the agent reason, and when should it act? In this work, we propose RARRL (Resource-Aware Reasoning via Reinforcement Learning), a hierarchical framework for resource-aware orchestration of embodied agents. Rather than learning low-level control policies, RARRL learns a high-level orchestration policy that operates at the agent's decision-making layer. This policy enables the agent to adaptively determine whether to invoke reasoning, which reasoning role to employ, and how much computational budget to allocate based on current observations, execution history, and remaining resources. Extensive experiments, including evaluations with empirical latency profiles derived from the ALFRED benchmark, show that RARRL consistently improves task success rates while reducing execution latency and enhancing robustness compared with fixed or heuristic reasoning strategies. These results demonstrate that adaptive reasoning control is essential for building reliable and efficient embodied robotic agents.

标签

强化学习 机器人 LLM 推理 资源管理

arXiv 分类

cs.RO cs.AI cs.LG