AI Agents 相关度: 9/10

C-STEP: Continuous Space-Time Empowerment for Physics-informed Safe Reinforcement Learning of Mobile Agents

Guihlerme Daubt, Adrian Redder
arXiv: 2603.24241v1 发布: 2026-03-25 更新: 2026-03-25

AI 摘要

提出C-STEP安全强化学习方法,通过物理信息指导奖励函数,提升移动机器人的安全导航能力。

主要贡献

  • 提出C-STEP安全度量方法
  • 设计物理信息指导的内在奖励函数
  • 提升移动机器人的安全导航性能

方法论

基于强化学习,利用C-STEP度量指导奖励函数设计,优化智能体在复杂环境中的安全导航。

原文摘要

Safe navigation in complex environments remains a central challenge for reinforcement learning (RL) in robotics. This paper introduces Continuous Space-Time Empowerment for Physics-informed (C-STEP) safe RL, a novel measure of agent-centric safety tailored to deterministic, continuous domains. This measure can be used to design physics-informed intrinsic rewards by augmenting positive navigation reward functions. The reward incorporates the agents internal states (e.g., initial velocity) and forward dynamics to differentiate safe from risky behavior. By integrating C-STEP with navigation rewards, we obtain an intrinsic reward function that jointly optimizes task completion and collision avoidance. Numerical results demonstrate fewer collisions, reduced proximity to obstacles, and only marginal increases in travel time. Overall, C-STEP offers an interpretable, physics-informed approach to reward shaping in RL, contributing to safety for agentic mobile robotic systems.

标签

强化学习 安全导航 机器人 奖励塑造 物理信息

arXiv 分类

eess.SY cs.LG