AI Agents 相关度: 9/10

C-STEP: Continuous Space-Time Empowerment for Physics-informed Safe Reinforcement Learning of Mobile Agents

Guihlerme Daubt, Adrian Redder

arXiv: 2603.24241v1 发布: 2026-03-25 更新: 2026-03-25

下载 PDF arXiv 页面

AI 摘要

提出C-STEP安全强化学习方法，通过物理信息指导奖励函数，提升移动机器人的安全导航能力。

主要贡献

提出C-STEP安全度量方法
设计物理信息指导的内在奖励函数
提升移动机器人的安全导航性能

方法论

基于强化学习，利用C-STEP度量指导奖励函数设计，优化智能体在复杂环境中的安全导航。

原文摘要

Safe navigation in complex environments remains a central challenge for reinforcement learning (RL) in robotics. This paper introduces Continuous Space-Time Empowerment for Physics-informed (C-STEP) safe RL, a novel measure of agent-centric safety tailored to deterministic, continuous domains. This measure can be used to design physics-informed intrinsic rewards by augmenting positive navigation reward functions. The reward incorporates the agents internal states (e.g., initial velocity) and forward dynamics to differentiate safe from risky behavior. By integrating C-STEP with navigation rewards, we obtain an intrinsic reward function that jointly optimizes task completion and collision avoidance. Numerical results demonstrate fewer collisions, reduced proximity to obstacles, and only marginal increases in travel time. Overall, C-STEP offers an interpretable, physics-informed approach to reward shaping in RL, contributing to safety for agentic mobile robotic systems.

arXiv 分类

eess.SY cs.LG

AI 摘要

主要贡献

方法论

原文摘要

标签

arXiv 分类