AI Agents 相关度: 8/10

AERR-Nav: Adaptive Exploration-Recovery-Reminiscing Strategy for Zero-Shot Object Navigation

Jingzhi Huang, Junkai Huang, Haoyang Yang, Haoang Li, Yi Wang

arXiv: 2603.17712v1 发布: 2026-03-18 更新: 2026-03-18

下载 PDF arXiv 页面

AI 摘要

AERR-Nav通过自适应探索策略，提升了零样本目标导航在复杂环境下的性能。

主要贡献

提出自适应探索-恢复-回忆策略（AERR）
设计自适应探索状态，包含快慢思考模式
在HM3D和MP3D基准测试上取得SOTA性能

方法论

利用MLLM作为决策框架，通过动态调整探索、恢复和回忆状态，平衡探索和利用，实现高效导航。

原文摘要

Zero-Shot Object Navigation (ZSON) in unknown multi-floor environments presents a significant challenge. Recent methods, mostly based on semantic value greedy waypoint selection, spatial topology-enhanced memory, and Multimodal Large Language Model (MLLM) as a decision-making framework, have led to improvements. However, these architectures struggle to balance exploration and exploitation for ZSON when encountering unseen environments, especially in multi-floor settings, such as robots getting stuck at narrow intersections, endlessly wandering, or failing to find stair entrances. To overcome these challenges, we propose AERR-Nav, a Zero-Shot Object Navigation framework that dynamically adjusts its state based on the robot's environment. Specifically, AERR-Nav has the following two key advantages: (1) An Adaptive Exploration-Recovery-Reminiscing Strategy, enables robots to dynamically transition between three states, facilitating specialized responses to diverse navigation scenarios. (2) An Adaptive Exploration State featuring Fast and Slow-Thinking modes helps robots better balance exploration, exploitation, and higher-level reasoning based on evolving environmental information. Extensive experiments on the HM3D and MP3D benchmarks demonstrate that our AERR-Nav achieves state-of-the-art performance among zero-shot methods. Comprehensive ablation studies further validate the efficacy of our proposed strategy and modules.

arXiv 分类

cs.RO cs.CV

AI 摘要

主要贡献

方法论

原文摘要

标签

arXiv 分类