AI Agents 相关度: 7/10

Perception-Based Beliefs for POMDPs with Visual Observations

Miriam Schäfers, Merlijn Krale, Thiago D. Simão, Nils Jansen, Maximilian Weininger
arXiv: 2602.05679v1 发布: 2026-02-05 更新: 2026-02-05

AI 摘要

PBP框架通过图像分类器将视觉信息融入POMDP信念更新,提升高维观测下决策效率。

主要贡献

  • 提出感知信念的POMDP框架(PBP)
  • 利用图像分类器概率分布更新信念
  • 引入不确定性量化方法提升鲁棒性

方法论

构建图像分类器将视觉观测映射到状态分布,将其融入POMDP信念更新,并量化不确定性。

原文摘要

Partially observable Markov decision processes (POMDPs) are a principled planning model for sequential decision-making under uncertainty. Yet, real-world problems with high-dimensional observations, such as camera images, remain intractable for traditional belief- and filtering-based solvers. To tackle this problem, we introduce the Perception-based Beliefs for POMDPs framework (PBP), which complements such solvers with a perception model. This model takes the form of an image classifier which maps visual observations to probability distributions over states. PBP incorporates these distributions directly into belief updates, so the underlying solver does not need to reason explicitly over high-dimensional observation spaces. We show that the belief update of PBP coincides with the standard belief update if the image classifier is exact. Moreover, to handle classifier imprecision, we incorporate uncertainty quantification and introduce two methods to adjust the belief update accordingly. We implement PBP using two traditional POMDP solvers and empirically show that (1) it outperforms existing end-to-end deep RL methods and (2) uncertainty quantification improves robustness of PBP against visual corruption.

标签

POMDP 强化学习 视觉感知 不确定性量化

arXiv 分类

cs.LG