AI Agents 相关度: 8/10

VANGUARD: Vehicle-Anchored Ground Sample Distance Estimation for UAVs in GPS-Denied Environments

Yifei Chen, Xupeng Chen, Feng Wang, Niangang Jiao, Jiayin Liu
arXiv: 2603.04277v1 发布: 2026-03-04 更新: 2026-03-04

AI 摘要

VANGUARD利用车辆作为锚点,解决无人机在GPS受限环境中尺度估计问题。

主要贡献

  • 提出VANGUARD方法,用于在GPS受限环境下估计GSD
  • 使用车辆作为环境锚点,通过kernel density estimation估计GSD
  • 实验表明,VANGUARD能有效提高空间推理的准确性和安全性

方法论

VANGUARD通过检测图像中的车辆,利用先验知识和kernel density estimation估计GSD,并提供置信度评估。

原文摘要

Autonomous aerial robots operating in GPS-denied or communication-degraded environments frequently lose access to camera metadata and telemetry, leaving onboard perception systems unable to recover the absolute metric scale of the scene. As LLM/VLM-based planners are increasingly adopted as high-level agents for embodied systems, their ability to reason about physical dimensions becomes safety-critical -- yet our experiments show that five state-of-the-art VLMs suffer from spatial scale hallucinations, with median area estimation errors exceeding 50%. We propose VANGUARD, a lightweight, deterministic Geometric Perception Skill designed as a callable tool that any LLM-based agent can invoke to recover Ground Sample Distance (GSD) from ubiquitous environmental anchors: small vehicles detected via oriented bounding boxes, whose modal pixel length is robustly estimated through kernel density estimation and converted to GSD using a pre-calibrated reference length. The tool returns both a GSD estimate and a composite confidence score, enabling the calling agent to autonomously decide whether to trust the measurement or fall back to alternative strategies. On the DOTA~v1.5 benchmark, VANGUARD achieves 6.87% median GSD error on 306~images. Integrated with SAM-based segmentation for downstream area measurement, the pipeline yields 19.7% median error on a 100-entry benchmark -- with 2.6x lower category dependence and 4x fewer catastrophic failures than the best VLM baseline -- demonstrating that equipping agents with deterministic geometric tools is essential for safe autonomous spatial reasoning.

标签

UAV GSD Estimation GPS-Denied Geometric Perception Vehicle Detection

arXiv 分类

cs.RO cs.AI