Multimodal Learning 相关度: 8/10

RGB-Event HyperGraph Prompt for Kilometer Marker Recognition based on Pre-trained Foundation Models

Xiaoyu Xian, Shiao Wang, Xiao Wang, Daxin Tian, Yan Tian
arXiv: 2602.22026v1 发布: 2026-02-25 更新: 2026-02-25

AI 摘要

提出了一种基于RGB-Event数据和预训练模型的公里标识别方法,并构建了大规模数据集EvMetro5K。

主要贡献

  • 提出了基于RGB-Event HyperGraph Prompt的KMR方法
  • 构建了大规模RGB-Event数据集EvMetro5K
  • 在EvMetro5K和benchmark上验证了方法的有效性

方法论

使用预训练RGB OCR模型,通过多模态适配,结合RGB和Event数据,用于公里标识别。

原文摘要

Metro trains often operate in highly complex environments, characterized by illumination variations, high-speed motion, and adverse weather conditions. These factors pose significant challenges for visual perception systems, especially those relying solely on conventional RGB cameras. To tackle these difficulties, we explore the integration of event cameras into the perception system, leveraging their advantages in low-light conditions, high-speed scenarios, and low power consumption. Specifically, we focus on Kilometer Marker Recognition (KMR), a critical task for autonomous metro localization under GNSS-denied conditions. In this context, we propose a robust baseline method based on a pre-trained RGB OCR foundation model, enhanced through multi-modal adaptation. Furthermore, we construct the first large-scale RGB-Event dataset, EvMetro5K, containing 5,599 pairs of synchronized RGB-Event samples, split into 4,479 training and 1,120 testing samples. Extensive experiments on EvMetro5K and other widely used benchmarks demonstrate the effectiveness of our approach for KMR. Both the dataset and source code will be released on https://github.com/Event-AHU/EvMetro5K_benchmark

标签

RGB-Event 公里标识别 预训练模型 多模态学习 深度学习

arXiv 分类

cs.CV cs.AI