Multimodal Learning 相关度: 7/10

A 360-degree Multi-camera System for Blue Emergency Light Detection Using Color Attention RT-DETR and the ABLDataset

Francisco Vacalebri-Lloret, Lucas Banchero, Jose J. Lopez, Jose M. Mossi
arXiv: 2603.05058v1 发布: 2026-03-05 更新: 2026-03-05

AI 摘要

该研究提出一种基于多目视觉和颜色注意力机制的蓝光应急车辆检测系统。

主要贡献

  • 构建了欧洲应急车辆蓝光图像数据集 ABLDataset
  • 提出了基于颜色注意力机制的 RT-DETR 蓝光检测算法
  • 设计了一种多目视觉系统实现蓝光车辆的方位估计

方法论

采用多目鱼眼相机采集图像,使用颜色注意力增强的 RT-DETR 模型进行蓝光检测,并结合几何变换估计方位。

原文摘要

This study presents an advanced system for detecting blue lights on emergency vehicles, developed using ABLDataset, a curated dataset that includes images of European emergency vehicles under various climatic and geographic conditions. The system employs a configuration of four fisheye cameras, each with a 180-degree horizontal field of view, mounted on the sides of the vehicle. A calibration process enables the azimuthal localization of the detections. Additionally, a comparative analysis of major deep neural network algorithms was conducted, including YOLO (v5, v8, and v10), RetinaNet, Faster R-CNN, and RT-DETR. RT-DETR was selected as the base model and enhanced through the incorporation of a color attention block, achieving an accuracy of 94.7 percent and a recall of 94.1 percent on the test set, with field test detections reaching up to 70 meters. Furthermore, the system estimates the approach angle of the emergency vehicle relative to the center of the car using geometric transformations. Designed for integration into a multimodal system that combines visual and acoustic data, this system has demonstrated high efficiency, offering a promising approach to enhancing Advanced Driver Assistance Systems (ADAS) and road safety.

标签

蓝光检测 应急车辆 多目视觉 目标检测 RT-DETR

arXiv 分类

cs.CV cs.AI eess.IV