Multimodal Learning 相关度: 9/10

KidsNanny: A Two-Stage Multimodal Content Moderation Pipeline Integrating Visual Classification, Object Detection, OCR, and Contextual Reasoning for Child Safety

Viraj Panchal, Tanmay Talsaniya, Parag Patel, Meet Patel
arXiv: 2603.16181v1 发布: 2026-03-17 更新: 2026-03-17

AI 摘要

提出KidsNanny多模态内容审核框架,结合视觉和文本分析提高儿童安全内容检测效率。

主要贡献

  • 提出了一个两阶段多模态内容审核架构KidsNanny
  • 结合视觉分类、目标检测、OCR和上下文推理
  • 在UnsafeBench数据集上取得了优于现有方法的性能

方法论

使用ViT和目标检测进行视觉筛选,然后使用OCR和7B语言模型进行上下文推理,两阶段流水线处理。

原文摘要

We present KidsNanny, a two-stage multimodal content moderation architecture for child safety. Stage 1 combines a vision transformer (ViT) with an object detector for visual screening (11.7 ms); outputs are routed as text not raw pixels to Stage 2, which applies OCR and a text based 7B language model for contextual reasoning (120 ms total pipeline). We evaluate on the UnsafeBench Sexual category (1,054 images) under two regimes: vision-only, isolating Stage 1, and multimodal, evaluating the full Stage 1+2 pipeline. Stage 1 achieves 80.27% accuracy and 85.39% F1 at 11.7 ms; vision-only baselines range from 59.01% to 77.04% accuracy. The full pipeline achieves 81.40% accuracy and 86.16% F1 at 120 ms, compared to ShieldGemma-2 (64.80% accuracy, 1,136 ms) and LlavaGuard (80.36% accuracy, 4,138 ms). To evaluate text-awareness, we filter two subsets: a text+visual subset (257 images) and a text-only subset (44 images where safety depends primarily on embedded text). On text-only images, KidsNanny achieves 100% recall (25/25 positives; small sample) and 75.76% precision; ShieldGemma-2 achieves 84% recall and 60% precision at 1,136 ms. Results suggest that dedicated OCR-based reasoning may offer recall-precision advantages on text-embedded threats at lower latency, though the small text-only subset limits generalizability. By documenting this architecture and evaluation methodology, we aim to contribute to the broader research effort on efficient multimodal content moderation for child safety.

标签

多模态学习 内容审核 儿童安全 视觉分类 目标检测 OCR 语言模型

arXiv 分类

cs.CV cs.CR