Multimodal Learning 相关度: 8/10

Digital Twin Driven Textile Classification and Foreign Object Recognition in Automated Sorting Systems

Serkan Ergun, Tobias Mitterer, Hubert Zangl
arXiv: 2603.05230v1 发布: 2026-03-05 更新: 2026-03-05

AI 摘要

论文提出一种基于数字孪生的纺织品分拣系统,利用VLM进行分类和异物识别。

主要贡献

  • 提出数字孪生驱动的纺织品分拣系统
  • 评估了多种VLM在纺织品分类任务上的性能
  • 结合VLM语义推理与传统抓取检测,实现自主分拣

方法论

利用双臂机器人和多模态感知,结合VLM进行纺织品分类和异物识别,并使用数字孪生进行路径规划。

原文摘要

The increasing demand for sustainable textile recycling requires robust automation solutions capable of handling deformable garments and detecting foreign objects in cluttered environments. This work presents a digital twin driven robotic sorting system that integrates grasp prediction, multi modal perception, and semantic reasoning for real world textile classification. A dual arm robotic cell equipped with RGBD sensing, capacitive tactile feedback, and collision-aware motion planning autonomously separates garments from an unsorted basket, transfers them to an inspection zone, and classifies them using state of the art Visual Language Models (VLMs). We benchmark nine VLM s from five model families on a dataset of 223 inspection scenarios comprising shirts, socks, trousers, underwear, foreign objects (including garments outside of the aforementioned classes), and empty scenes. The evaluation assesses per class accuracy, hallucination behavior, and computational performance under practical hardware constraints. Results show that the Qwen model family achieves the highest overall accuracy (up to 87.9 %), with strong foreign object detection performance, while lighter models such as Gemma3 offer competitive speed accuracy trade offs for edge deployment. A digital twin combined with MoveIt enables collision aware path planning and integrates segmented 3D point clouds of inspected garments into the virtual environment for improved manipulation reliability. The presented system demonstrates the feasibility of combining semantic VLM reasoning with conventional grasp detection and digital twin technology for scalable, autonomous textile sorting in realistic industrial settings.

标签

数字孪生 机器人分拣 视觉语言模型 纺织品分类

arXiv 分类

cs.CV cs.RO