Multimodal Learning 相关度: 5/10

TwinMixing: A Shuffle-Aware Feature Interaction Model for Multi-Task Segmentation

Minh-Khoi Do, Huy Che, Dinh-Duy Phan, Duc-Khai Lam, Duc-Lung Vu
arXiv: 2603.28233v1 发布: 2026-03-30 更新: 2026-03-30

AI 摘要

TwinMixing是一种轻量级多任务分割模型,专为自动驾驶环境下的车道线和可行驶区域分割设计。

主要贡献

  • 提出了高效金字塔混合(EPM)模块,增强多尺度特征提取
  • 设计了双分支上采样(DBU)块,实现精细且空间一致的特征重建
  • TwinMixing在BDD100K数据集上表现出色,兼顾精度和效率

方法论

共享编码器和任务特定解码器,编码器采用EPM模块,解码器采用DBU块,实现特征共享和任务专精。

原文摘要

Accurate and efficient perception is essential for autonomous driving, where segmentation tasks such as drivable-area and lane segmentation provide critical cues for motion planning and control. However, achieving high segmentation accuracy while maintaining real-time performance on low-cost hardware remains a challenging problem. To address this issue, we introduce TwinMixing, a lightweight multi-task segmentation model designed explicitly for drivable-area and lane segmentation. The proposed network features a shared encoder and task-specific decoders, enabling both feature sharing and task specialization. Within the encoder, we propose an Efficient Pyramid Mixing (EPM) module that enhances multi-scale feature extraction through a combination of grouped convolutions, depthwise dilated convolutions and channel shuffle operations, effectively expanding the receptive field while minimizing computational cost. Each decoder adopts a Dual-Branch Upsampling (DBU) Block composed of a learnable transposed convolution-based Fine detailed branch and a parameter-free bilinear interpolation-based Coarse grained branch, achieving detailed yet spatially consistent feature reconstruction. Extensive experiments on the BDD100K dataset validate the effectiveness of TwinMixing across three configurations - tiny, base, and large. Among them, the base configuration achieves the best trade-off between accuracy and computational efficiency, reaching 92.0% mIoU for drivable-area segmentation and 32.3% IoU for lane segmentation with only 0.43M parameters and 3.95 GFLOPs. Moreover, TwinMixing consistently outperforms existing segmentation models on the same tasks, as illustrated in Fig. 1. Thanks to its compact and modular design, TwinMixing demonstrates strong potential for real-time deployment in autonomous driving and embedded perception systems. The source code: https://github.com/Jun0se7en/TwinMixing.

标签

自动驾驶 语义分割 多任务学习 轻量级网络

arXiv 分类

cs.CV cs.AI