Multimodal Learning 相关度: 9/10

IDRL: An Individual-Aware Multimodal Depression-Related Representation Learning Framework for Depression Diagnosis

Chongxiao Wang, Junjie Liang, Peng Cao, Jinzhu Yang, Osmar R. Zaiane
arXiv: 2603.11644v1 发布: 2026-03-12 更新: 2026-03-12

AI 摘要

IDRL框架通过解耦多模态表示和个体感知融合,提升抑郁症诊断的准确性和鲁棒性。

主要贡献

  • 提出IDRL框架,用于多模态抑郁症诊断
  • 解耦多模态表示为抑郁症相关和不相关空间
  • 引入个体感知模态融合模块(IAF)

方法论

IDRL通过解耦多模态表示并动态调整模态权重,实现个体化的跨模态融合,从而进行抑郁症诊断。

原文摘要

Depression is a severe mental disorder, and reliable identification plays a critical role in early intervention and treatment. Multimodal depression detection aims to improve diagnostic performance by jointly modeling complementary information from multiple modalities. Recently, numerous multimodal learning approaches have been proposed for depression analysis; however, these methods suffer from the following limitations: 1) inter-modal inconsistency and depression-unrelated interference, where depression-related cues may conflict across modalities while substantial irrelevant content obscures critical depressive signals, and 2) diverse individual depressive presentations, leading to individual differences in modality and cue importance that hinder reliable fusion. To address these issues, we propose Individual-aware Multimodal Depression-related Representation Learning Framework (IDRL) for robust depression diagnosis. Specifically, IDRL 1) disentangles multimodal representations into a modality-common depression space, a modality-specific depression space, and a depression-unrelated space to enhance modality alignment while suppressing irrelevant information, and 2) introduces an individual-aware modality-fusion module (IAF) that dynamically adjusts the weights of disentangled depression-related features based on their predictive significance, thereby achieving adaptive cross-modal fusion for different individuals. Extensive experiments demonstrate that IDRL achieves superior and robust performance for multimodal depression detection.

标签

多模态学习 抑郁症诊断 表示学习 个体化模型

arXiv 分类

cs.CV cs.AI