LLM Reasoning 相关度: 8/10

Emotion Entanglement and Bayesian Inference for Multi-Dimensional Emotion Understanding

Hemanth Kotaprolu, Kishan Maharaj, Raey Zhao, Abhijit Mishra, Pushpak Bhattacharyya

arXiv: 2604.00819v1 发布: 2026-04-01 更新: 2026-04-01

下载 PDF arXiv 页面

AI 摘要

提出了EmoScene基准测试多维情感理解，并使用贝叶斯推理改进情感预测。

主要贡献

提出了EmoScene基准测试，用于评估多维情感理解。
构建了基于Plutchik情感理论的8维情感向量标注数据。
提出了情感纠缠感知的贝叶斯推理框架，提升情感预测的一致性。

方法论

构建情感场景数据集，并使用Instruction-tuned LLM进行零样本评估，然后通过贝叶斯推理进行后处理，利用情感共现统计提高预测效果。

原文摘要

Understanding emotions in natural language is inherently a multi-dimensional reasoning problem, where multiple affective signals interact through context, interpersonal relations, and situational cues. However, most existing emotion understanding benchmarks rely on short texts and predefined emotion labels, reducing this process to independent label prediction and ignoring the structured dependencies among emotions. To address this limitation, we introduce Emotional Scenarios (EmoScene), a theory-grounded benchmark of 4,731 context-rich scenarios annotated with an 8-dimensional emotion vector derived from Plutchik's basic emotions. We evaluate six instruction-tuned large language models in a zero-shot setting and observe modest performance, with the best model achieving a Macro F1 of 0.501, highlighting the difficulty of context-aware multi-label emotion prediction. Motivated by the observation that emotions rarely occur independently, we further propose an entanglement-aware Bayesian inference framework that incorporates emotion co-occurrence statistics to perform joint posterior inference over the emotion vector. This lightweight post-processing improves structural consistency of predictions and yields notable gains for weaker models (e.g., +0.051 Macro F1 for Qwen2.5-7B). EmoScene therefore provides a challenging benchmark for studying multi-dimensional emotion understanding and the limitations of current language models.

arXiv 分类

cs.CL cs.AI

AI 摘要

主要贡献

方法论

原文摘要

标签

arXiv 分类