Multimodal Learning 相关度: 9/10

MER-Bench: A Comprehensive Benchmark for Multimodal Meme Reappraisal

Yiqi Nie, Fei Wang, Junjie Chen, Kun Li, Yudi Cai, Dan Guo, Chenglong Li, Meng Wang
arXiv: 2603.15020v1 发布: 2026-03-16 更新: 2026-03-16

AI 摘要

提出了Meme Reappraisal任务,构建了MER-Bench数据集,并提出了评估框架。

主要贡献

  • 提出了Meme Reappraisal任务
  • 构建了MER-Bench数据集
  • 提出了基于MLLM的评估框架

方法论

构建数据集并使用MLLM作为评估器,分析现有模型在结构保持、语义一致性和情感转换方面的性能。

原文摘要

Memes represent a tightly coupled, multimodal form of social expression, in which visual context and overlaid text jointly convey nuanced affect and commentary. Inspired by cognitive reappraisal in psychology, we introduce Meme Reappraisal, a novel multimodal generation task that aims to transform negatively framed memes into constructive ones while preserving their underlying scenario, entities, and structural layout. Unlike prior works on meme understanding or generation, Meme Reappraisal requires emotion-controllable, structure-preserving multimodal transformation under multiple semantic and stylistic constraints. To support this task, we construct MER-Bench, a benchmark of real-world memes with fine-grained multimodal annotations, including source and target emotions, positively rewritten meme text, visual editing specifications, and taxonomy labels covering visual type, sentiment polarity, and layout structure. We further propose a structured evaluation framework based on a multimodal large language model (MLLM)-as-a-Judge paradigm, decomposing performance into modality-level generation quality, affect controllability, structural fidelity, and global affective alignment. Extensive experiments across representative image-editing and multimodal-generation systems reveal substantial gaps in satisfying the constraints of structural preservation, semantic consistency, and affective transformation. We believe MER-Bench establishes a foundation for research on controllable meme editing and emotion-aware multimodal generation. Our code is available at: https://github.com/one-seven17/MER-Bench.

标签

Meme Multimodal Generation Reappraisal Benchmark

arXiv 分类

cs.CV cs.CL