LLM Memory & RAG 相关度: 8/10

PEACE 2.0: Grounded Explanations and Counter-Speech for Combating Hate Expressions

Greta Damo, Stéphane Petiot, Elena Cabrio, Serena Villata
arXiv: 2602.17467v1 发布: 2026-02-19 更新: 2026-02-19

AI 摘要

PEACE 2.0工具利用RAG生成证据支撑的反仇恨言论解释和回复。

主要贡献

  • 提出PEACE 2.0工具
  • 利用RAG生成仇恨言论的解释和回复
  • 探索反仇恨言论回复的特征

方法论

使用Retrieval-Augmented Generation (RAG) 框架,从证据和事实中检索信息,并生成基于证据的反仇恨言论。

原文摘要

The increasing volume of hate speech on online platforms poses significant societal challenges. While the Natural Language Processing community has developed effective methods to automatically detect the presence of hate speech, responses to it, called counter-speech, are still an open challenge. We present PEACE 2.0, a novel tool that, besides analysing and explaining why a message is considered hateful or not, also generates a response to it. More specifically, PEACE 2.0 has three main new functionalities: leveraging a Retrieval-Augmented Generation (RAG) pipeline i) to ground HS explanations into evidence and facts, ii) to automatically generate evidence-grounded counter-speech, and iii) exploring the characteristics of counter-speech replies. By integrating these capabilities, PEACE 2.0 enables in-depth analysis and response generation for both explicit and implicit hateful messages.

标签

仇恨言论检测 反仇恨言论 Retrieval-Augmented Generation

arXiv 分类

cs.CL