LLM Memory & RAG Relevance: 8/10

PEACE 2.0: Grounded Explanations and Counter-Speech for Combating Hate Expressions

Greta Damo, Stéphane Petiot, Elena Cabrio, Serena Villata

arXiv: 2602.17467v1 Published: 2026-02-19 Updated: 2026-02-19


AI Summary

PEACE 2.0 is a tool that uses RAG to generate evidence-grounded explanations of hate speech and counter-speech replies to it.

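Main Contributions

Introduces the PEACE 2.0 tool
Uses RAG to generate explanations of, and replies to, hate speech
Explores the characteristics of counter-speech replies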

Methodology

Uses a Retrieval-Augmented Generation (RAG) framework to retrieve relevant evidence and facts, then generates counter-speech grounded in that evidence.
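
To make the retrieve-then-generate flow concrete, here is a minimal sketch of the general RAG pattern, not PEACE 2.0's actual implementation. Everything in it is an assumption made for illustration: the `EVIDENCE` list, the bag-of-words cosine retriever, and the `retrieve` and `build_prompt` helpers are all hypothetical, and the generation step is left out because the summary does not say which model or retriever the tool uses.

```python
import math
import re
from collections import Counter

# Hypothetical evidence store (illustrative sentences only); the knowledge
# base actually used by PEACE 2.0 is not described in this summary.
EVIDENCE = [
    "Census data show that immigrants are employed at rates similar to native-born citizens.",
    "Studies find no link between immigration levels and rising crime rates.",
    "Vaccines are tested in large clinical trials before approval.",
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(message, docs, k=2):
    """Return the k documents most similar to the message."""
    query = Counter(tokenize(message))
    ranked = sorted(
        docs,
        key=lambda doc: cosine(query, Counter(tokenize(doc))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(message, evidence):
    """Assemble a prompt that asks a generator to stay within the evidence."""
    evidence_block = "\n".join(f"- {item}" for item in evidence)
    return (
        "Message:\n"
        f"{message}\n\n"
        "Evidence:\n"
        f"{evidence_block}\n\n"
        "Using only the evidence above, explain whether the message is "
        "hateful and write a respectful counter-speech reply."
    )


if __name__ == "__main__":
    message = "Immigrants bring crime"
    prompt = build_prompt(message, retrieve(message, EVIDENCE))
    # The prompt would then be passed to a language model of your choice.
    print(prompt)
```

A production system would more likely use dense embeddings and a vector index than word overlap. The simple retriever is used here only so the sketch runs with the standard library alone.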

Original Abstract

The increasing volume of hate speech on online platforms poses significant societal challenges. While the Natural Language Processing community has developed effective methods to automatically detect the presence of hate speech, responses to it, called counter-speech, are still an open challenge. We present PEACE 2.0, a novel tool that, besides analysing and explaining why a message is considered hateful or not, also generates a response to it. More specifically, PEACE 2.0 has three main new functionalities: leveraging a Retrieval-Augmented Generation (RAG) pipeline i) to ground HS explanations into evidence and facts, ii) to automatically generate evidence-grounded counter-speech, and iii) exploring the characteristics of counter-speech replies. By integrating these capabilities, PEACE 2.0 enables in-depth analysis and response generation for both explicit and implicit hateful messages.

arXiv Categories

cs.CL
