AI Agents 相关度: 9/10

Assessing Deanonymization Risks with Stylometry-Assisted LLM Agent

Boyang Zhang, Yang Zhang
arXiv: 2602.23079v1 发布: 2026-02-26 更新: 2026-02-26

AI 摘要

该论文提出SALA方法,利用LLM代理评估和缓解新闻文章的去匿名化风险,并提出重写策略保护作者隐私。

主要贡献

  • 提出SALA方法,结合文体特征和LLM推理进行作者归属。
  • 设计LLM代理评估去匿名化风险并提供可解释的流程。
  • 提出指导性重构策略,在保留文本意义的同时降低作者可识别性。

方法论

结合文体特征提取,利用LLM进行推理分析,并通过数据库进行增强,最后生成重写提示来降低作者可识别性。

原文摘要

The rapid advancement of large language models (LLMs) has enabled powerful authorship inference capabilities, raising growing concerns about unintended deanonymization risks in textual data such as news articles. In this work, we introduce an LLM agent designed to evaluate and mitigate such risks through a structured, interpretable pipeline. Central to our framework is the proposed $\textit{SALA}$ (Stylometry-Assisted LLM Analysis) method, which integrates quantitative stylometric features with LLM reasoning for robust and transparent authorship attribution. Experiments on large-scale news datasets demonstrate that $\textit{SALA}$, particularly when augmented with a database module, achieves high inference accuracy in various scenarios. Finally, we propose a guided recomposition strategy that leverages the agent's reasoning trace to generate rewriting prompts, effectively reducing authorship identifiability while preserving textual meaning. Our findings highlight both the deanonymization potential of LLM agents and the importance of interpretable, proactive defenses for safeguarding author privacy.

标签

LLM Deanonymization Stylometry Privacy

arXiv 分类

cs.CL cs.CR cs.LG