AI Agents 相关度: 10/10

AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs

Che Wang, Jiaming Zhang, Ziqi Zhang, Zijie Wang, Yinghui Wang, Jianbo Gao, Tao Wei, Zhong Chen, Wei Yang Bryan Lim
arXiv: 2602.20720v1 发布: 2026-02-24 更新: 2026-02-24

AI 摘要

AdapTools提出了一种自适应的间接提示注入攻击框架,提升了攻击成功率和系统效用劣化。

主要贡献

  • 提出了自适应攻击策略构建方法
  • 提出了攻击增强方法,识别隐蔽工具绕过防御
  • 验证了AdapTools在复杂攻击场景下的有效性

方法论

通过构建可迁移的对抗策略优化提示,并识别隐蔽工具来增强攻击,实现了自适应间接提示注入攻击。

原文摘要

The integration of external data services (e.g., Model Context Protocol, MCP) has made large language model-based agents increasingly powerful for complex task execution. However, this advancement introduces critical security vulnerabilities, particularly indirect prompt injection (IPI) attacks. Existing attack methods are limited by their reliance on static patterns and evaluation on simple language models, failing to address the fast-evolving nature of modern AI agents. We introduce AdapTools, a novel adaptive IPI attack framework that selects stealthier attack tools and generates adaptive attack prompts to create a rigorous security evaluation environment. Our approach comprises two key components: (1) Adaptive Attack Strategy Construction, which develops transferable adversarial strategies for prompt optimization, and (2) Attack Enhancement, which identifies stealthy tools capable of circumventing task-relevance defenses. Comprehensive experimental evaluation shows that AdapTools achieves a 2.13 times improvement in attack success rate while degrading system utility by a factor of 1.78. Notably, the framework maintains its effectiveness even against state-of-the-art defense mechanisms. Our method advances the understanding of IPI attacks and provides a useful reference for future research.

标签

AI Agent Prompt Injection Security Adversarial Attack

arXiv 分类

cs.CR cs.AI