AI Agents 相关度: 9/10

ClawTrap: A MITM-Based Red-Teaming Framework for Real-World OpenClaw Security Evaluation

Haochen Zhao, Shaoyang Cui
arXiv: 2603.18762v1 发布: 2026-03-19 更新: 2026-03-19

AI 摘要

ClawTrap框架通过MITM攻击评估OpenClaw在真实网络环境中的安全性。

主要贡献

  • 提出ClawTrap框架,用于评估OpenClaw安全性
  • 支持多样且可定制的MITM攻击
  • 揭示不同模型在MITM攻击下的安全差异

方法论

构建基于MITM的红队框架ClawTrap,进行可重复的网络流量拦截、转换和审计,评估OpenClaw在真实网络环境下的安全性。

原文摘要

Autonomous web agents such as \textbf{OpenClaw} are rapidly moving into high-impact real-world workflows, but their security robustness under live network threats remains insufficiently evaluated. Existing benchmarks mainly focus on static sandbox settings and content-level prompt attacks, which leaves a practical gap for network-layer security testing. In this paper, we present \textbf{ClawTrap}, a \textbf{MITM-based red-teaming framework for real-world OpenClaw security evaluation}. ClawTrap supports diverse and customizable attack forms, including \textit{Static HTML Replacement}, \textit{Iframe Popup Injection}, and \textit{Dynamic Content Modification}, and provides a reproducible pipeline for rule-driven interception, transformation, and auditing. This design lays the foundation for future research to construct richer, customizable MITM attacks and to perform systematic security testing across agent frameworks and model backbones. Our empirical study shows clear model stratification: weaker models are more likely to trust tampered observations and produce unsafe outputs, while stronger models demonstrate better anomaly attribution and safer fallback strategies. These findings indicate that reliable OpenClaw security evaluation should explicitly incorporate dynamic real-world MITM conditions rather than relying only on static sandbox protocols.

标签

OpenClaw MITM攻击 安全评估 红队

arXiv 分类

cs.CR cs.AI