AI Agents 相关度: 8/10

AegisUI: Behavioral Anomaly Detection for Structured User Interface Protocols in AI Agent Systems

Mohd Safwan Uddin, Saba Hajira
arXiv: 2603.05031v1 发布: 2026-03-05 更新: 2026-03-05

AI 摘要

AegisUI提出了一种检测AI Agent生成UI异常行为的框架,能有效识别恶意UI攻击。

主要贡献

  • 构建了包含恶意UI攻击的带标签数据集
  • 提出了18个可用于异常检测的UI特征
  • 对比了多种异常检测算法在UI恶意行为检测中的表现

方法论

构建数据集,提取UI特征,使用Isolation Forest、Autoencoder和Random Forest进行异常检测,并比较性能。

原文摘要

AI agents that build user interfaces on the fly assembling buttons, forms, and data displays from structured protocol payloads are becoming common in production systems. The trouble is that a payload can pass every schema check and still trick a user: a button might say "View invoice" while its hidden action wipes an account, or a display widget might quietly bind to an internal salary field. Current defenses stop at syntax; they were never built to catch this kind of behavioral mismatch. We built AegisUI to study exactly this gap. The framework generates structured UI payloads, injects realistic attacks into them, extracts numeric features, and benchmarks anomaly detectors end-to-end. We produced 4000 labeled payloads (3000 benign, 1000 malicious) spanning five application domains and five attack families: phishing interfaces, data leakage, layout abuse, manipulative UI, and workflow anomalies. From each payload we extracted 18 features covering structural, semantic, binding, and session dimensions, then compared three detectors: Isolation Forest (unsupervised), a benign-trained autoencoder (semi-supervised), and Random Forest (supervised). On a stratified 80/20 split, Random Forest scored best overall (accuracy 0.931, precision 0.980, recall 0.740, F1 0.843, ROC-AUC 0.952). The autoencoder came second (F1 0.762, ROC-AUC 0.863) and needs no malicious labels at training time, which matters when deploying a new system that lacks attack history. Per-attack-type analysis showed that layout abuse is easiest to catch while manipulative UI payloads are hardest. All code, data, and configurations are released for full reproducibility.

标签

UI安全 异常检测 AI Agent 对抗攻击 行为分析

arXiv 分类

cs.AI