AI Agents 相关度: 9/10

Policy Compiler for Secure Agentic Systems

Nils Palumbo, Sarthak Choudhary, Jihye Choi, Prasad Chalasani, Mihai Christodorescu, Somesh Jha
arXiv: 2602.16708v1 发布: 2026-02-18 更新: 2026-02-18

AI 摘要

PCAS是一个策略编译器,用于确保基于LLM的Agent系统满足复杂的安全策略,提升策略合规性。

主要贡献

  • 提出了PCAS策略编译器,实现确定性的策略执行
  • 使用依赖图建模Agent系统状态,追踪跨Agent的信息流
  • 将策略编译为工具化的系统,无需修改原有Agent架构

方法论

使用Datalog衍生语言声明策略,reference monitor拦截违规操作,将Agent和策略编译为满足策略的系统。

原文摘要

LLM-based agents are increasingly being deployed in contexts requiring complex authorization policies: customer service protocols, approval workflows, data access restrictions, and regulatory compliance. Embedding these policies in prompts provides no enforcement guarantees. We present PCAS, a Policy Compiler for Agentic Systems that provides deterministic policy enforcement. Enforcing such policies requires tracking information flow across agents, which linear message histories cannot capture. Instead, PCAS models the agentic system state as a dependency graph capturing causal relationships among events such as tool calls, tool results, and messages. Policies are expressed in a Datalog-derived language, as declarative rules that account for transitive information flow and cross-agent provenance. A reference monitor intercepts all actions and blocks violations before execution, providing deterministic enforcement independent of model reasoning. PCAS takes an existing agent implementation and a policy specification, and compiles them into an instrumented system that is policy-compliant by construction, with no security-specific restructuring required. We evaluate PCAS on three case studies: information flow policies for prompt injection defense, approval workflows in a multi-agent pharmacovigilance system, and organizational policies for customer service. On customer service tasks, PCAS improves policy compliance from 48% to 93% across frontier models, with zero policy violations in instrumented runs.

标签

AI Agents Policy Enforcement Security Information Flow

arXiv 分类

cs.CR cs.AI cs.MA