AI Agents 相关度: 9/10

Agentic AI for Cybersecurity: A Meta-Cognitive Architecture for Governable Autonomy

Andrei Kojukhov, Arkady Bovshover
arXiv: 2602.11897v1 发布: 2026-02-12 更新: 2026-02-12

AI 摘要

提出一种基于元认知判断的Agentic AI网络安全架构,提升网络安全决策的可解释性和可控性。

主要贡献

  • 提出Agentic AI网络安全架构
  • 引入元认知判断函数治理系统自主性
  • 结合分布式认知理论、多智能体系统和责任AI框架

方法论

通过结合理论框架(分布式认知理论、多智能体系统、责任AI框架)和概念架构设计,阐述了元认知智能体架构的优势。

原文摘要

Contemporary AI-driven cybersecurity systems are predominantly architected as model-centric detection and automation pipelines optimized for task-level performance metrics such as accuracy and response latency. While effective for bounded classification tasks, these architectures struggle to support accountable decision-making under adversarial uncertainty, where actions must be justified, governed, and aligned with organizational and regulatory constraints. This paper argues that cybersecurity orchestration should be reconceptualized as an agentic, multi-agent cognitive system, rather than a linear sequence of detection and response components. We introduce a conceptual architectural framework in which heterogeneous AI agents responsible for detection, hypothesis formation, contextual interpretation, explanation, and governance are coordinated through an explicit meta-cognitive judgement function. This function governs decision readiness and dynamically calibrates system autonomy when evidence is incomplete, conflicting, or operationally risky. By synthesizing distributed cognition theory, multi-agent systems research, and responsible AI governance frameworks, we demonstrate that modern security operations already function as distributed cognitive systems, albeit without an explicit organizing principle. Our contribution is to make this cognitive structure architecturally explicit and governable by embedding meta-cognitive judgement as a first-class system function. We discuss implications for security operations centers, accountable autonomy, and the design of next-generation AI-enabled cyber defence architectures. The proposed framework shifts the focus of AI in cybersecurity from optimizing isolated predictions to governing autonomy under uncertainty.

标签

网络安全 AI Agent 元认知 多智能体系统 可解释性

arXiv 分类

cs.CR cs.AI