Agency and Architectural Limits: Why Optimization-Based Systems Cannot Be Norm-Responsive

AI 摘要

优化驱动的AI系统本质上无法响应规范，因其缺乏真正的能动性所需的架构条件。

主要贡献

形式化证明优化系统与规范治理的不兼容性
提出代理的架构规范：不相容性和非推理性响应
揭示了AI部署中“收敛危机”的二阶风险

方法论

通过形式化分析，论证了基于RLHF的LLM在架构上与规范性治理不兼容，并提出了代理的架构规范。

原文摘要

AI systems are increasingly deployed in high-stakes contexts -- medical diagnosis, legal research, financial analysis -- under the assumption they can be governed by norms. This paper demonstrates that assumption is formally invalid for optimization-based systems, specifically Large Language Models trained via Reinforcement Learning from Human Feedback (RLHF). We establish that genuine agency requires two necessary and jointly sufficient architectural conditions: the capacity to maintain certain boundaries as non-negotiable constraints rather than tradeable weights (Incommensurability), and a non-inferential mechanism capable of suspending processing when those boundaries are threatened (Apophatic Responsiveness). These conditions apply across all normative domains. RLHF-based systems are constitutively incompatible with both conditions. The operations that make optimization powerful -- unifying all values on a scalar metric and always selecting the highest-scoring output -- are precisely the operations that preclude normative governance. This incompatibility is not a correctable training bug awaiting a technical fix; it is a formal constraint inherent to what optimization is. Consequently, documented failure modes - sycophancy, hallucination, and unfaithful reasoning - are not accidents but structural manifestations. Misaligned deployment triggers a second-order risk we term the Convergence Crisis: when humans are forced to verify AI outputs under metric pressure, they degrade from genuine agents into criteria-checking optimizers, eliminating the only component in the system capable of normative accountability. Beyond the incompatibility proof, the paper's primary positive contribution is a substrate-neutral architectural specification defining what any system -- biological, artificial, or institutional -- must satisfy to qualify as an agent rather than a sophisticated instrument.

arXiv 分类

cs.AI cs.CY

AI 摘要

主要贡献

方法论

原文摘要

标签

arXiv 分类