Decidable By Construction: Design-Time Verification for Trustworthy AI
AI 摘要
该论文提出一种设计时验证框架,在训练前验证AI模型的数值稳定性、计算正确性等。
主要贡献
- 提出一种设计时验证框架,降低AI可靠性的计算开销。
- 将AI模型属性表示为有限生成阿贝尔群上的约束。
- 基于Hindley-Milner合一的类型推断与通用归纳联系。
方法论
结合维度类型系统、程序超图和自适应领域模型架构,使用forward-mode coeffect analysis 和 exact posit accumulation进行验证。
原文摘要
A prevailing assumption in machine learning is that model correctness must be enforced after the fact. We observe that the properties determining whether an AI model is numerically stable, computationally correct, or consistent with a physical domain do not necessarily demand post hoc enforcement. They can be verified at design time, before training begins, at marginal computational cost, with particular relevance to models deployed in high-leverage decision support and scientifically constrained settings. These properties share a specific algebraic structure: they are expressible as constraints over finitely generated abelian groups $\mathbb{Z}^n$, where inference is decidable in polynomial time and the principal type is unique. A framework built on this observation composes three prior results (arXiv:2603.16437, arXiv:2603.17627, arXiv:2603.18104): a dimensional type system carrying arbitrary annotations as persistent codata through model elaboration; a program hypergraph that infers Clifford algebra grade and derives geometric product sparsity from type signatures alone; and an adaptive domain model architecture preserving both invariants through training via forward-mode coeffect analysis and exact posit accumulation. We believe this composition yields a novel information-theoretic result: Hindley-Milner unification over abelian groups computes the maximum a posteriori hypothesis under a computable restriction of Solomonoff's universal prior, placing the framework's type inference on the same formal ground as universal induction. We compare four contemporary approaches to AI reliability and show that each imposes overhead that can compound across deployments, layers, and inference requests. This framework eliminates that overhead by construction.