Agentic Neurosymbolic Collaboration for Mathematical Discovery: A Case Study in Combinatorial Design
AI 摘要
研究人机协作在组合设计理论中发现新数学结果,利用神经符号方法证明拉丁方不平衡性的下界。
主要贡献
- 提出人机协作数学发现方法
- 发现拉丁方不平衡性的新的紧下界
- 揭示 LLM 在数学研究中的认知贡献
方法论
结合LLM、符号计算工具和人工指导,通过多轮交互,重建发现过程并分析各组件的认知贡献。
原文摘要
We study mathematical discovery through the lens of neurosymbolic reasoning, where an AI agent powered by a large language model (LLM), coupled with symbolic computation tools, and human strategic direction, jointly produced a new result in combinatorial design theory. The main result of this human-AI collaboration is a tight lower bound on the imbalance of Latin squares for the notoriously difficult case $n \equiv 1 \pmod{3}$. We reconstruct the discovery process from detailed interaction logs spanning multiple sessions over several days and identify the distinct cognitive contributions of each component. The AI agent proved effective at uncovering hidden structure and generating hypotheses. The symbolic component consists of computer algebra, constraint solvers, and simulated annealing, which provides rigorous verification and exhaustive enumeration. Human steering supplied the critical research pivot that transformed a dead end into a productive inquiry. Our analysis reveals that multi-model deliberation among frontier LLMs proved reliable for criticism and error detection but unreliable for constructive claims. The resulting human-AI mathematical contribution, a tight lower bound of $4n(n{-}1)/9$, is achieved via a novel class of near-perfect permutations. The bound was formally verified in Lean 4. Our experiments show that neurosymbolic systems can indeed produce genuine discoveries in pure mathematics.