Natural-Language Agent Harnesses
AI 摘要
提出了一种自然语言驱动的Agent Harness框架,旨在提高Agent harness的可移植性、可比性和可研究性。
主要贡献
- 提出Natural-Language Agent Harnesses (NLAHs)
- 设计Intelligent Harness Runtime (IHR)
- 验证了NLAH在不同benchmark上的可行性
方法论
通过自然语言定义agent harness行为,利用共享运行时执行,并通过对比实验评估其性能和模块化程度。
原文摘要
Agent performance increasingly depends on \emph{harness engineering}, yet harness design is usually buried in controller code and runtime-specific conventions, making it hard to transfer, compare, and study as a scientific object. We ask whether the high-level control logic of an agent harness can instead be externalized as a portable executable artifact. We introduce \textbf{Natural-Language Agent Harnesses} (NLAHs), which express harness behavior in editable natural language, and \textbf{Intelligent Harness Runtime} (IHR), a shared runtime that executes these harnesses through explicit contracts, durable artifacts, and lightweight adapters. Across coding and computer-use benchmarks, we conduct controlled evaluations of operational viability, module ablation, and code-to-text harness migration.