AI Agents 相关度: 9/10

Sensi: Learn One Thing at a Time -- Curriculum-Based Test-Time Learning for LLM Game Agents

Mohsen Arjmandi
arXiv: 2603.17683v1 发布: 2026-03-18 更新: 2026-03-18

AI 摘要

Sensi通过课程学习和双智能体架构提升LLM智能体在ARC-AGI-3游戏中学习效率。

主要贡献

  • 提出双智能体架构分离感知与行动
  • 引入基于课程学习的测试时学习系统
  • 采用数据库作为控制平面以控制智能体上下文窗口

方法论

Sensi使用双智能体架构,通过外部状态机管理课程学习,并使用数据库控制上下文,LLM作为裁判动态评估学习进度。

原文摘要

Large language model (LLM) agents deployed in unknown environments must learn task structure at test time, but current approaches require thousands of interactions to form useful hypotheses. We present Sensi, an LLM agent architecture for the ARC-AGI-3 game-playing challenge that introduces structured test-time learning through three mechanisms: (1) a two-player architecture separating perception from action, (2) a curriculum-based learning system managed by an external state machine, and (3) a database-as-control-plane that makes the agents context window programmatically steerable. We further introduce an LLM-as-judge component with dynamically generated evaluation rubrics to determine when the agent has learned enough about one topic to advance to the next. We report results across two iterations: Sensi v1 solves 2 game levels using the two-player architecture alone, while Sensi v2 adds curriculum learning and solves 0 levels - but completes its entire learning curriculum in approximately 32 action attempts, achieving 50-94x greater sample efficiency than comparable systems that require 1600-3000 attempts. We precisely diagnose the failure mode as a self-consistent hallucination cascade originating in the perception layer, demonstrating that the architectural bottleneck has shifted from learning efficiency to perceptual grounding - a more tractable problem.

标签

LLM Agent Curriculum Learning Test-Time Learning ARC-AGI

arXiv 分类

cs.AI cs.LG