AI Agents 相关度: 9/10

What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline

Benoît Alcaraz
arXiv: 2603.16651v1 发布: 2026-03-17 更新: 2026-03-17

AI 摘要

该论文提出一个基于强化学习和论证的规范兼容智能体开发流程,并解决了规范规避问题。

主要贡献

  • 提出一个端到端的规范兼容智能体开发流程
  • 设计了一个自动提取论证的算法
  • 定义并缓解了强化学习智能体中的规范规避现象

方法论

结合AJAR、Jiminy和NGRL架构,构建混合模型pino,使用论证规范顾问监督强化学习智能体。

原文摘要

In the past decade, artificial intelligence (AI) has developed quickly. With this rapid progression came the need for systems capable of complying with the rules and norms of our society so that they can be successfully and safely integrated into our daily lives. Inspired by the story of Pinocchio in ``Le avventure di Pinocchio - Storia di un burattino'', this thesis proposes a pipeline that addresses the problem of developing norm compliant and context-aware agents. Building on the AJAR, Jiminy, and NGRL architectures, the work introduces \pino, a hybrid model in which reinforcement learning agents are supervised by argumentation-based normative advisors. In order to make this pipeline operational, this thesis also presents a novel algorithm for automatically extracting the arguments and relationships that underlie the advisors' decisions. Finally, this thesis investigates the phenomenon of \textit{norm avoidance}, providing a definition and a mitigation strategy within the context of reinforcement learning agents. Each component of the pipeline is empirically evaluated. The thesis concludes with a discussion of related work, current limitations, and directions for future research.

标签

强化学习 规范兼容性 论证 AI安全

arXiv 分类

cs.AI