AI Agents 相关度: 9/10

6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management

Jiao Chen, Jianhua Tang, Xiaotong Yang, Zuohong Lv
arXiv: 2603.29656v1 发布: 2026-03-31 更新: 2026-03-31

AI 摘要

提出6GAgentGym,一个闭环6G网络管理环境,并训练开源模型达到GPT-5的性能。

主要贡献

  • 构建了交互式6G网络管理环境6GAgentGym
  • 开发了基于NS-3的实验模型和自指导数据生成方法
  • 训练了一个性能与GPT-5相当的开源模型

方法论

利用NS-3数据,通过自指导生成训练轨迹,进行监督微调和强化学习,实现闭环网络管理。

原文摘要

Autonomous 6G network management requires agents that can execute tools, observe the resulting state changes, and adapt their decisions accordingly. Existing benchmarks based on static questions or scripted episode replay, however, do not support such closed-loop interaction, limiting agents to passive evaluation without the ability to learn from environmental feedback. This paper presents 6GAgentGym to provide closed-loop capability. The framework provides an interactive environment with 42 typed tools whose effect classification distinguishes read-only observation from state-mutating configuration, backed by a learned Experiment Model calibrated on NS-3 simulation data. 6G-Forge bootstraps closed-loop training trajectories from NS-3 seeds via iterative Self-Instruct generation with execution verification against the Experiment Model. Supervised fine-tuning on the resulting corpus followed by reinforcement learning with online closed-loop interaction enables an 8B open-source model to achieve comparable overall success rate to GPT-5 on the accompanying 6GAgentBench, with stronger performance on long-horizon tasks. Together, these components provide a viable path toward autonomous, closed-loop network management.

标签

6G 网络管理 Agent 强化学习 工具使用

arXiv 分类

cs.NI cs.AI