GLM-5: from Vibe Coding to Agentic Engineering
AI 摘要
GLM-5通过DSA降低成本,异步强化学习提升效率,实现从Vibe Coding到Agentic Engineering的转变。
主要贡献
- 采用DSA降低训练和推理成本,同时保持长上下文保真度
- 引入异步强化学习基础设施,提升训练效率
- 提出新型异步agent RL算法,提升RL质量
方法论
GLM-5构建在ARC能力之上,采用DSA降低成本,利用异步强化学习提升模型对复杂任务的学习能力。
原文摘要
We present GLM-5, a next-generation foundation model designed to transition the paradigm of vibe coding to agentic engineering. Building upon the agentic, reasoning, and coding (ARC) capabilities of its predecessor, GLM-5 adopts DSA to significantly reduce training and inference costs while maintaining long-context fidelity. To advance model alignment and autonomy, we implement a new asynchronous reinforcement learning infrastructure that drastically improves post-training efficiency by decoupling generation from training. Furthermore, we propose novel asynchronous agent RL algorithms that further improve RL quality, enabling the model to learn from complex, long-horizon interactions more effectively. Through these innovations, GLM-5 achieves state-of-the-art performance on major open benchmarks. Most critically, GLM-5 demonstrates unprecedented capability in real-world coding tasks, surpassing previous baselines in handling end-to-end software engineering challenges. Code, models, and more information are available at https://github.com/zai-org/GLM-5.