LLM Reasoning 相关度: 9/10

Multilingual Reasoning Gym: Multilingual Scaling of Procedural Reasoning Environments

Konstantin Dobler, Simon Lehnerer, Federico Scozzafava, Jonathan Janke, Mohamed Ali
arXiv: 2603.10793v1 发布: 2026-03-11 更新: 2026-03-11

AI 摘要

提出了多语言推理环境Multilingual Reasoning Gym,可生成14种语言的推理问题。

主要贡献

  • 扩展了Reasoning Gym到多语言环境
  • 构建了包含14种语言的推理问题数据集
  • 实现了大规模跨语言并行数据生成

方法论

通过翻译和调整Reasoning Gym的模板,结合母语者验证,生成多语言推理问题,并保持可验证性。

原文摘要

We present the Multilingual Reasoning Gym, an extension of Reasoning Gym (Stojanovski et al., 2025), that procedurally generates verifiable reasoning problems across 14 languages. We translate templates for 94 tasks with native-speaker validation in 10 languages and targeted code or template adaptations to ensure linguistic naturalness. The Multilingual Reasoning Gym preserves the core benefits of the procedural generation approach used in the original Reasoning Gym, such as virtually unlimited problem instance generation and adjustable difficulty, and remains directly usable for Reinforcement Learning from Verifiable Rewards and evaluation settings. Problems in the Multilingual Reasoning Gym are parallel across languages, enabling crosslingually parallel data generation at massive scale due to the procedural nature of the environments. We release our implementation to support research into multilingual reasoning models.

标签

多语言 推理 数据集 强化学习

arXiv 分类

cs.CL