Multilingual Reasoning Gym: Multilingual Scaling of Procedural Reasoning Environments
AI 摘要
提出了多语言推理环境Multilingual Reasoning Gym,可生成14种语言的推理问题。
主要贡献
- 扩展了Reasoning Gym到多语言环境
- 构建了包含14种语言的推理问题数据集
- 实现了大规模跨语言并行数据生成
方法论
通过翻译和调整Reasoning Gym的模板,结合母语者验证,生成多语言推理问题,并保持可验证性。
原文摘要
We present the Multilingual Reasoning Gym, an extension of Reasoning Gym (Stojanovski et al., 2025), that procedurally generates verifiable reasoning problems across 14 languages. We translate templates for 94 tasks with native-speaker validation in 10 languages and targeted code or template adaptations to ensure linguistic naturalness. The Multilingual Reasoning Gym preserves the core benefits of the procedural generation approach used in the original Reasoning Gym, such as virtually unlimited problem instance generation and adjustable difficulty, and remains directly usable for Reinforcement Learning from Verifiable Rewards and evaluation settings. Problems in the Multilingual Reasoning Gym are parallel across languages, enabling crosslingually parallel data generation at massive scale due to the procedural nature of the environments. We release our implementation to support research into multilingual reasoning models.