Agent Tuning & Optimization 相关度: 6/10

Training data generation for context-dependent rubric-based short answer grading

Pavel Šindelář, Dávid Slivka, Christopher Bouma, Filip Prášil, Ondřej Bojar

arXiv: 2603.28537v1 发布: 2026-03-30 更新: 2026-03-30

下载 PDF arXiv 页面

AI 摘要

针对PISA测试，论文探索利用小规模保密数据集生成大规模训练数据的方法，以提升自动阅卷效果。

主要贡献

提出基于简单文本格式生成大规模训练数据集的方法
成功创建了三个与参考数据集相似的替代数据集
初步实验表明该方法可能改进模型训练

方法论

利用小规模保密数据集，通过简单的文本格式变换生成大规模的替代数据集，并进行初步模型训练实验。

原文摘要

Every 4 years, the PISA test is administered by the OECD to test the knowledge of teenage students worldwide and allow for comparisons of educational systems. However, having to avoid language differences and annotator bias makes the grading of student answers challenging. For these reasons, it would be interesting to compare methods of automatic student answer grading. To train some of these methods, which require machine learning, or to compute parameters or select hyperparameters for those that do not, a large amount of domain-specific data is needed. In this work, we explore a small number of methods for creating a large-scale training dataset using only a relatively small confidential dataset as a reference, leveraging a set of very simple derived text formats to preserve confidentiality. Using these methods, we successfully created three surrogate datasets that are, at the very least, superficially more similar to the reference dataset than purely the result of prompt-based generation. Early experiments suggest one of these approaches might also lead to improved model training.

arXiv 分类

cs.CL

AI 摘要

主要贡献

方法论

原文摘要

标签

arXiv 分类