LLM Reasoning 相关度: 9/10

Pedagogically-Inspired Data Synthesis for Language Model Knowledge Distillation

Bowei He, Yankai Chen, Xiaokun Zhang, Linghe Kong, Philip S. Yu, Xue Liu, Chen Ma
arXiv: 2602.12172v1 发布: 2026-02-12 更新: 2026-02-12

AI 摘要

论文提出一种受教学启发的知识蒸馏框架IOA,提升小模型在复杂推理任务上的性能。

主要贡献

  • 提出IOA框架,包含知识识别、组织和适应三个阶段
  • 结合Bloom的掌握学习原则和维果茨基的最近发展区理论
  • 在知识蒸馏任务上取得了显著的性能提升,尤其是在复杂推理任务上

方法论

IOA框架通过识别学生模型的知识缺陷,组织渐进式课程,并调整表示以匹配学生模型的认知能力进行知识蒸馏。

原文摘要

Knowledge distillation from Large Language Models (LLMs) to smaller models has emerged as a critical technique for deploying efficient AI systems. However, current methods for distillation via synthetic data lack pedagogical awareness, treating knowledge transfer as a one-off data synthesis and training task rather than a systematic learning process. In this paper, we propose a novel pedagogically-inspired framework for LLM knowledge distillation that draws from fundamental educational principles. Our approach introduces a three-stage pipeline -- Knowledge Identifier, Organizer, and Adapter (IOA) -- that systematically identifies knowledge deficiencies in student models, organizes knowledge delivery through progressive curricula, and adapts representations to match the cognitive capacity of student models. We integrate Bloom's Mastery Learning Principles and Vygotsky's Zone of Proximal Development to create a dynamic distillation process where student models approach teacher model's performance on prerequisite knowledge before advancing, and new knowledge is introduced with controlled, gradual difficulty increments. Extensive experiments using LLaMA-3.1/3.2 and Qwen2.5 as student models demonstrate that IOA achieves significant improvements over baseline distillation methods, with student models retaining 94.7% of teacher performance on DollyEval while using less than 1/10th of the parameters. Our framework particularly excels in complex reasoning tasks, showing 19.2% improvement on MATH and 22.3% on HumanEval compared with state-of-the-art baselines.

标签

知识蒸馏 大型语言模型 教学法 推理

arXiv 分类

cs.AI cs.CL