Agent Tuning & Optimization 相关度: 7/10

HiFlow: Hierarchical Feedback-Driven Optimization for Constrained Long-Form Text Generation

Yifan Zhu, Guanting Chen, Bing Wei, Haoran Luo
arXiv: 2603.04996v1 发布: 2026-03-05 更新: 2026-03-05

AI 摘要

HiFlow通过层级反馈优化框架,提升LLM在约束条件下生成长文本的能力,实现全局结构和局部语义的协同。

主要贡献

  • 提出了一种层级反馈驱动的优化框架HiFlow
  • 设计了包含规划层和生成层的两级优化过程
  • 引入约束感知的计划筛选和闭环反馈机制

方法论

HiFlow采用分层优化策略,规划层建模全局结构和约束,生成层进行条件文本生成,通过反馈机制联合优化。

原文摘要

Large language models perform well in short text generation but still struggle with long text generation, particularly under complex constraints. Such tasks involve multiple tightly coupled objectives, including global structural consistency, local semantic coherence, and constraint feasibility, forming a challenging constrained optimization problem. Existing approaches mainly rely on static planning or offline supervision, limiting effective coordination between global and local objectives during generation. To address these challenges, we propose HiFlow, a hierarchical feedback-driven optimization framework for constrained long text generation. HiFlow formulates generation as a two-level optimization process, consisting of a planning layer for global structure and constraint modeling, and a generation layer for conditioned text generation. By incorporating constraint-aware plan screening and closed-loop feedback at both levels, HiFlow enables joint optimization of planning quality and generation behavior, progressively guiding the model toward high-quality, constraint-satisfying outputs. Experiments on multiple backbones confirm HiFlow's effectiveness over baseline methods.

标签

长文本生成 约束优化 反馈学习 语言模型

arXiv 分类

cs.CL