Text Style Transfer with Parameter-efficient LLM Finetuning and Round-trip Translation
AI 摘要
提出了一种基于参数高效微调LLM和回译的文本风格迁移方法。
主要贡献
- 提出使用回译生成平行数据集,解决平行语料稀缺问题
- 采用参数高效微调LLM进行风格迁移
- 集成RAG增强知识和风格一致性
方法论
利用回译合成平行语料,微调LLM学习风格迁移,并结合RAG增强知识。
原文摘要
This paper proposes a novel method for Text Style Transfer (TST) based on parameter-efficient fine-tuning of Large Language Models (LLMs). Addressing the scarcity of parallel corpora that map between styles, the study employs roundtrip translation to synthesize such parallel datasets from monolingual corpora. This approach creates 'neutralized' text devoid of stylistic attributes, essentially creating a shared input style at training-time and inference-time. Experimental results demonstrate consistent superiority of this method over zero-shot prompting and fewshot ICL techniques measured by BLEU scores and style accuracy scores across four investigated domains. Furthermore, the integration of retrieval-augmented generation (RAG) for terminology and name knowledge enhances robustness and stylistic consistency.