LLM Reasoning 相关度: 7/10

Effective QA-driven Annotation of Predicate-Argument Relations Across Languages

Jonathan Davidov, Aviv Slobodkin, Shmuel Tomi Klein, Reut Tsarfaty, Ido Dagan, Ayal Klein
arXiv: 2602.22865v1 发布: 2026-02-26 更新: 2026-02-26

AI 摘要

论文提出了一种利用QA-SRL框架,通过跨语言迁移实现多语言语义角色标注的方法。

主要贡献

  • 提出基于QA-SRL的跨语言语义角色标注方法
  • 设计了约束翻译和词对齐的pipeline自动生成标注
  • 在希伯来语、俄语、法语上验证了方法的有效性

方法论

利用英文QA-SRL解析器,结合约束翻译和词对齐技术,自动生成目标语言的QA-SRL标注数据,并训练特定语言的解析器。

原文摘要

Explicit representations of predicate-argument relations form the basis of interpretable semantic analysis, supporting reasoning, generation, and evaluation. However, attaining such semantic structures requires costly annotation efforts and has remained largely confined to English. We leverage the Question-Answer driven Semantic Role Labeling (QA-SRL) framework -- a natural-language formulation of predicate-argument relations -- as the foundation for extending semantic annotation to new languages. To this end, we introduce a cross-linguistic projection approach that reuses an English QA-SRL parser within a constrained translation and word-alignment pipeline to automatically generate question-answer annotations aligned with target-language predicates. Applied to Hebrew, Russian, and French -- spanning diverse language families -- the method yields high-quality training data and fine-tuned, language-specific parsers that outperform strong multilingual LLM baselines (GPT-4o, LLaMA-Maverick). By leveraging QA-SRL as a transferable natural-language interface for semantics, our approach enables efficient and broadly accessible predicate-argument parsing across languages.

标签

语义角色标注 QA-SRL 跨语言迁移学习 自然语言处理

arXiv 分类

cs.CL