LLM Reasoning 相关度: 9/10

SteuerLLM: Local specialized large language model for German tax law analysis

Sebastian Wind, Jeta Sopa, Laurin Schmid, Quirin Jackl, Sebastian Kiefer, Fei Wu, Martin Mayr, Harald Köstler, Gerhard Wellein, Andreas Maier, Soroosh Tayebi Arasteh
arXiv: 2602.11081v1 发布: 2026-02-11 更新: 2026-02-11

AI 摘要

SteuerLLM针对德国税法领域,通过领域数据训练,性能超越通用LLM。

主要贡献

  • 构建了德国税法领域的开放基准SteuerEx
  • 提出了领域自适应的LLM模型SteuerLLM
  • 证明了领域数据和架构适应性比参数规模更重要

方法论

算法生成基准数据集,使用检索增强管道构建大规模合成数据集,训练领域自适应LLM。

原文摘要

Large language models (LLMs) demonstrate strong general reasoning and language understanding, yet their performance degrades in domains governed by strict formal rules, precise terminology, and legally binding structure. Tax law exemplifies these challenges, as correct answers require exact statutory citation, structured legal argumentation, and numerical accuracy under rigid grading schemes. We algorithmically generate SteuerEx, the first open benchmark derived from authentic German university tax law examinations. SteuerEx comprises 115 expert-validated examination questions spanning six core tax law domains and multiple academic levels, and employs a statement-level, partial-credit evaluation framework that closely mirrors real examination practice. We further present SteuerLLM, a domain-adapted LLM for German tax law trained on a large-scale synthetic dataset generated from authentic examination material using a controlled retrieval-augmented pipeline. SteuerLLM (28B parameters) consistently outperforms general-purpose instruction-tuned models of comparable size and, in several cases, substantially larger systems, demonstrating that domain-specific data and architectural adaptation are more decisive than parameter scale for performance on realistic legal reasoning tasks. All benchmark data, training datasets, model weights, and evaluation code are released openly to support reproducible research in domain-specific legal artificial intelligence. A web-based demo of SteuerLLM is available at https://steuerllm.i5.ai.fau.de.

标签

LLM 税法 领域自适应 法律人工智能

arXiv 分类

cs.CL cs.AI cs.LG