AI Agents 相关度: 8/10

Can LLM Agents Identify Spoken Dialects like a Linguist?

Tobias Bystrich, Lukas Hamm, Maria Hassan, Lea Fischbach, Lucie Flek, Akbar Karimi
arXiv: 2603.29541v1 发布: 2026-03-31 更新: 2026-03-31

AI 摘要

研究LLM作为agent在方言分类任务中的能力,并与传统模型和人类专家进行比较。

主要贡献

  • 评估LLM在方言分类中的表现
  • 结合语音转录和语言学资源
  • 提供LLM和人类专家的基线

方法论

使用ASR生成的语音转录,结合方言特征图等语言学资源,输入LLM进行方言分类。

原文摘要

Due to the scarcity of labeled dialectal speech, audio dialect classification is a challenging task for most languages, including Swiss German. In this work, we explore the ability of large language models (LLMs) as agents in understanding the dialects and whether they can show comparable performance to models such as HuBERT in dialect classification. In addition, we provide an LLM baseline and a human linguist one. Our approach uses phonetic transcriptions produced by ASR systems and combines them with linguistic resources such as dialect feature maps, vowel history, and rules. Our findings indicate that, when linguistic information is provided, the LLM predictions improve. The human baseline shows that automatically generated transcriptions can be beneficial for such classifications, but also presents opportunities for improvement.

标签

LLM 方言分类 语音识别 自然语言处理

arXiv 分类

cs.CL