AI Agents 相关度: 7/10

LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance

Ioannis Prokopiou, Ioannis Sina, Agisilaos Kounelis, Pantelis Vikatos, Themos Stafylakis
arXiv: 2603.04293v1 发布: 2026-03-04 更新: 2026-03-04

AI 摘要

LabelBuddy是一款开源的、支持AI辅助的音乐和音频标注工具,旨在弥合人类意图与机器理解之间的差距。

主要贡献

  • 开源协作音频标注工具
  • 支持容器化的AI辅助预标注
  • 支持多用户共识

方法论

通过容器化后端解耦界面与推理,允许用户插入自定义模型进行AI辅助预标注,支持多用户协作和模型隔离。

原文摘要

The advancement of Machine learning (ML), Large Audio Language Models (LALMs), and autonomous AI agents in Music Information Retrieval (MIR) necessitates a shift from static tagging to rich, human-aligned representation learning. However, the scarcity of open-source infrastructure capable of capturing the subjective nuances of audio annotation remains a critical bottleneck. This paper introduces \textbf{LabelBuddy}, an open-source collaborative auto-tagging audio annotation tool designed to bridge the gap between human intent and machine understanding. Unlike static tools, it decouples the interface from inference via containerized backends, allowing users to plug in custom models for AI-assisted pre-annotation. We describe the system architecture, which supports multi-user consensus, containerized model isolation, and a roadmap for extending agents and LALMs. Code available at https://github.com/GiannisProkopiou/gsoc2022-Label-buddy.

标签

音频标注 音乐信息检索 AI辅助 开源工具 多用户协作

arXiv 分类

cs.SD cs.AI cs.IR cs.LG