LLM Reasoning 相关度: 6/10

Understanding Wikidata Qualifiers: An Analysis and Taxonomy

Gilles Falquet, Sahar Aljalbout
arXiv: 2603.11767v1 发布: 2026-03-12 更新: 2026-03-12

AI 摘要

深入分析Wikidata限定词的语义和用法,构建分类体系,优化知识图谱查询。

主要贡献

  • 提出了Wikidata限定词的分类体系
  • 分析了限定词的使用频率和多样性
  • 为创建和查询语句提供指导

方法论

分析Wikidata转储,基于频率和多样性,使用修正的香农熵指数,将Top300限定词分类。

原文摘要

This paper presents an in-depth analysis of Wikidata qualifiers, focusing on their semantics and actual usage, with the aim of developing a taxonomy that addresses the challenges of selecting appropriate qualifiers, querying the graph, and making logical inferences. The study evaluates qualifier importance based on frequency and diversity, using a modified Shannon entropy index to account for the "long tail" phenomenon. By analyzing a Wikidata dump, the top 300 qualifiers were selected and categorized into a refined taxonomy that includes contextual, epistemic/uncertainty, structural, and additional qualifiers. The taxonomy aims to guide contributors in creating and querying statements, improve qualifier recommendation systems, and enhance knowledge graph design methodologies. The results show that the taxonomy effectively covers the most important qualifiers and provides a structured approach to understanding and utilizing qualifiers in Wikidata.

标签

Wikidata 知识图谱 限定词 语义分析 分类

arXiv 分类

cs.AI