Assessing and Improving the Quality of SKOS Vocabularies

Abstract

Controlled vocabularies are increasingly made available on the Web of Data using the SKOS ontology. Assessment of vocabulary quality is important for determining the suitability of vocabularies for reuse in applications and for improving vocabulary development processes. We define 26 quality issues, i.e., computable functions that expose potential quality problems. In an analysis of a representative set of 24 SKOS vocabularies, we found all of them to contain structural errors and/or other quality problems. We propose a set of correction heuristics which we have used to automatically correct a significant proportion of the identified problems. Our reference implementations of these methods, the quality assessment tool qSKOS and the quality improvement tool Skosify, are available for reuse as open source software.
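To make the notion of a quality issue as a "computable function" concrete, the following is a minimal sketch of two such checks over a toy in-memory vocabulary. Everything here is an assumption made for illustration: the dictionary layout, the names `concepts_without_labels`, `concepts_in_broader_cycles`, and `EXAMPLE_VOCAB`, and the choice of checks (missing preferred labels, cycles in the broader hierarchy). It is not the qSKOS or Skosify implementation, and these checks are not claimed to be among the 26 issues defined in the paper.

```python
from typing import Dict, List

# Hypothetical toy representation (an assumption, not the SKOS data model itself):
# concept id -> {"prefLabel": {language: label}, "broader": [concept ids]}
Vocabulary = Dict[str, dict]


def concepts_without_labels(vocab: Vocabulary) -> List[str]:
    """Return ids of concepts that have no preferred label in any language."""
    return sorted(cid for cid, data in vocab.items() if not data.get("prefLabel"))


def concepts_in_broader_cycles(vocab: Vocabulary) -> List[str]:
    """Return ids of concepts that can reach themselves by following 'broader' links."""

    def reaches_self(start: str) -> bool:
        stack = list(vocab[start].get("broader", []))
        seen = set()
        while stack:
            node = stack.pop()
            if node == start:
                return True
            if node in seen or node not in vocab:
                continue  # already visited, or a dangling reference
            seen.add(node)
            stack.extend(vocab[node].get("broader", []))
        return False

    return sorted(cid for cid in vocab if reaches_self(cid))


# Illustrative data only; not taken from any of the analysed vocabularies.
EXAMPLE_VOCAB: Vocabulary = {
    "animal": {"prefLabel": {"en": "Animal"}, "broader": []},
    "cat": {"prefLabel": {"en": "Cat"}, "broader": ["animal"]},
    "a": {"prefLabel": {"en": "A"}, "broader": ["b"]},
    "b": {"prefLabel": {}, "broader": ["a"]},
}

if __name__ == "__main__":
    print("Missing labels:", concepts_without_labels(EXAMPLE_VOCAB))
    print("In broader cycles:", concepts_in_broader_cycles(EXAMPLE_VOCAB))
```

Each function takes a vocabulary and returns the concepts it flags, so a report can be assembled by running a list of such functions over the same input. A correction heuristic would then act on the flagged concepts; that step is not sketched here.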

Authors
  • Suominen, Osma
  • Mader, Christian
Shortfacts
Category: Journal Paper
Divisions: Multimedia Information Systems
Journal or Publication Title: Journal on Data Semantics
ISSN: 1861-2032 (Print), 1861-2040 (Online)
Publisher: Springer
Page Range: pp. 47-73
Number: 1
Volume: 3
Date: March 2014