ACTC: Active Threshold Calibration for Cold-Start Knowledge Graph Completion
Self-supervised knowledge-graph completion (KGC) relies on estimating a scoring model over (entity, relation, entity)-tuples, for example, by embedding an initial knowledge graph. Prediction quality can be improved by calibrating the scoring model, typically by adjusting the prediction thresholds using manually annotated examples. In this paper, we attempt for the first time cold-start calibration for KGC, where no annotated examples exist initially for calibration, and only a limited number of tuples can be selected for annotation. Our new method ACTC finds good per-relation thresholds efficiently based on a limited set of annotated tuples. Additionally to a few annotated tuples, ACTC also leverages unlabeled tuples by estimating their correctness with Logistic Regression or Gaussian Process classifiers. We also experiment with different methods for selecting candidate tuples for annotation: density-based and random selection. Experiments with five scoring models and an oracle annotator show an improvement of 7% points when using ACTC in the challenging setting with an annotation budget of only 10 tuples, and an average improvement of 4% points over different budgets.
Top- Sedova, Anastasiia
- Roth, Benjamin
Category |
Paper in Conference Proceedings or in Workshop Proceedings (Poster) |
Event Title |
The 61st Annual Meeting of the Association for Computational Linguistics, 2023 |
Divisions |
Data Mining and Machine Learning |
Subjects |
Kuenstliche Intelligenz Sprachverarbeitung |
Event Location |
Toronto, Canada |
Event Type |
Conference |
Event Dates |
July 9-14, 2023 |
Series Name |
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) |
Page Range |
pp. 1853-1863 |
Date |
July 2023 |
Export |