CT-index: Fingerprint-based graph indexing combining cycles and trees

CT-index: Fingerprint-based graph indexing combining cycles and trees

Abstract

Efficient subgraph queries in large databases are a time-critical task in many application areas as e.g. biology or chemistry, where biological networks or chemical compounds are modeled as graphs. The NP-completeness of the underlying subgraph isomorphism problem renders an exact subgraph test for each database graph infeasible. Therefore efficient methods have to be found that avoid most of these tests but still allow to identify all graphs containing the query pattern. We propose a new approach based on the filter-verification paradigm, using a new hash-key fingerprint technique with a combination of tree and cycle features for filtering and a new subgraph isomorphism test for verification. Our approach is able to cope with edge and vertex labels and also allows to use wild card patterns for the search. We present an experimental comparison of our approach with state-of-the-art methods using a benchmark set of both real world and generated graph instances that shows its practicability. Our approach is implemented as part of the Scaffold Hunter software, a tool for the visual analysis of chemical compound databases.

Grafik Top
Authors
  • Klein, Karsten
  • Kriege, Nils M.
  • Mutzel, Petra
Grafik Top
Shortfacts
Category
Paper in Conference Proceedings or in Workshop Proceedings (Paper)
Event Title
27th International Conference on Data Engineering (ICDE)
Divisions
Data Mining and Machine Learning
Event Location
Hannover, Germany
Event Type
Conference
Event Dates
11.-16.04.2011
Series Name
Proceedings of the 2011 IEEE 27th International Conference on Data Engineering
ISSN/ISBN
978-1-4244-8959-6
Publisher
IEEE Computer Society
Page Range
pp. 1115-1126
Date
11 April 2011
Official URL
https://doi.org/10.1109/ICDE.2011.5767909
Export
Grafik Top