Deep Embedded Non-Redundant Clustering

Abstract

Complex data types like images can be clustered in multiple valid ways. Non-redundant clustering aims to extract these meaningful groupings by discouraging redundancy between clusterings. Unfortunately, clustering images directly in pixel space has been shown to work unsatisfactorily. This has increased interest in combining the high representational power of deep learning with clustering, termed deep clustering. Algorithms of this type combine the non-linear embedding of an autoencoder with a clustering objective and optimize both simultaneously. None of these algorithms tries to find multiple non-redundant clusterings. In this paper, we propose the novel Embedded Non-Redundant Clustering algorithm (ENRC). It is the first algorithm that combines neural-network-based representation learning with non-redundant clustering. ENRC can find multiple highly non-redundant clusterings of different dimensionalities within a data set. This is achieved by (softly) assigning each dimension of the embedded space to the different clusterings. For instance, in image data sets it can group the objects by color, material and shape, without the need for explicit feature engineering. We show the viability of ENRC in extensive experiments and empirically demonstrate the advantage of combining non-linear representation learning with non-redundant clustering.
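
The idea of softly assigning embedding dimensions to clusterings can be illustrated with a small sketch. This is not the ENRC implementation: the softmax weighting, the weighted squared distance, and all names (`soft_dimension_weights`, `assign_clusters`, the toy points, logits and centers) are assumptions chosen only for illustration, and the autoencoder and joint optimization described in the abstract are omitted.

```python
import numpy as np

def soft_dimension_weights(logits):
    """Softmax over clusterings (axis 0) for every embedding dimension.

    logits has shape (n_clusterings, n_dims); each column of the result
    sums to 1, so every dimension is shared softly among the clusterings.
    """
    shifted = logits - logits.max(axis=0, keepdims=True)  # numerical stability
    exp = np.exp(shifted)
    return exp / exp.sum(axis=0, keepdims=True)

def assign_clusters(z, centers, weights):
    """Assign each embedded point to its nearest center under a
    dimension-weighted squared Euclidean distance.

    z: (n, d) embedded points, centers: (k, d), weights: (d,).
    """
    diff = z[:, None, :] - centers[None, :, :]   # (n, k, d)
    dist = (weights * diff ** 2).sum(axis=2)     # (n, k)
    return dist.argmin(axis=1)

# Toy 2-D "embedding": dimension 0 and dimension 1 carry different groupings.
z = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]])

# Hypothetical logits: clustering A mostly owns dim 0, clustering B dim 1.
logits = np.array([[4.0, -4.0],
                   [-4.0, 4.0]])
weights = soft_dimension_weights(logits)

centers_a = np.array([[0.0, 0.5], [1.0, 0.5]])
centers_b = np.array([[0.5, 0.0], [0.5, 1.0]])

labels_a = assign_clusters(z, centers_a, weights[0])
labels_b = assign_clusters(z, centers_b, weights[1])
print(labels_a, labels_b)
```

In this toy setting the two label vectors split the same four points along different axes, which is the sense in which two clusterings of one embedded space can be non-redundant.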

Authors
  • Miklautz, Lukas
  • Mautz, Dominik
  • Altinigneli, M. C.
  • Böhm, Christian
  • Plant, Claudia
Shortfacts
Category
Paper in Conference Proceedings or in Workshop Proceedings (Paper)
Event Title
34th AAAI Conference on Artificial Intelligence
Divisions
Data Mining and Machine Learning
Subjects
Artificial Intelligence
Computer Vision
Event Location
New York
Event Type
Conference
Event Dates
07-12 Feb 2020
Page Range
pp. 5174-5181
Date
7 February 2020