Deakin University
Browse

File(s) under permanent embargo

Grouping points by shared subspaces for effective subspace clustering

Version 2 2024-06-04, 12:10
Version 1 2018-07-11, 11:19
journal contribution
posted on 2024-06-04, 12:10 authored by Ye ZhuYe Zhu, KM Ting, MJ Carman
Clusters may exist in different subspaces of a multidimensional dataset. Traditional full-space clustering algorithms have difficulty in identifying these clusters. Various subspace clustering algorithms have used different subspace search strategies. They require clustering to assess whether cluster(s) exist in a subspace. In addition, all of them perform clustering by measuring similarity between points in the given feature space. As a result, the subspace selection and clustering processes are tightly coupled. In this paper, we propose a new subspace clustering framework named CSSub (Clustering by Shared Subspaces). It enables neighbouring core points to be clustered based on the number of subspaces they share. It explicitly splits candidate subspace selection and clustering into two separate processes, enabling different types of cluster definitions to be employed easily. Through extensive experiments on synthetic and real-world datasets, we demonstrate that CSSub discovers non-redundant subspace clusters with arbitrary shapes in noisy data; and it significantly outperforms existing state-of-the-art subspace clustering algorithms.

History

Journal

Pattern recognition

Volume

83

Pagination

230-244

Location

Amsterdam, The Netherlands

ISSN

0031-3203

Language

eng

Publication classification

C Journal article, C1 Refereed article in a scholarly journal

Copyright notice

2018, Elsevier Ltd.

Publisher

Elsevier