Deakin University
Browse

File(s) under permanent embargo

Approximate cluster heat maps of large high-dimensional data

Version 2 2024-06-06, 07:07
Version 1 2019-03-04, 15:59
conference contribution
posted on 2024-06-06, 07:07 authored by P Rathore, JC Bezdek, D Kumar, Sutharshan RajasegararSutharshan Rajasegarar, M Palaniswami
The problem of determining whether clusters are present in numerical data (tendency assessment) is an important first step of cluster analysis. One tool for cluster tendency assessment is the visual assessment of tendency (VAT) algorithm. VAT and improved VAT (iVAT) produce an image that provides visual evidence about the number of clusters to seek in the original dataset. These methods have been successful in determining potential cluster structure in various datasets, but they can be computationally expensive for datasets with a very large number of samples. A scalable version of iVAT called siVAT approximates iVAT images, but siVAT can be computationally expensive for big datasets. In this article, we introduce a modification of siVAT called siVAT+ which approximates cluster heat maps for large volumes of high dimensional data much more rapidly than siVAT. We compare siVAT+ with siVAT on six large, high dimensional datasets. Experimental results confirm that siVAT+ obtains images similar to siVAT images in a few seconds, and is 8 - 55 times faster than siVAT.

History

Pagination

195-200

Location

Beijing, China

Start date

2018-08-20

End date

2018-08-24

ISSN

1051-4651

ISBN-13

9781538637883

Language

eng

Publication classification

E Conference publication, E1 Full written paper - refereed

Copyright notice

2018, IEEE

Editor/Contributor(s)

[Unknown]

Title of proceedings

ICPR 2018 : Proceedings of the 24th International Conference on Pattern Recognition

Event

International Association for Pattern Recognition. Conference (24th : 2018 : Beijing, China)

Publisher

Institute of Electrical and Electronics Engineers

Place of publication

Piscataway, N.J.