Deakin University
Browse

File(s) under permanent embargo

A distance scaling method to improve density-based clustering

conference contribution
posted on 2018-01-01, 00:00 authored by Ye ZhuYe Zhu, K M Ting, Maia Angelova TurkedjievaMaia Angelova Turkedjieva
Density-based clustering is able to find clusters of arbitrary sizes and shapes while effectively separating noise. Despite its advantage over other types of clustering, it is well-known that most density-based algorithms face the same challenge of finding clusters with varied densities. Recently, ReScale, a principled density-ratio preprocessing technique, enables a density-based clustering algorithm to identify clusters with varied densities. However, because the technique is based on one-dimensional scaling, it does not do well in datasets which require multi-dimensional scaling. In this paper, we propose a multi-dimensional scaling method, named DScale, which rescales based on the computed distance. It overcomes the key weakness of ReScale and requires one less parameter while maintaining the simplicity of the implementation. Our empirical evaluation shows that DScale has better clustering performance than ReScale for three existing density-based algorithms, i.e., DBSCAN, OPTICS and DP, on synthetic and real-world datasets.

History

Event

Knowledge Discovery and Data Mining. Pacific-Asia Conference (22nd : 2018 : Melbourne, Victoria)

Volume

10939

Series

Lecture Notes in Computer Science

Pagination

389 - 400

Publisher

Springer

Location

Melbourne, Victoria

Place of publication

Cham, Switzerland

Start date

2018-06-03

End date

2018-06-06

ISSN

0302-9743

eISSN

1611-3349

ISBN-13

9783319930398

Language

eng

Publication classification

E Conference publication; E1 Full written paper - refereed

Editor/Contributor(s)

Dinh Phung, Vincent Tseng, Geoffrey Webb, Bao Ho, Mohadeseh Ganji, Lida Rashidi

Title of proceedings

PAKDD 2018 : Advances in Knowledge Discovery and Data Mining : Proceedings of 22nd Pacific-Asia Conference

Usage metrics

    Research Publications

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC