Deakin University
Browse

Hierarchical clustering via penalty-based aggregation and the Genie approach

Version 2 2024-06-12, 15:08
Version 1 2016-01-01, 00:00
conference contribution
posted on 2024-06-12, 15:08 authored by M Gagolewski, A Cena, M Bartoszuk
© Springer International Publishing Switzerland 2016. The paper discusses a generalization of the nearest centroid hierarchical clustering algorithm. A first extension deals with the incorporation of generic distance-based penalty minimizers instead of the classical aggregation by means of centroids. Due to that the presented algorithm can be applied in spaces equipped with an arbitrary dissimilarity measure (images, DNA sequences, etc.). Secondly, a correction preventing the formation of clusters of too highly unbalanced sizes is applied: just like in the recently introduced Genie approach, which extends the single linkage scheme, the new method averts a chosen inequity measure (e.g., the Gini-, de Vergottini-, or Bonferroni-index) of cluster sizes from raising above a predefined threshold. Numerous benchmarks indicate that the introduction of such a correction increases the quality of the resulting clusterings significantly.

History

Related Materials

Location

Sant Julià de Lòria, Andorra

Language

eng

Publication classification

E1.1 Full written paper - refereed

Volume

9880

Pagination

191-202

Start date

2016-09-19

End date

2016-09-21

ISSN

0302-9743

eISSN

1611-3349

ISBN-13

9783319456553

Title of proceedings

MDAI 2016 :Modeling decisions for artificial intelligence : 13th International Conference, MDAI 2016, Sant Julià de Lòria, Andorra, September 19-21, 2016. Proceedings

Event

Modeling Decisions for Artificial Intelligence. Conference (2016 : 13th : Sant Julià de Lòria, Andorra)

Publisher

Springer

Place of publication

Berlin, Germany

Series

Lecture Notes in Computer Science

Usage metrics

    Research Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC