Hierarchical clustering via penalty-based aggregation and the Genie approach
Version 2 2024-06-12, 15:08Version 2 2024-06-12, 15:08
Version 1 2016-01-01, 00:00Version 1 2016-01-01, 00:00
conference contribution
posted on 2024-06-12, 15:08 authored by M Gagolewski, A Cena, M Bartoszuk© Springer International Publishing Switzerland 2016. The paper discusses a generalization of the nearest centroid hierarchical clustering algorithm. A first extension deals with the incorporation of generic distance-based penalty minimizers instead of the classical aggregation by means of centroids. Due to that the presented algorithm can be applied in spaces equipped with an arbitrary dissimilarity measure (images, DNA sequences, etc.). Secondly, a correction preventing the formation of clusters of too highly unbalanced sizes is applied: just like in the recently introduced Genie approach, which extends the single linkage scheme, the new method averts a chosen inequity measure (e.g., the Gini-, de Vergottini-, or Bonferroni-index) of cluster sizes from raising above a predefined threshold. Numerous benchmarks indicate that the introduction of such a correction increases the quality of the resulting clusterings significantly.
History
Related Materials
- 1.
Location
Sant Julià de Lòria, AndorraLanguage
engPublication classification
E1.1 Full written paper - refereedVolume
9880Pagination
191-202Start date
2016-09-19End date
2016-09-21ISSN
0302-9743eISSN
1611-3349ISBN-13
9783319456553Title of proceedings
MDAI 2016 :Modeling decisions for artificial intelligence : 13th International Conference, MDAI 2016, Sant Julià de Lòria, Andorra, September 19-21, 2016. ProceedingsEvent
Modeling Decisions for Artificial Intelligence. Conference (2016 : 13th : Sant Julià de Lòria, Andorra)Publisher
SpringerPlace of publication
Berlin, GermanySeries
Lecture Notes in Computer ScienceUsage metrics
Categories
No categories selectedKeywords
Licence
Exports
RefWorksRefWorks
BibTeXBibTeX
Ref. managerRef. manager
EndnoteEndnote
DataCiteDataCite
NLMNLM
DCDC

