Problems and challenges of information resources producers' clustering
Version 2 2024-06-13, 13:19Version 2 2024-06-13, 13:19
Version 1 2019-10-09, 08:29Version 1 2019-10-09, 08:29
journal contribution
posted on 2024-06-13, 13:19 authored by A Cena, M Gagolewski, R Mesiar© 2015 Elsevier Ltd. Classically, unsupervised machine learning techniques are applied on data sets with fixed number of attributes (variables). However, many problems encountered in the field of informetrics face us with the need to extend these kinds of methods in a way such that they may be computed over a set of nonincreasingly ordered vectors of unequal lengths. Thus, in this paper, some new dissimilarity measures (metrics) are introduced and studied. Owing to that we may use, e.g. hierarchical clustering algorithms in order to determine an input data set's partition consisting of sets of producers that are homogeneous not only with respect to the quality of information resources, but also their quantity.
History
Journal
Journal of InformetricsVolume
9Pagination
273-284Location
Amsterdam, The NetherlandsPublisher DOI
ISSN
1751-1577eISSN
1875-5879Language
engPublication classification
C1.1 Refereed article in a scholarly journalIssue
2Publisher
ElsevierUsage metrics
Categories
Licence
Exports
RefWorksRefWorks
BibTeXBibTeX
Ref. managerRef. manager
EndnoteEndnote
DataCiteDataCite
NLMNLM
DCDC