Deakin University
File(s) under permanent embargo

Problems and challenges of information resources producers' clustering

journal contribution
posted on 2015-04-01, 00:00 authored by A Cena, Marek Gagolewski, R Mesiar
© 2015 Elsevier Ltd. Classically, unsupervised machine learning techniques are applied to data sets with a fixed number of attributes (variables). However, many problems encountered in the field of informetrics require extending such methods so that they can be computed over a set of nonincreasingly ordered vectors of unequal lengths. Thus, in this paper, some new dissimilarity measures (metrics) are introduced and studied. With these measures we may use, e.g., hierarchical clustering algorithms to determine a partition of an input data set into sets of producers that are homogeneous not only with respect to the quality of their information resources, but also with respect to their quantity.
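
The idea in the abstract can be illustrated with a minimal sketch. It assumes a hypothetical dissimilarity that zero-pads the shorter vector, takes the Euclidean distance (a stand-in for differences in quality), and adds a penalty proportional to the difference in lengths (a stand-in for differences in quantity). This is not claimed to be one of the measures introduced in the paper; the function names, the `length_penalty` parameter, and the sample data are all made up for illustration. A plain complete-linkage agglomerative procedure then groups the producers.

```python
from itertools import combinations
from math import sqrt


def dissimilarity(x, y, length_penalty=1.0):
    """Hypothetical dissimilarity between two nonincreasingly ordered
    vectors of possibly unequal lengths (an assumption for illustration,
    not a measure taken from the paper)."""
    n, m = len(x), len(y)
    k = max(n, m)
    # Zero-pad the shorter vector so that both have k entries.
    xp = list(x) + [0.0] * (k - n)
    yp = list(y) + [0.0] * (k - m)
    # "Quality" part: Euclidean distance between the padded vectors.
    quality = sqrt(sum((a - b) ** 2 for a, b in zip(xp, yp)))
    # "Quantity" part: penalty for differing numbers of items.
    quantity = length_penalty * abs(n - m)
    return quality + quantity


def complete_linkage(vectors, n_clusters, length_penalty=1.0):
    """Naive agglomerative clustering with complete linkage.
    Returns a list of clusters, each a sorted list of input indices."""
    clusters = [[i] for i in range(len(vectors))]

    def linkage(c1, c2):
        # Complete linkage: the largest pairwise dissimilarity.
        return max(
            dissimilarity(vectors[i], vectors[j], length_penalty)
            for i in c1
            for j in c2
        )

    while len(clusters) > n_clusters:
        # Find the pair of clusters (a < b) that is closest together.
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]


# Made-up example: each producer is a nonincreasing vector of
# per-item quality scores (e.g. citation counts per paper).
producers = [
    [10, 8, 5, 2, 1],
    [9, 8, 6, 2],
    [3, 1],
    [2, 1, 1],
]

print(complete_linkage(producers, n_clusters=2))
```

With this toy data the two long, highly scored vectors end up in one cluster and the two short, low-scored vectors in the other. Raising `length_penalty` makes the number of items weigh more heavily relative to their scores.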

History

Journal: Journal of Informetrics

Volume: 9

Issue: 2

Pagination: 273 - 284

Publisher: Elsevier

Location: Amsterdam, The Netherlands

ISSN: 1751-1577

eISSN: 1875-5879

Language: eng

Publication classification: C1.1 Refereed article in a scholarly journal