Learning meaningful and effective representations for transaction data is a crucial prerequisite for transaction classification and clustering tasks. Traditional methods which use frequent itemsets (FIs) as features often suffer from the data sparsity and high-dimensionality problems. Several supervised methods based on discriminative FIs have been proposed to address these disadvantages, but they require transaction labels, thus rendering them inapplicable to real-world applications where labels are not given. In this paper, we propose an unsupervised method which learns low-dimensional continuous vectors for transactions based on information of both singleton items and FIs. We demonstrate the superior performance of our proposed method in classifying transactions on four datasets compared with several state-of-the-art baselines.
History
Volume
10939
Pagination
361-372
Location
Melbourne, Victoria
Start date
2018-06-03
End date
2018-06-06
ISSN
0302-9743
eISSN
1611-3349
ISBN-13
9783319930398
Language
eng
Publication classification
E Conference publication, E1 Full written paper - refereed
Editor/Contributor(s)
Phung D, Tseng V, Webb G, Ho B, Ganji M, Rashidi L
Title of proceedings
PAKDD 2018 : Advances in Knowledge Discovery and Data Mining : Proceedings of 22nd Pacific-Asia Conference
Event
Knowledge Discovery and Data Mining. Pacific-Asia Conference (22nd : 2018 : Melbourne, Victoria)