Deakin University
Browse

Con2Vec: Learning embedding representations for contrast sets

Version 2 2024-06-06, 02:46
Version 1 2021-08-21, 15:12
journal contribution
posted on 2024-06-06, 02:46 authored by Dang NguyenDang Nguyen, Wei LuoWei Luo, B Vo, LTT Nguyen, W Pedrycz
Contrast sets are used in many knowledge-based systems to capture data patterns relevant to a target variable. While they have many advantages such as being highly interpretable, they do not come with a similarity measure or feature vectors for downstream tasks such as regression or classification. To address these disadvantages, we propose Con2Vec (Contrast set to Vector), a method to embed contrast sets into a low-dimensional continuous vector space. Con2Vec defines two novel similarity and co-occurrence contexts for a contrast set, and then leverages a neural embedding model to learn low-dimensional continuous vectors (aka embeddings) for contrast sets. We further apply contrast set embeddings to construct the feature vectors for transactional data. We extensively evaluate our method Con2Vec on four real-world datasets, compared against state-of-the-art embedding and non-embedding methods where the results demonstrate the clear advantages of our method.

History

Journal

Knowledge-Based Systems

Volume

229

Article number

ARTN 107382

Pagination

1 - 10

Location

Amsterdam, The Netherlands

ISSN

0950-7051

eISSN

1872-7409

Language

English

Publication classification

C1 Refereed article in a scholarly journal

Publisher

ELSEVIER