Deakin University
Browse

Information-decomposition-model-based missing value estimation for not missing at random dataset

Version 2 2024-06-06, 05:43
Version 1 2018-04-23, 16:27
journal contribution
posted on 2024-06-06, 05:43 authored by Shigang Liu, H Dai, Min Gan
Missing data estimation is an important strategy for improving learning performance in learning from incomplete data, especially, when there are non discardable records with missing values. However, most of the existing algorithms are focused on missing at random (MAR) or missing completely at random (MCAR), and less attention has been paid to data not missing at random (NMAR). In this paper, an information decomposition imputation (IDIM) algorithm using fuzzy membership function is proposed for addressing the missing value problem under NMAR. Firstly, the proposed IDIM algorithm is presented with detailed examples. Then, the proposed approach is evaluated with extensive experiments compared with some typical algorithms. The experimental results demonstrate that the proposed algorithm has higher accuracy than the exiting imputation approaches in terms of normal root mean square error (NRMSE) and TP?TN evaluation under different missing strategies.

History

Journal

International journal of machine learning and cybernetics

Volume

9

Pagination

85-95

Location

Berlin, Germany

ISSN

1868-8071

eISSN

1868-808X

Language

eng

Publication classification

C Journal article, C1 Refereed article in a scholarly journal

Copyright notice

2015, Springer-Verlag Berlin Heidelberg

Issue

1

Publisher

Springer