posted on 2003-01-01, 00:00authored byGang LiGang Li, Honghua Dai, Yiqing Tu
One common drawback in algorithms for learning Linear Causal Models is that they can not deal with incomplete data set. This is unfortunate since many real problems involve missing data or even hidden variable. In this paper, based on multiple imputation, we propose a three-step process to learn linear causal models from incomplete data set. Experimental results indicate that this algorithm is better than the single imputation method (EM algorithm) and the simple list deletion method, and for lower missing rate, this algorithm can even find models better than the results from the greedy learning algorithm MLGS working in a complete data set. In addition, the method is amenable to parallel or distributed processing, which is an important characteristic for data mining in large data sets.
History
Pagination
1 - 11
Location
Chiang Mai Plaza, Thailand
Open access
Yes
Start date
2003-12-17
End date
2003-12-19
ISBN-13
9789746581516
ISBN-10
9746581511
Language
eng
Notes
Every reasonable effort has been made to ensure that permission has been obtained for items included in Deakin Research Online. If you believe that your rights have been infringed by this repository, please contact drosupport@deakin.edu.au