Unbiased multivariate correlation analysis
Version 2 2024-06-18, 21:35Version 2 2024-06-18, 21:35
Version 1 2020-07-07, 08:43Version 1 2020-07-07, 08:43
conference contribution
posted on 2024-06-18, 21:35 authored by Y Wang, S Romano, V Nguyen, J Bailey, X Ma, ST XiaCopyright © 2017, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. Correlation measures are a key element of statistics and machine learning, and essential for a wide range of data analysis tasks. Most existing correlation measures are for pairwise relationships, but real-world data can also exhibit complex multivariate correlations, involving three or more variables. We argue that multivariate correlation measures should be comparable, interpretable, scalable and unbiased. However, no existing measures satisfy all these requirements. In this paper, we propose an unbiased multivariate correlation measure, called UMC, which satisfies all the above criteria. UMC is a cumulative entropy based non-parametric multi-variate correlation measure, which can capture both linear and non-linear correlations for groups of three or more variables. It employs a correction for chance using a statistical model of independence to address the issue of bias. UMC has high interpretability and we empirically show it outperforms state-of-the-art multivariate correlation measures in terms of statistical power, as well as for use in both subspace clustering and outlier detection tasks.
History
Pagination
2754-2760Location
San Francisco, CaliforniaStart date
2017-02-04End date
2017-02-10Language
engPublication classification
E1.1 Full written paper - refereedTitle of proceedings
AAAI 2017 : Proceedings of the 31st AAAI Conference on Artificial IntelligenceEvent
AAAI Conference on Artificial Intelligence. Conference (2017 : 31st : San Francisco, California)Publisher
AAAIPlace of publication
[San Francisco, Calif.]Usage metrics
Licence
Exports
RefWorksRefWorks
BibTeXBibTeX
Ref. managerRef. manager
EndnoteEndnote
DataCiteDataCite
NLMNLM
DCDC