Joint analysis of multiple data sources is becoming increasingly popular in transfer learning, multi-task learning and cross-domain data mining. One promising approach to model the data jointly is through learning the shared and individual factor subspaces. However, performance of this approach depends on the subspace dimensionalities and the level of sharing needs to be specified a priori. To this end, we propose a nonparametric joint factor analysis framework for modeling multiple related data sources. Our model utilizes the hierarchical beta process as a nonparametric prior to automatically infer the number of shared and individual factors. For posterior inference, we provide a Gibbs sampling scheme using auxiliary variables. The effectiveness of the proposed framework is validated through its application on two real world problems - transfer learning in text and image retrieval.
History
Event
International Conference on Data Mining (12th : 2012 : Anaheim, Calif.)
Pagination
200 - 212
Publisher
Society for Industrial and Applied Mathemations (SIAM)
Location
Anaheim, Calif.
Place of publication
Anaheim, Calif.
Start date
2012-04-26
End date
2012-04-28
ISBN-13
9781611972320
Language
eng
Publication classification
E1 Full written paper - refereed
Copyright notice
2012, by the Society for Industrial and Applied Mathematics
Title of proceedings
SDM 2012 : Proceedings of the 12th SIAM International Conference on Data Mining