Modelling multilevel data in multimedia : A hierarchical factor analysis approach

Gupta,S, Phung,D and Venkatesh,S 2014, Modelling multilevel data in multimedia : A hierarchical factor analysis approach, Multimedia Tools and Applications, pp. 1-23, doi: 10.1007/s11042-014-2394-3.

Attached Files
Name Description MIMEType Size Downloads

Title Modelling multilevel data in multimedia : A hierarchical factor analysis approach
Author(s) Gupta,SORCID iD for Gupta,S
Phung,DORCID iD for Phung,D
Venkatesh,SORCID iD for Venkatesh,S
Journal name Multimedia Tools and Applications
Start page 1
End page 23
Total pages 23
Publisher Springer
Place of publication New York, New York
Publication date 2014-12-12
ISSN 1380-7501
Keyword(s) Bayesian nonparametrics
Beta process
Dirichlet process
Multilevel data
Semantic gap
Summary Multimedia content understanding research requires rigorous approach to deal with the complexity of the data. At the crux of this problem is the method to deal with multilevel data whose structure exists at multiple scales and across data sources. A common example is modeling tags jointly with images to improve retrieval, classification and tag recommendation. Associated contextual observation, such as metadata, is rich that can be exploited for content analysis. A major challenge is the need for a principal approach to systematically incorporate associated media with the primary data source of interest. Taking a factor modeling approach, we propose a framework that can discover low-dimensional structures for a primary data source together with other associated information. We cast this task as a subspace learning problem under the framework of Bayesian nonparametrics and thus the subspace dimensionality and the number of clusters are automatically learnt from data instead of setting these parameters a priori. Using Beta processes as the building block, we construct random measures in a hierarchical structure to generate multiple data sources and capture their shared statistical at the same time. The model parameters are inferred efficiently using a novel combination of Gibbs and slice sampling. We demonstrate the applicability of the proposed model in three applications: image retrieval, automatic tag recommendation and image classification. Experiments using two real-world datasets show that our approach outperforms various state-of-the-art related methods.
Language eng
DOI 10.1007/s11042-014-2394-3
Field of Research 080109 Pattern Recognition and Data Mining
Socio Economic Objective 970108 Expanding Knowledge in the Information and Computing Sciences
HERDC Research category C1 Refereed article in a scholarly journal
ERA Research output type C Journal article
Copyright notice ©2014, Springer
Persistent URL

Connect to link resolver
Unless expressly stated otherwise, the copyright for items in DRO is owned by the author, with all rights reserved.

Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 0 times in TR Web of Science
Scopus Citation Count Cited 0 times in Scopus
Google Scholar Search Google Scholar
Access Statistics: 710 Abstract Views, 1 File Downloads  -  Detailed Statistics
Created: Wed, 01 Apr 2015, 15:26:15 EST

Every reasonable effort has been made to ensure that permission has been obtained for items included in DRO. If you believe that your rights have been infringed by this repository, please contact