Learning sparse latent representation and distance metric for image retrieval

Nguyen, Tu Dinh; Truyen, T; Phung, Quoc-Dinh; Venkatesh, Svetha

File(s) under permanent embargo

Learning sparse latent representation and distance metric for image retrieval

conference contribution

posted on 2013-01-01, 00:00 authored by Tu Dinh Nguyen, T Truyen, Quoc-Dinh Phung, Svetha VenkateshSvetha Venkatesh

The performance of image retrieval depends critically on the semantic representation and the distance function used to estimate the similarity of two images. A good representation should integrate multiple visual and textual (e.g., tag) features and offer a step closer to the true semantics of interest (e.g., concepts). As the distance function operates on the representation, they are interdependent, and thus should be addressed at the same time. We propose a probabilistic solution to learn both the representation from multiple feature types and modalities and the distance metric from data. The learning is regularised so that the learned representation and information-theoretic metric will (i) preserve the regularities of the visual/textual spaces, (ii) enhance structured sparsity, (iii) encourage small intra-concept distances, and (iv) keep inter-concept images separated. We demonstrate the capacity of our method on the NUS-WIDE data. For the well-studied 13 animal subset, our method outperforms state-of-the-art rivals. On the subset of single-concept images, we gain 79:5% improvement over the standard nearest neighbours approach on the MAP score, and 45.7% on the NDCG.

History

Event

Multimedia and Expo. IEEE International Conference (14th : 2013 : San Jose, California)

Pagination

1 - 6

Publisher

IEEE

Location

San Jose, California

Place of publication

Piscataway, N.J.

Publisher DOI

https://doi.org/10.1109/ICME.2013.6607435

Start date

2013-07-15

End date

2013-07-19

ISBN-13

9781479900152

Language

eng

Publication classification

E1 Full written paper - refereed

Copyright notice

2013, IEEE

Title of proceedings

ICME 2013 : Proceedings of the 14th IEEE International Conference on Multimedia and Expo

Usage metrics

Keywords

image retrieval mixed-variate NUS-WIDE restricted Boltzmann machines metric learning sparsity Science & Technology Technology Computer Science, Software Engineering Computer Science, Theory & Methods Engineering, Electrical & Electronic Computer Science Engineering

Licence

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

File(s) under permanent embargo

Learning sparse latent representation and distance metric for image retrieval

History

Event

Pagination

Publisher

Location

Place of publication

Publisher DOI

Start date

End date

ISBN-13

Language

Publication classification

Copyright notice

Title of proceedings

Usage metrics

Categories

Keywords

Licence

Exports