Supervised Learning of Semantic Classes for Image Annotation and Retrieval

doi:10.1109/TPAMI.2007.61

Open Access, Journal Article

Gustavo Carneiro, +3 more

01 Mar 2007

IEEE Transactions on Pattern Analysis an...

Vol. 29, Iss. 3, pp. 394-410

TLDR

The supervised formulation is shown to achieve higher accuracy than various previously published methods at a fraction of their computational cost and to be fairly robust to parameter tuning.

Abstract:

A probabilistic formulation for semantic image annotation and retrieval is proposed. Annotation and retrieval are posed as classification problems where each class is defined as the group of database images labeled with a common semantic label. It is shown that, by establishing this one-to-one correspondence between semantic labels and semantic classes, minimum-probability-of-error annotation and retrieval are feasible with algorithms that 1) are conceptually simple, 2) are computationally efficient, and 3) do not require prior semantic segmentation of training images. In particular, images are represented as bags of localized feature vectors, a mixture density is estimated for each image, and the mixtures associated with all images annotated with a common semantic label are pooled into a density estimate for the corresponding semantic class. This pooling is justified by a multiple instance learning argument and performed efficiently with a hierarchical extension of expectation-maximization. The benefits of the supervised formulation over the more complex, and currently popular, joint modeling of semantic label and visual feature distributions are illustrated through theoretical arguments and extensive experiments. The supervised formulation is shown to achieve higher accuracy than various previously published methods at a fraction of their computational cost. Finally, the proposed method is shown to be fairly robust to parameter tuning.
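The pipeline described in the abstract (bags of feature vectors, one density per semantic class, labels ranked by posterior probability) can be illustrated with a minimal Python sketch. This is not the authors' implementation, and it makes several simplifying assumptions: a single diagonal Gaussian per class stands in for the mixture density, feature vectors are pooled directly instead of pooling per-image mixtures with hierarchical expectation-maximization, class priors are taken as relative label frequencies, and the feature vectors of a bag are treated as independent given the class. All function names and the toy data are hypothetical.

```python
import math
from collections import defaultdict


def fit_diag_gaussian(vectors):
    """Fit a diagonal Gaussian to a list of feature vectors.

    Assumption: one Gaussian per class replaces the mixture density
    used in the paper.
    """
    n = len(vectors)
    dim = len(vectors[0])
    mean = [sum(v[d] for v in vectors) / n for d in range(dim)]
    # A small variance floor keeps the density well defined for tiny classes.
    var = [
        max(sum((v[d] - mean[d]) ** 2 for v in vectors) / n, 1e-6)
        for d in range(dim)
    ]
    return mean, var


def log_density(x, model):
    """Log-density of one feature vector under a diagonal Gaussian."""
    mean, var = model
    return sum(
        -0.5 * (math.log(2 * math.pi * s) + (xi - m) ** 2 / s)
        for xi, m, s in zip(x, mean, var)
    )


def train(images):
    """Pool the bags of all images sharing a label into one class model.

    images: list of (bag_of_feature_vectors, set_of_labels).
    Assumption: direct pooling of raw vectors replaces hierarchical EM.
    """
    pooled = defaultdict(list)
    counts = defaultdict(int)
    for bag, labels in images:
        for label in labels:
            pooled[label].extend(bag)
            counts[label] += 1
    total = sum(counts.values())
    models = {label: fit_diag_gaussian(vecs) for label, vecs in pooled.items()}
    # Assumption: priors are relative label frequencies.
    priors = {label: counts[label] / total for label in counts}
    return models, priors


def annotate(bag, models, priors, top_k=1):
    """Rank labels by log prior plus the summed log-likelihood of the bag."""
    scores = {
        label: math.log(priors[label])
        + sum(log_density(x, models[label]) for x in bag)
        for label in models
    }
    return sorted(scores, key=scores.get, reverse=True)[:top_k]


# Toy two-dimensional features; the last image carries both labels, so each
# class model also absorbs vectors unrelated to its label. No segmentation
# of the training images is needed.
training = [
    ([[0.1, 0.2], [0.0, 0.1]], {"sky"}),
    ([[0.2, 0.1], [0.1, 0.0]], {"sky"}),
    ([[5.0, 5.1], [4.9, 5.2]], {"grass"}),
    ([[5.1, 4.9], [0.1, 0.1]], {"grass", "sky"}),
]
models, priors = train(training)
sky_query = [[0.05, 0.1], [0.1, 0.15]]
grass_query = [[5.0, 5.0], [4.8, 5.1]]
print(annotate(sky_query, models, priors))
print(annotate(grass_query, models, priors))
```

Retrieval can reuse the same class models: for a query label, database images would be ranked by the posterior probability of that label given each image's bag. Swapping the single Gaussian for a mixture model changes only `fit_diag_gaussian` and `log_density`.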
