Hierarchical Part Matching for Fine-Grained Visual Categorization

doi:10.1109/ICCV.2013.206

Proceedings Article

- pp 1641-1648

TLDR

A powerful flowchart named Hierarchical Part Matching (HPM) is proposed to cope with fine-grained classification tasks and achieves the state-of-the-art classification accuracy in the Caltech-UCSD-Birds-200-2011 dataset by making full use of the ground-truth part annotations.

Abstract:

As a special topic in computer vision, fine-grained visual categorization (FGVC) has attracted growing attention in recent years. Unlike traditional image classification tasks, in which objects have large inter-class variation, the visual concepts in fine-grained datasets, such as hundreds of bird species, often have very similar semantics. Because of this large inter-class similarity, it is very difficult to classify the objects without locating truly discriminative features, so it becomes more important for the algorithm to make full use of part information in order to train a robust model. In this paper, we propose a powerful flowchart named Hierarchical Part Matching (HPM) to cope with fine-grained classification tasks. We extend the Bag-of-Features (BoF) model by integrating several novel modules into the image representation, including foreground inference and segmentation, Hierarchical Structure Learning (HSL), and Geometric Phrase Pooling (GPP). Our experiments verify that the algorithm achieves state-of-the-art classification accuracy on the Caltech-UCSD-Birds-200-2011 dataset by making full use of the ground-truth part annotations.
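For readers unfamiliar with the Bag-of-Features baseline that the abstract says HPM extends, the following is a minimal sketch of the generic pipeline only: local descriptors are hard-assigned to the nearest codeword and sum-pooled into a normalized histogram. It is not the authors' implementation and does not cover foreground inference, HSL, or GPP. The function names, the toy codebook, and the toy descriptors are all assumptions made for illustration.

```python
# Minimal Bag-of-Features sketch: hard-assign local descriptors to a
# codebook and pool the assignments into a normalized histogram.
# The codebook and descriptors below are toy values, not data from the paper.
from math import dist


def nearest_codeword(descriptor, codebook):
    """Return the index of the codeword closest to the descriptor (Euclidean)."""
    return min(range(len(codebook)), key=lambda k: dist(descriptor, codebook[k]))


def bof_histogram(descriptors, codebook):
    """Sum-pool hard assignments into an L1-normalized histogram."""
    counts = [0] * len(codebook)
    for d in descriptors:
        counts[nearest_codeword(d, codebook)] += 1
    total = sum(counts)
    # An image with no descriptors maps to an all-zero histogram.
    return [c / total for c in counts] if total else [0.0] * len(codebook)


# Hypothetical 2-D codebook with three codewords and four descriptors.
codebook = [(0.0, 0.0), (1.0, 1.0), (4.0, 0.0)]
descriptors = [(0.1, 0.2), (0.9, 1.1), (1.2, 0.8), (3.9, 0.1)]
histogram = bof_histogram(descriptors, codebook)
```

In a real system the codebook would typically be learned from training descriptors (for example by clustering SIFT features), and the resulting histograms would be fed to a linear classifier; both choices are left out of this sketch.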

Citations

Book Chapter

Part-Based R-CNNs for Fine-Grained Category Detection

Ning Zhang, +3 more

TL;DR: In this article, the authors propose a model for fine-grained categorization that leverages deep convolutional features computed on bottom-up region proposals. The model learns whole-object and part detectors, enforces learned geometric constraints between them, and predicts a fine-grained category from a pose-normalized representation.

Book Chapter

Learning to Navigate for Fine-grained Classification

Ze Yang, +5 more

TL;DR: In this paper, a self-supervision mechanism is proposed to locate informative regions without the need for bounding-box or part annotations; the model consists of a navigator agent, a teacher agent, and a scrutinizer agent.

Proceedings Article

Deep LAC: Deep localization, alignment and classification for fine-grained recognition

Di Lin, +3 more

TL;DR: A valve linkage function (VLF) for back-propagation chaining is proposed to form the deep localization, alignment and classification (LAC) system and can adaptively compromise the errors of classification and alignment when training the LAC model.

Proceedings Article

Picking Deep Filter Responses for Fine-Grained Image Recognition

Xiaopeng Zhang, +4 more

TL;DR: In this article, the authors propose a unified framework based on two steps of deep filter response picking. One step picks filter responses to find distinctive filters that respond to specific patterns significantly and consistently, and learns a set of part detectors by iteratively alternating between positive sample mining and part model retraining.

Journal Article

Object-Part Attention Model for Fine-Grained Image Classification.

Yuxin Peng, +2 more

- 01 Mar 2018 -

IEEE Transactions on Image Processing

TL;DR: The object-part attention model (OPAM) is proposed for weakly supervised fine-grained image classification. It integrates two levels of attention: object-level attention localizes the objects in images, and part-level attention selects discriminative parts of the object.

References

Journal Article

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

Distinctive Image Features from Scale-Invariant Keypoints

Matthijs Dorst

TL;DR: The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images that can then be used to reliably match objects in differing images.

Journal Article

Object Detection with Discriminatively Trained Part-Based Models

Pedro F. Felzenszwalb, +3 more

- 01 Sep 2010 -

IEEE Transactions on Pattern Analysis an...

TL;DR: An object detection system based on mixtures of multiscale deformable part models that is able to represent highly variable object classes and achieves state-of-the-art results in the PASCAL object detection challenges is described.

Proceedings Article

Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

Svetlana Lazebnik, +2 more

TL;DR: This paper presents a method for recognizing scene categories based on approximate global geometric correspondence that exceeds the state of the art on the Caltech-101 database and achieves high accuracy on a large database of fifteen natural scene categories.

Journal Article

LIBLINEAR: A Library for Large Linear Classification

Rong-En Fan, +4 more

- 01 Jun 2008 -

Journal of Machine Learning Research

TL;DR: LIBLINEAR is an open source library for large-scale linear classification that supports logistic regression and linear support vector machines and provides easy-to-use command-line tools and library calls for users and developers.

arXiv: Computer Vision and Pattern Recog...

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

Related Papers (5)

The Caltech-UCSD Birds-200-2011 Dataset

Bilinear CNN Models for Fine-Grained Visual Recognition

3D Object Representations for Fine-Grained Categorization

Fine-Grained Visual Classification of Aircraft

Very Deep Convolutional Networks for Large-Scale Image Recognition