scispace - formally typeset
Proceedings ArticleDOI

Symbiotic Segmentation and Part Localization for Fine-Grained Categorization

TLDR
The model builds a model of the base-level category that can be fitted to images, producing high-quality foreground segmentation and mid-level part localizations, and improves the categorization accuracy over the state-of-the-art.
Abstract
We propose a new method for the task of fine-grained visual categorization The method builds a model of the base-level category that can be fitted to images, producing high-quality foreground segmentation and mid-level part localizations The model can be learnt from the typical datasets available for fine-grained categorization, where the only annotation provided is a loose bounding box around the instance (eg bird) in each image Both segmentation and part localizations are then used to encode the image content into a highly-discriminative visual signature The model is symbiotic in that part discovery/localization is helped by segmentation and, conversely, the segmentation is helped by the detection (eg part layout) Our model builds on top of the part-based object category detector of Felzenszwalb et al, and also on the powerful Grab Cut segmentation algorithm of Rother et al, and adds a simple spatial saliency coupling between them In our evaluation, the model improves the categorization accuracy over the state-of-the-art It also improves over what can be achieved with an analogous system that runs segmentation and part-localization independently

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Bilinear CNN Models for Fine-Grained Visual Recognition

TL;DR: Blinear models, a recognition architecture that consists of two feature extractors whose outputs are multiplied using outer product at each location of the image and pooled to obtain an image descriptor, are proposed.
Book ChapterDOI

Part-Based R-CNNs for Fine-Grained Category Detection

TL;DR: In this article, the authors propose a model for fine-grained categorization by leveraging deep convolutional features computed on bottom-up region proposals, which learns whole-object and part detectors, enforces learned geometric constraints between them, and predicts a finegrained category from a pose normalized representation.
Posted Content

Bilinear CNN Models for Fine-grained Visual Recognition

TL;DR: This paper proposed bilinear models, which consists of two feature extractors whose outputs are multiplied using outer product at each location of the image and pooled to obtain an image descriptor, which can model local pairwise feature interactions in a translationally invariant manner.
Proceedings ArticleDOI

The application of two-level attention models in deep convolutional neural network for fine-grained image classification

TL;DR: This paper proposes to apply visual attention to fine-grained classification task using deep neural network and achieves the best accuracy under the weakest supervision condition, and is competitive against other methods that rely on additional annotations.
Proceedings ArticleDOI

Fine-grained recognition without part annotations

TL;DR: This work proposes a method for fine-grained recognition that uses no part annotations, based on generating parts using co-segmentation and alignment, which is combined in a discriminative mixture.
References
More filters
Journal ArticleDOI

Object Detection with Discriminatively Trained Part-Based Models

TL;DR: An object detection system based on mixtures of multiscale deformable part models that is able to represent highly variable object classes and achieves state-of-the-art results in the PASCAL object detection challenges is described.
Journal ArticleDOI

"GrabCut": interactive foreground extraction using iterated graph cuts

TL;DR: A more powerful, iterative version of the optimisation of the graph-cut approach is developed and the power of the iterative algorithm is used to simplify substantially the user interaction needed for a given quality of result.
Journal ArticleDOI

An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision

TL;DR: This paper compares the running times of several standard algorithms, as well as a new algorithm that is recently developed that works several times faster than any of the other methods, making near real-time performance possible.
Proceedings ArticleDOI

Locality-constrained Linear Coding for image classification

TL;DR: This paper presents a simple but effective coding scheme called Locality-constrained Linear Coding (LLC) in place of the VQ coding in traditional SPM, using the locality constraints to project each descriptor into its local-coordinate system, and the projected coordinates are integrated by max pooling to generate the final representation.
Book ChapterDOI

Improving the fisher kernel for large-scale image classification

TL;DR: In an evaluation involving hundreds of thousands of training images, it is shown that classifiers learned on Flickr groups perform surprisingly well and that they can complement classifier learned on more carefully annotated datasets.
Related Papers (5)