A Deformable Mixture Parsing Model with Parselets

doi:10.1109/ICCV.2013.423

Proceedings ArticleDOI

A Deformable Mixture Parsing Model with Parselets

Jian Dong, +4 more

- pp 3408-3415

Chats0

TLDR

The Deformable Mixture Parsing Model (DMPM) thus directly solves the problem of human parsing by searching for the best graph configuration from a pool of Parse let hypotheses without intermediate tasks.

Abstract:

In this work, we address the problem of human parsing, namely partitioning the human body into semantic regions, by using the novel Parselet representation. Previous works often consider solving the problem of human pose estimation as the prerequisite of human parsing. We argue that these approaches cannot obtain optimal pixel level parsing due to the inconsistent targets between these tasks. In this paper, we propose to use Parselets as the building blocks of our parsing model. Parselets are a group of parsable segments which can generally be obtained by low-level over-segmentation algorithms and bear strong semantic meaning. We then build a Deformable Mixture Parsing Model (DMPM) for human parsing to simultaneously handle the deformation and multi-modalities of Parselets. The proposed model has two unique characteristics: (1) the possible numerous modalities of Parse let ensembles are exhibited as the ``And-Or" structure of sub-trees, (2) to further solve the practical problem of Parselet occlusion or absence, we directly model the visibility property at some leaf nodes. The DMPM thus directly solves the problem of human parsing by searching for the best graph configuration from a pool of Parse let hypotheses without intermediate tasks. Comprehensive evaluations demonstrate the encouraging performance of the proposed approach.

Citations

PDF

Open Access

More filters

The PASCAL Visual Object Classes Challenge

Jianguo Zhang

Proceedings ArticleDOI

Look into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing

Ke Gong, +4 more

TL;DR: A new benchmark Look into Person (LIP) is introduced that makes a significant advance in terms of scalability, diversity and difficulty, and a novel self-supervised structure-sensitive learning approach, which imposes human pose structures into parsing results without resorting to extra supervision.

...read moreread less

Proceedings ArticleDOI

Where to Buy It: Matching Street Clothing Photos in Online Shops

M. Hadi Kiapour, +4 more

TL;DR: Three different methods for Exact Street to Shop retrieval are developed, including two deep learning baseline methods, and a method to learn a similarity measure between the street and shop domains.

...read moreread less

Journal ArticleDOI

Superpixels: An evaluation of the state-of-the-art

David Stutz, +2 more

- 01 Jan 2018 -

Computer Vision and Image Understanding

TL;DR: An overall ranking of superpixel algorithms is presented which redefines the state-of-the-art and enables researchers to easily select appropriate algorithms and the corresponding implementations which themselves are made publicly available as part of the authors' benchmark at http://www.davidstutz.de/projects/superpixel-benchmark/ .

...read moreread less

Journal ArticleDOI

Look into Person: Joint Body Parsing & Pose Estimation Network and a New Benchmark

Xiaodan Liang, +3 more

- 01 Apr 2019 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A new benchmark named “Look into Person (LIP)” is introduced that provides a significant advancement in terms of scalability, diversity, and difficulty, which are crucial for future developments in human-centric analysis.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004 -

International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

Proceedings ArticleDOI

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

...read moreread less

Distinctive Image Features from Scale-Invariant Keypoints

Matthijs Dorst

TL;DR: The Scale-Invariant Feature Transform (or SIFT) algorithm is a highly robust method to extract and consequently match distinctive invariant features from images that can then be used to reliably match objects in diering images.

...read moreread less

Journal ArticleDOI

Object Detection with Discriminatively Trained Part-Based Models

Pedro F. Felzenszwalb, +3 more

- 01 Sep 2010 -

IEEE Transactions on Pattern Analysis an...

TL;DR: An object detection system based on mixtures of multiscale deformable part models that is able to represent highly variable object classes and achieves state-of-the-art results in the PASCAL object detection challenges is described.

...read moreread less

Journal ArticleDOI

SLIC Superpixels Compared to State-of-the-Art Superpixel Methods

Radhakrishna Achanta, +5 more

- 01 Nov 2012 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A new superpixel algorithm is introduced, simple linear iterative clustering (SLIC), which adapts a k-means clustering approach to efficiently generate superpixels and is faster and more memory efficient, improves segmentation performance, and is straightforward to extend to supervoxel generation.

...read moreread less

Collapse

IEEE Transactions on Pattern Analysis an...

Describing clothing by semantic attributes

Huizhong Chen, +2 more

A Deformable Mixture Parsing Model with Parselets

Citations

The PASCAL Visual Object Classes Challenge

Look into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing

Where to Buy It: Matching Street Clothing Photos in Online Shops

Superpixels: An evaluation of the state-of-the-art

Look into Person: Joint Body Parsing & Pose Estimation Network and a New Benchmark

References

Distinctive Image Features from Scale-Invariant Keypoints

Histograms of oriented gradients for human detection

Distinctive Image Features from Scale-Invariant Keypoints

Object Detection with Discriminatively Trained Part-Based Models

SLIC Superpixels Compared to State-of-the-Art Superpixel Methods

Related Papers (5)

Parsing clothing in fashion photographs

Fully convolutional networks for semantic segmentation

Deep Residual Learning for Image Recognition

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

Describing clothing by semantic attributes