Proceedings ArticleDOI

Paper Doll Parsing: Retrieving Similar Styles to Parse Clothing Items

TLDR
This paper tackles the clothing parsing problem using a retrieval-based approach that combines parsing from: pre-trained global clothing models, local clothing models learned on the fly from retrieved examples, and transferred parse masks (paper doll item transfer) from retrieved examples.
Abstract
Clothing recognition is an extremely challenging problem due to wide variation in clothing item appearance, layering, and style. In this paper, we tackle the clothing parsing problem using a retrieval-based approach. For a query image, we find similar styles from a large database of tagged fashion images and use these examples to parse the query. Our approach combines parsing from: pre-trained global clothing models, local clothing models learned on the fly from retrieved examples, and transferred parse masks (paper doll item transfer) from retrieved examples. Experimental evaluation shows that our approach significantly outperforms the state of the art in parsing accuracy.
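The abstract describes fusing three parsing sources: global models, locally learned models, and transferred parse masks. As a minimal sketch only, assuming each source yields a per-pixel label-probability map and using a simple weighted average (the weights and combination rule here are illustrative assumptions, not the paper's exact second-stage model):

```python
import numpy as np

# Hypothetical per-pixel label probabilities from the three parsing
# sources named in the abstract; each has shape (H, W, num_labels).
H, W, L = 4, 4, 3
rng = np.random.default_rng(0)

def normalize(p):
    """Normalize scores along the label axis so each pixel sums to 1."""
    return p / p.sum(axis=-1, keepdims=True)

global_probs = normalize(rng.random((H, W, L)))    # pre-trained global models
local_probs = normalize(rng.random((H, W, L)))     # models learned on the fly
transfer_probs = normalize(rng.random((H, W, L)))  # transferred parse masks

# Illustrative combination rule: a convex weighted average of the three maps.
weights = (0.4, 0.3, 0.3)
combined = (weights[0] * global_probs
            + weights[1] * local_probs
            + weights[2] * transfer_probs)

# Final parse: the most likely clothing label at each pixel.
parse = combined.argmax(axis=-1)
print(parse.shape)
```

In practice such per-pixel predictions are typically smoothed afterwards (e.g. with a graph-cut-based energy minimization, as the reference list below suggests) rather than taken pixel-independently.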



Citations
Proceedings ArticleDOI

Image-Based Recommendations on Styles and Substitutes

TL;DR: The approach is not based on fine-grained modeling of user annotations but rather on capturing the largest dataset possible and developing a scalable method for uncovering human notions of the visual relationships within.
Proceedings ArticleDOI

DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations

TL;DR: This work introduces DeepFashion, a large-scale clothes dataset with comprehensive annotations, and proposes a new deep model, namely FashionNet, which learns clothing features by jointly predicting clothing attributes and landmarks.
Proceedings ArticleDOI

Hypercolumns for object segmentation and fine-grained localization

TL;DR: In this paper, the authors define the hypercolumn at a pixel as the vector of activations of all CNN units above that pixel, and use hypercolumns as pixel descriptors.
Posted Content

Hypercolumns for Object Segmentation and Fine-grained Localization

TL;DR: This work defines the hypercolumn at a pixel as the vector of activations of all CNN units above that pixel, uses hypercolumns as pixel descriptors, and shows results on fine-grained localization tasks including simultaneous detection and segmentation, and keypoint localization.
Proceedings ArticleDOI

Look into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing

TL;DR: A new benchmark Look into Person (LIP) is introduced that makes a significant advance in terms of scalability, diversity and difficulty, and a novel self-supervised structure-sensitive learning approach, which imposes human pose structures into parsing results without resorting to extra supervision.
References
Journal Article

LIBLINEAR: A Library for Large Linear Classification

TL;DR: LIBLINEAR is an open source library for large-scale linear classification that supports logistic regression and linear support vector machines and provides easy-to-use command-line tools and library calls for users and developers.
Journal ArticleDOI

Fast approximate energy minimization via graph cuts

TL;DR: This work presents two algorithms based on graph cuts that efficiently find a local minimum with respect to two types of large moves, namely expansion moves and swap moves, which enable optimization of important cases of discontinuity-preserving energies.
Journal ArticleDOI

Efficient Graph-Based Image Segmentation

TL;DR: An efficient segmentation algorithm is developed based on a predicate for measuring the evidence for a boundary between two regions using a graph-based representation of the image and it is shown that although this algorithm makes greedy decisions it produces segmentations that satisfy global properties.
Journal ArticleDOI

An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision

TL;DR: This paper compares the running times of several standard algorithms, as well as a recently developed algorithm that runs several times faster than any of the other methods, making near real-time performance possible.
Proceedings ArticleDOI

Vlfeat: an open and portable library of computer vision algorithms

TL;DR: VLFeat is an open and portable library of computer vision algorithms that includes rigorous implementations of common building blocks such as feature detectors, feature extractors, (hierarchical) k-means clustering, randomized kd-tree matching, and super-pixelization.