Associative Embedding: End-to-End Learning for Joint Detection and Grouping

Open Access, Posted Content

Alejandro Newell, +2 more

- 16 Nov 2016 -

arXiv: Computer Vision and Pattern Recog...

TLDR

Associative embedding, a novel method for supervising convolutional neural networks for detection and grouping, is introduced and applied to multi-person pose estimation, with state-of-the-art performance reported on the MPII and MS-COCO datasets.

Abstract:

We introduce associative embedding, a novel method for supervising convolutional neural networks for the task of detection and grouping. A number of computer vision problems can be framed in this manner, including multi-person pose estimation, instance segmentation, and multi-object tracking. Usually the grouping of detections is achieved with multi-stage pipelines; instead, we propose an approach that teaches a network to simultaneously output detections and group assignments. This technique can be easily integrated into any state-of-the-art network architecture that produces pixel-wise predictions. We show how to apply this method to both multi-person pose estimation and instance segmentation and report state-of-the-art performance for multi-person pose on the MPII and MS-COCO datasets.
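
To make "simultaneously output detections and group assignments" concrete, the following is a minimal sketch of one way such outputs could be decoded. It assumes, beyond what the abstract states, that each detection carries a scalar tag and that detections with nearby tags belong to the same instance. The function name `group_by_tag`, the `(label, score, tag)` tuple layout, and the `tag_threshold` value are all hypothetical choices for illustration, not the authors' implementation.

```python
from typing import Dict, List, Tuple

# Hypothetical detection layout: (label, detection score, scalar tag).
Detection = Tuple[str, float, float]


def group_by_tag(detections: List[Detection],
                 tag_threshold: float = 0.5) -> List[List[Detection]]:
    """Greedily group detections whose tags lie close together.

    Detections are visited in order of decreasing score. Each one joins the
    existing group whose mean tag is nearest, provided the distance is below
    tag_threshold; otherwise it starts a new group.
    """
    groups: List[Dict] = []
    for det in sorted(detections, key=lambda d: d[1], reverse=True):
        tag = det[2]
        best = None
        best_dist = tag_threshold
        for group in groups:
            dist = abs(group["mean_tag"] - tag)
            if dist < best_dist:
                best, best_dist = group, dist
        if best is None:
            groups.append({"members": [det], "mean_tag": tag})
        else:
            best["members"].append(det)
            # Incremental update of the group's running mean tag.
            n = len(best["members"])
            best["mean_tag"] += (tag - best["mean_tag"]) / n
    return [group["members"] for group in groups]


if __name__ == "__main__":
    # Illustrative values only: two people, each with a head and a wrist.
    dets = [("head", 0.9, 1.02), ("head", 0.8, 3.10),
            ("wrist", 0.7, 0.95), ("wrist", 0.6, 2.98)]
    for members in group_by_tag(dets):
        print([label for label, _, _ in members])
```

In this sketch the grouping step needs no separate pipeline stage of its own: once the network has produced a tag for each detection, assignment reduces to comparing tag values.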

Citations

Proceedings Article

Cascaded Pyramid Network for Multi-person Pose Estimation

Yilun Chen, +5 more

TL;DR: A novel network structure called Cascaded Pyramid Network (CPN) is presented, which aims to relieve the problem posed by "hard" keypoints and achieves state-of-the-art results on the COCO keypoint benchmark, with an average precision of 73.0.

Proceedings Article

SGN: Sequential Grouping Networks for Instance Segmentation

Shu Liu, +3 more

TL;DR: This paper proposes Sequential Grouping Networks, a sequence of neural networks, each solving a sub-grouping problem of increasing semantic complexity in order to gradually compose objects out of pixels to tackle the problem of object instance segmentation.

Posted Content

Recurrent Pixel Embedding for Instance Grouping

Shu Kong, +1 more

- 22 Dec 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: In this paper, a differentiable, end-to-end trainable framework consisting of two novel components is introduced for solving pixel-level grouping problems such as instance segmentation. The choice of embedding dimension and margin is also discussed, relating them to theoretical results on the problem of distributing points uniformly on the sphere.

Posted Content

Multiple-Human Parsing in the Wild

Jianshu Li, +7 more

- 19 May 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work introduces a new multi-human parsing dataset and a novel multi-human parsing model named MH-Parser, which generates global parsing maps and person instance masks simultaneously in a bottom-up fashion with the help of a new Graph-GAN model.

Multi-animal pose estimation, identification and tracking with DeepLabCut

References

Journal Article

Normalized cuts and image segmentation

Jianbo Shi, +1 more

- 01 Aug 2000 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work treats image segmentation as a graph partitioning problem and proposes a novel global criterion, the normalized cut, for segmenting the graph, which measures both the total dissimilarity between the different groups and the total similarity within the groups.

Posted Content

Distributed Representations of Words and Phrases and their Compositionality

Tomas Mikolov, +4 more

- 16 Oct 2013 -

arXiv: Computation and Language

TL;DR: This paper presents several extensions to the Skip-gram model, which learns high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships; the extensions improve both the quality of the vectors and the training speed.

Proceedings Article

Mask R-CNN

Kaiming He, +3 more

TL;DR: This work presents a conceptually simple, flexible, and general framework for object instance segmentation that outperforms all existing, single-model entries on every task, including the COCO 2016 challenge winners.

Journal Article

The Pascal Visual Object Classes Challenge: A Retrospective

Mark Everingham, +5 more

- 01 Jan 2015 -

International Journal of Computer Vision

TL;DR: A review of the Pascal Visual Object Classes challenge from 2008-2012 and an appraisal of the aspects of the challenge that worked well, and those that could be improved in future challenges.

Journal Article

Distance Metric Learning for Large Margin Nearest Neighbor Classification

Kilian Q. Weinberger, +1 more

- 01 Dec 2009 -

Journal of Machine Learning Research

TL;DR: This paper shows how to learn a Mahalanobis distance metric for kNN classification from labeled examples in a globally integrated manner and finds that metrics trained in this way lead to significant improvements in kNN classification.

Related Papers (5)

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

Stacked Hourglass Networks for Human Pose Estimation

Alejandro Newell, +2 more

- 22 Mar 2016 -

arXiv: Computer Vision and Pattern Recog...

Deep Residual Learning for Image Recognition

Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields

Deep High-Resolution Representation Learning for Human Pose Estimation