scispace - formally typeset
Open AccessPosted Content

Associative Embedding:End-to-End Learning for Joint Detection and Grouping

Reads0
Chats0
TLDR
Associative embedding is introduced, a novel method for supervising convolutional neural networks for the task of detection and grouping for multi-person pose estimation and state-of-the-art performance on the MPII and MS-COCO datasets is reported.
Abstract
We introduce associative embedding, a novel method for supervising convolutional neural networks for the task of detection and grouping. A number of computer vision problems can be framed in this manner including multi-person pose estimation, instance segmentation, and multi-object tracking. Usually the grouping of detections is achieved with multi-stage pipelines, instead we propose an approach that teaches a network to simultaneously output detections and group assignments. This technique can be easily integrated into any state-of-the-art network architecture that produces pixel-wise predictions. We show how to apply this method to both multi-person pose estimation and instance segmentation and report state-of-the-art performance for multi-person pose on the MPII and MS-COCO datasets.

read more

Citations
More filters
Proceedings ArticleDOI

Cascaded Pyramid Network for Multi-person Pose Estimation

TL;DR: A novel network structure called Cascaded Pyramid Network (CPN) is presented which targets to relieve the problem from these "hard" keypoints, with state-of-art results on the COCO keypoint benchmark, with average precision at 73.0.
Proceedings ArticleDOI

SGN: Sequential Grouping Networks for Instance Segmentation

TL;DR: This paper proposes Sequential Grouping Networks, a sequence of neural networks, each solving a sub-grouping problem of increasing semantic complexity in order to gradually compose objects out of pixels to tackle the problem of object instance segmentation.
Posted Content

Recurrent Pixel Embedding for Instance Grouping

TL;DR: In this paper, a differentiable, end-to-end trainable framework for solving pixel-level grouping problems such as instance segmentation consisting of two novel components is introduced. But the choice of embedding dimension and margin, relating them to theoretical results on the problem of distributing points uniformly on the sphere, is discussed.
Journal ArticleDOI

Multi-animal pose estimation, identification and tracking with DeepLabCut

TL;DR: In this article , a pose estimation toolbox for multi-animal tracking is presented, which integrates the ability to predict an animal's identity to assist tracking in case of occlusions.
Posted Content

Multiple-Human Parsing in the Wild

TL;DR: This work introduces a new multi-human parsing dataset and a novel multi- human parsing model named MH-Parser, which generates global parsing maps and person instance masks simultaneously in a bottom-up fashion with the help of a new Graph-GAN model.
References
More filters
Journal ArticleDOI

Normalized cuts and image segmentation

TL;DR: This work treats image segmentation as a graph partitioning problem and proposes a novel global criterion, the normalized cut, for segmenting the graph, which measures both the total dissimilarity between the different groups as well as the total similarity within the groups.
Posted Content

Distributed Representations of Words and Phrases and their Compositionality

TL;DR: In this paper, the Skip-gram model is used to learn high-quality distributed vector representations that capture a large number of precise syntactic and semantic word relationships and improve both the quality of the vectors and the training speed.
Proceedings Article

Mask R-CNN

TL;DR: This work presents a conceptually simple, flexible, and general framework for object instance segmentation that outperforms all existing, single-model entries on every task, including the COCO 2016 challenge winners.
Journal ArticleDOI

The Pascal Visual Object Classes Challenge: A Retrospective

TL;DR: A review of the Pascal Visual Object Classes challenge from 2008-2012 and an appraisal of the aspects of the challenge that worked well, and those that could be improved in future challenges.
Journal ArticleDOI

Distance Metric Learning for Large Margin Nearest Neighbor Classification

TL;DR: This paper shows how to learn a Mahalanobis distance metric for kNN classification from labeled examples in a globally integrated manner and finds that metrics trained in this way lead to significant improvements in kNN Classification.
Related Papers (5)