Learning to Segment Every Thing

doi:10.1109/CVPR.2018.00445

Open AccessProceedings ArticleDOI

Learning to Segment Every Thing

Ronghang Hu, +4 more

- pp 4233-4241

Chats0

TLDR

A new partially supervised training paradigm is proposed, together with a novel weight transfer function, that enables training instance segmentation models on a large set of categories all of which have box annotations, but only a small fraction ofWhich have mask annotations.

Abstract:

Most methods for object instance segmentation require all training examples to be labeled with segmentation masks. This requirement makes it expensive to annotate new categories and has restricted instance segmentation models to ~100 well-annotated classes. The goal of this paper is to propose a new partially supervised training paradigm, together with a novel weight transfer function, that enables training instance segmentation models on a large set of categories all of which have box annotations, but only a small fraction of which have mask annotations. These contributions allow us to train Mask R-CNN to detect and segment 3000 visual concepts using box annotations from the Visual Genome dataset and mask annotations from the 80 classes in the COCO dataset. We evaluate our approach in a controlled study on the COCO dataset. This work is a first step towards instance segmentation models that have broad comprehension of the visual world.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Deep Learning for Generic Object Detection: A Survey

Li Liu, +7 more

- 01 Feb 2020 -

International Journal of Computer Vision

TL;DR: A comprehensive survey of the recent achievements in this field brought about by deep learning techniques, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics.

...read moreread less

Journal ArticleDOI

UNet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation

Zongwei Zhou, +3 more

- 01 Jun 2020 -

IEEE Transactions on Medical Imaging

TL;DR: UNet++ as mentioned in this paper proposes an efficient ensemble of U-Nets of varying depths, which partially share an encoder and co-learn simultaneously using deep supervision, leading to a highly flexible feature fusion scheme.

...read moreread less

Book ChapterDOI

Unified Perceptual Parsing for Scene Understanding

Tete Xiao, +4 more

TL;DR: A multi-task framework called UPerNet and a training strategy are developed to learn from heterogeneous image annotations and it is shown that it is able to effectively segment a wide range of concepts from images.

...read moreread less

Posted Content

Image Segmentation Using Deep Learning: A Survey

Shervin Minaee, +5 more

- 15 Jan 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A comprehensive review of recent pioneering efforts in semantic and instance segmentation, including convolutional pixel-labeling networks, encoder-decoder architectures, multiscale and pyramid-based approaches, recurrent networks, visual attention models, and generative models in adversarial settings are provided.

...read moreread less

Posted Content

Activation Functions: Comparison of trends in Practice and Research for Deep Learning

Chigozie Nwankpa, +3 more

- 08 Nov 2018 -

arXiv: Learning

TL;DR: This paper will be the first, to compile the trends in AF applications in practice against the research results from literature, found in deep learning research to date.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015 -

International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

...read moreread less

Proceedings ArticleDOI

Glove: Global Vectors for Word Representation

Jeffrey Pennington, +2 more

TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.

...read moreread less

Book ChapterDOI

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.

...read moreread less

Proceedings ArticleDOI

Fully convolutional networks for semantic segmentation

Jonathan Long, +2 more

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.

...read moreread less

Collapse

International Journal of Computer Vision

Fully convolutional networks for semantic segmentation

Jonathan Long, +2 more

Feature Pyramid Networks for Object Detection

Tsung-Yi Lin, +5 more

Learning to Segment Every Thing

Citations

Deep Learning for Generic Object Detection: A Survey

UNet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation

Unified Perceptual Parsing for Scene Understanding

Image Segmentation Using Deep Learning: A Survey

Activation Functions: Comparison of trends in Practice and Research for Deep Learning

References

Deep Residual Learning for Image Recognition

ImageNet Large Scale Visual Recognition Challenge

Glove: Global Vectors for Word Representation

Microsoft COCO: Common Objects in Context

Fully convolutional networks for semantic segmentation

Related Papers (5)

Deep Residual Learning for Image Recognition

Microsoft COCO: Common Objects in Context

The Pascal Visual Object Classes (VOC) Challenge

Fully convolutional networks for semantic segmentation

Feature Pyramid Networks for Object Detection