Spatial Memory for Context Reasoning in Object Detection

doi:10.1109/ICCV.2017.440

Open AccessProceedings ArticleDOI

Spatial Memory for Context Reasoning in Object Detection

Xinlei Chen, +1 more

- pp 4106-4116

Chats0

TLDR

Spatial Memory Network (SMN) as mentioned in this paper assembles object instances back into a pseudo-image representation that is easy to be fed into another ConvNet for object-object context reasoning.

Abstract:

Modeling instance-level context and object-object relationships is extremely challenging. It requires reasoning about bounding boxes of different classes, locations etc. Above all, instance-level spatial reasoning inherently requires modeling conditional distributions on previous detections. Unfortunately, our current object detection systems do not have any memory to remember what to condition on! The state-of-the-art object detectors still detect all object in parallel followed by non-maximal suppression (NMS). While memory has been used for tasks such as captioning, they mostly use image-level memory cells without capturing the spatial layout. On the other hand, modeling object-object relationships requires spatial reasoning – not only do we need a memory to store the spatial layout, but also a effective reasoning module to extract spatial patterns. This paper presents a conceptually simple yet powerful solution – Spatial Memory Network (SMN), to model the instance-level context efficiently and effectively. Our spatial memory essentially assembles object instances back into a pseudo “image” representation that is easy to be fed into another ConvNet for object-object context reasoning. This leads to a new sequential reasoning architecture where image and memory are processed in parallel to obtain detections which update the memory again. We show our SMN direction is promising as it provides 2.2% improvement over baseline Faster RCNN on the COCO dataset with VGG161.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Deep Learning for Generic Object Detection: A Survey

Li Liu, +7 more

- 01 Feb 2020 -

International Journal of Computer Vision

TL;DR: A comprehensive survey of the recent achievements in this field brought about by deep learning techniques, covering many aspects of generic object detection: detection frameworks, object feature representation, object proposal generation, context modeling, training strategies, and evaluation metrics.

...read moreread less

Proceedings ArticleDOI

Relation Networks for Object Detection

Han Hu, +4 more

TL;DR: In this article, the authors propose an object relation module to model relations between objects, which is shown effective on improving object recognition and duplicate removal steps in the modern object detection pipeline.

...read moreread less

Journal ArticleDOI

Recent Advances in Deep Learning for Object Detection

Xiongwei Wu, +3 more

- 05 Jul 2020 -

Neurocomputing

TL;DR: A comprehensive survey of recent advances in visual object detection with deep learning can be found in this article, where the authors systematically analyze the existing object detection frameworks and organize the survey into three major parts: detection components, learning strategies, and applications and benchmarks.

...read moreread less

Posted Content

Relation Networks for Object Detection

Han Hu, +4 more

- 30 Nov 2017 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: An object relation module is proposed that processes a set of objects simultaneously through interaction between their appearance feature and geometry, thus allowing modeling of their relations, which gives rise to the first fully end-to-end object detector.

...read moreread less

Journal ArticleDOI

Recent advances in small object detection based on deep learning: A review

Kang Tong, +2 more

- 01 May 2020 -

Image and Vision Computing

TL;DR: This work comprehensively review the existing small object detection methods based on deep learning from five aspects, including multi-scale feature learning, data augmentation, training strategy, context-based detection and GAN- based detection.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection

Xiaodan Liang, +2 more

TL;DR: Zhang et al. as discussed by the authors proposed a deep Variation-structured Re-inforcement Learning (VRL) framework to sequentially discover object relationships and attributes in the whole image.

...read moreread less

Proceedings ArticleDOI

Reinforcement Learning for Visual Object Detection

Stefan Mathe, +2 more

TL;DR: This paper presents principled sequential models that accumulate evidence collected at a small set of image locations in order to detect visual objects effectively, formulating sequential search as reinforcement learning of the search policy (including the stopping condition).

...read moreread less

Book ChapterDOI

Contextual Priming and Feedback for Faster R-CNN

Abhinav Shrivastava, +1 more

TL;DR: This paper proposes to augment Faster R-CNN with a semantic segmentation network, and uses segmentation to provide top-down iterative feedback using two stage training, and results indicate that all three contributions improve the performance on object detection, semantic segmentsation and region proposal generation.

...read moreread less

Proceedings ArticleDOI

AttentionNet: Aggregating Weak Directions for Accurate Object Detection

Donggeun Yoo, +4 more

TL;DR: AttentionNet is presented, a novel detection method using a deep convolutional neural network, named AttentionNet, which detects objects without any separated models from the object proposal to the post bounding-box regression.

...read moreread less

Proceedings ArticleDOI

Beyond Categories: The Visual Memex Model for Reasoning About Object Relationships

Tomasz Malisiewicz, +1 more

TL;DR: An exemplar-based model of objects and their relationships is presented, the Visual Memex, that encodes both local appearance and 2D spatial context between object instances and may be the critical missing ingredient in scene understanding systems.

...read moreread less

Collapse

International Journal of Computer Vision

Spatial Memory for Context Reasoning in Object Detection

Citations

Deep Learning for Generic Object Detection: A Survey

Relation Networks for Object Detection

Recent Advances in Deep Learning for Object Detection

Relation Networks for Object Detection

Recent advances in small object detection based on deep learning: A review

References

Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection

Reinforcement Learning for Visual Object Detection

Contextual Priming and Feedback for Faster R-CNN

AttentionNet: Aggregating Weak Directions for Accurate Object Detection

Beyond Categories: The Visual Memex Model for Reasoning About Object Relationships

Related Papers (5)

Deep Residual Learning for Image Recognition

Microsoft COCO: Common Objects in Context

SSD: Single Shot MultiBox Detector

Feature Pyramid Networks for Object Detection

The Pascal Visual Object Classes (VOC) Challenge