Active Object Localization with Deep Reinforcement Learning

doi:10.1109/ICCV.2015.286

Open AccessProceedings ArticleDOI

Active Object Localization with Deep Reinforcement Learning

- pp 2488-2496

TLDR

In this paper, an active detection model is proposed for localizing objects in scenes, which allows an agent to focus attention on candidate regions for identifying the correct location of a target object.

Abstract:

We present an active detection model for localizing objects in scenes. The model is class-specific and allows an agent to focus attention on candidate regions for identifying the correct location of a target object. This agent learns to deform a bounding box using simple transformation actions, with the goal of determining the most specific location of target objects following top-down reasoning. The proposed localization agent is trained using deep reinforcement learning, and evaluated on the Pascal VOC 2007 dataset. We show that agents guided by the proposed model are able to localize a single instance of an object after analyzing only between 11 and 25 regions in an image, and obtain the best detection results among systems that do not use object proposals for object localization.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Attention to Scale: Scale-Aware Semantic Image Segmentation

Liang-Chieh Chen, +4 more

TL;DR: Zhang et al. as discussed by the authors propose an attention mechanism that learns to softly weight the multi-scale features at each pixel location, which not only outperforms average and max-pooling, but also allows diagnostically visualize the importance of features at different positions and scales.

...read moreread less

Posted Content

Deep Reinforcement Learning: An Overview

Yuxi Li

- 25 Jan 2017 -

arXiv: Learning

TL;DR: This work discusses core RL elements, including value function, in particular, Deep Q-Network (DQN), policy, reward, model, planning, and exploration, and important mechanisms for RL, including attention and memory, unsupervised learning, transfer learning, multi-agent RL, hierarchical RL, and learning to learn.

...read moreread less

Posted Content

Attention to Scale: Scale-aware Semantic Image Segmentation

Liang-Chieh Chen, +4 more

- 10 Nov 2015 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: An attention mechanism that learns to softly weight the multi-scale features at each pixel location is proposed, which not only outperforms averageand max-pooling, but allows us to diagnostically visualize the importance of features at different positions and scales.

...read moreread less

Proceedings ArticleDOI

Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning

Sangdoo Yun, +4 more

TL;DR: Through evaluation of the OTB dataset, the proposed tracker is validated to achieve a competitive performance that is three times faster than state-of-the-art, deep network–based trackers.

...read moreread less

Proceedings Article

Runtime Neural Pruning

Ji Lin, +3 more

TL;DR: A Runtime Neural Pruning (RNP) framework which prunes the deep neural network dynamically at the runtime and preserves the full ability of the original network and conducts pruning according to the input image and current feature maps adaptively.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

...read moreread less

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

Book

Reinforcement Learning: An Introduction

Richard S. Sutton, +1 more

TL;DR: This book provides a clear and simple account of the key ideas and algorithms of reinforcement learning, which ranges from the history of the field's intellectual foundations to the most recent developments and applications.

...read moreread less

Journal Article

Dropout: a simple way to prevent neural networks from overfitting

Nitish Srivastava, +4 more

- 01 Jan 2014 -

Journal of Machine Learning Research

TL;DR: It is shown that dropout improves the performance of neural networks on supervised learning tasks in vision, speech recognition, document classification and computational biology, obtaining state-of-the-art results on many benchmark data sets.

...read moreread less

Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015 -

International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

...read moreread less

Collapse

Active Object Localization with Deep Reinforcement Learning

Citations

Attention to Scale: Scale-Aware Semantic Image Segmentation

Deep Reinforcement Learning: An Overview

Attention to Scale: Scale-aware Semantic Image Segmentation

Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning

Runtime Neural Pruning

References

Very Deep Convolutional Networks for Large-Scale Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

Reinforcement Learning: An Introduction

Dropout: a simple way to prevent neural networks from overfitting

ImageNet Large Scale Visual Recognition Challenge

Related Papers (5)

Human-level control through deep reinforcement learning

Deep Residual Learning for Image Recognition

Very Deep Convolutional Networks for Large-Scale Image Recognition

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Reinforcement Learning: An Introduction