SC-RPN: A Strong Correlation Learning Framework for Region Proposal

doi:10.1109/TIP.2021.3069547

Open AccessJournal ArticleDOI

SC-RPN: A Strong Correlation Learning Framework for Region Proposal

Wenbin Zou, +5 more

- 05 Apr 2021 -

IEEE Transactions on Image Processing

- Vol. 30, pp 4084-4098

Chats0

TLDR

Wang et al. as discussed by the authors proposed a two-stage strong correlation learning framework, abbreviated as SC-RPN, which aims to set up stronger relationship among different modules in the region proposal task.

Abstract:

Current state-of-the-art two-stage detectors heavily rely on region proposals to guide the accurate detection for objects. In previous region proposal approaches, the interaction between different functional modules is correlated weakly, which limits or decreases the performance of region proposal approaches. In this paper, we propose a novel two-stage strong correlation learning framework, abbreviated as SC-RPN, which aims to set up stronger relationship among different modules in the region proposal task. Firstly, we propose a Light-weight IoU-Mask branch to predict intersection-over-union (IoU) mask and refine region classification scores as well, it is used to prevent high-quality region proposals from being filtered. Furthermore, a sampling strategy named Size-Aware Dynamic Sampling (SADS) is proposed to ensure sampling consistency between different stages. In addition, point-based representation is exploited to generate region proposals with stronger fitting ability. Without bells and whistles, SC-RPN achieves AR1000 14.5% higher than that of Region Proposal Network (RPN), surpassing all the existing region proposal approaches. We also integrate SC-RPN into Fast R-CNN and Faster R-CNN to test its effectiveness on object detection task, the experimental results achieve a gain of 3.2% and 3.8% in terms of mAP compared to the original ones.

Figures

Fig. 3. The generation of supervision information and the training phase of Light-weight IoU-Mask branch. As a result, red points in ground-truth IoU map denote the ground-truth of training samples, while white points denote the irrelevant samples.

Fig. 2. The training phase of SC-RPN with Size-Aware Dynamic Sampling (SADS). In the first stage, positive samples are output and all of them are regarded as regression samples. Then in the second stage, positive and negative samples are output, among which all the positive samples are regarded as regression samples, while all the positive samples and the negative samples after random sampling are regarded as classification samples.

TABLE II PERFORMANCE OF OBJECT DETECTORS ON MS COCO 2017 TEST-DEV SPLIT.

TABLE I PERFORMANCE OF REGION PROPOSAL APPROACHES ON MS COCO 2017 VAL SPLIT.

TABLE IV ABLATION ANALYSIS OF DIFFERENT SAMPLING STRATEGY COMBINATIONS IN SADS. POSR AND SAT DENOTE POSITIVE REGION ASSIGNER AND SIZE-AWARE THRESHOLD ASSIGNER, RESPECTIVELY.

TABLE III COMPONENT-WISE PERFORMANCE OF SC-RPN.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Diagnosis of Alzheimer’s disease via an attention-based multi-scale convolutional neural network

- 01 Feb 2022 -

Knowledge Based Systems

TL;DR: Wang et al. as mentioned in this paper proposed a multi-scale convolutional neural network (MSCNet) to enhance the model's feature representation ability, and a channel attention mechanism was introduced to improve the interdependence between channels and adaptively recalibrate the channel direction's characteristic response.

...read moreread less

Journal ArticleDOI

An effective risk identification method for power fence operation based on neighborhood correlation network and vector calculation

Ma Fuqi, +6 more

- 01 Nov 2021 -

Energy Reports

TL;DR: In this paper, the authors proposed a risk identification method based on neighborhood correlation networks and vector calculation to determine the safety area of a power operation site by detecting the key targets of fence operation.

...read moreread less

Journal ArticleDOI

Deep learning based object detection for resource constrained devices: Systematic review, future trends and challenges ahead

Vidya Kamath, +1 more

- 01 Feb 2023 -

Neurocomputing

TL;DR: In this paper , the authors identify current trends in resource-constrained applications for deep learning-based object detectors in terms of the technique used to create the model, the type of input image involved, the device used, and the specific application addressed by the model.

...read moreread less

Journal ArticleDOI

TransWeaver: Weave Image Pairs for Class Agnostic Common Object Detection

- 01 Jan 2023 -

IEEE Transactions on Image Processing

TL;DR: TransWeaver as mentioned in this paper takes image pairs as input and flexibly captures the inherent correlation between candidate objects from two images, which can further describe the commonality of image pairs at the object level.

...read moreread less

Journal ArticleDOI

TransWeaver: Weave Image Pairs for Class Agnostic Common Object Detection

Xiaoqian Guo, +3 more

- 17 May 2023 -

IEEE Transactions on Image Processing

TL;DR: TransWeaver as discussed by the authors takes image pairs as input and flexibly captures the inherent correlation between candidate objects from two images, which can further describe the commonality of image pairs at the object level.

...read moreread less

References

PDF

Open Access

More filters

Book ChapterDOI

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.

...read moreread less

Proceedings ArticleDOI

Fully convolutional networks for semantic segmentation

Jonathan Long, +2 more

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.

...read moreread less

Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, +3 more

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

Journal ArticleDOI

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Shaoqing Ren, +3 more

- 01 Jun 2017 -

IEEE Transactions on Pattern Analysis an...

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.

...read moreread less

Proceedings ArticleDOI

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick, +3 more

TL;DR: RCNN as discussed by the authors combines CNNs with bottom-up region proposals to localize and segment objects, and when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost.

...read moreread less