Group R-CNN for Weakly Semi-supervised Object Detection with Points

doi:10.1109/cvpr52688.2022.00920

Open Access · Proceedings Article

TLDR

Group R-CNN is a point-to-box regressor for weakly semi-supervised object detection with points. It uses instance-level proposal grouping to generate a group of proposals for each point annotation, which yields a high recall rate, and it outperforms the prior method Point DETR on MS-COCO.

Abstract:

We study the problem of weakly semi-supervised object detection with points (WSSOD-P), where the training data consists of a small set of fully annotated images with bounding boxes and a large set of weakly labeled images with only a single point annotated for each instance. The core of this task is to train a point-to-box regressor on well-labeled images that can be used to predict credible bounding boxes for each point annotation. We challenge the prior belief that existing CNN-based detectors are not compatible with this task. Based on the classic R-CNN architecture, we propose an effective point-to-box regressor: Group R-CNN. Group R-CNN first uses instance-level proposal grouping to generate a group of proposals for each point annotation and can thus obtain a high recall rate. To better distinguish different instances and improve precision, we propose instance-level proposal assignment to replace the vanilla assignment strategy adopted in the original R-CNN methods. As naive instance-level assignment makes convergence difficult, we propose instance-aware representation learning, which consists of instance-aware feature enhancement and instance-aware parameter generation, to overcome this issue. Comprehensive experiments on the MS-COCO benchmark demonstrate the effectiveness of our method. Specifically, Group R-CNN significantly outperforms the prior method Point DETR by 3.9 mAP with 5% well-labeled images, which is the most challenging scenario. The source code can be found at https://github.com/jshilong/GroupRCNN.
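To make the idea of grouping proposals per point annotation more concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes, purely for illustration, that a proposal joins a point's group when the box contains the point, and that each group is labeled only against the ground-truth box of its own instance using an IoU threshold. The function names (`group_proposals_by_point`, `assign_within_group`), the containment rule, and the `pos_iou` threshold are all hypothetical; the abstract does not specify these details.

```python
from typing import Dict, List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)
Point = Tuple[float, float]              # (x, y)


def group_proposals_by_point(points: List[Point],
                             proposals: List[Box]) -> Dict[int, List[int]]:
    """Map each point index to the indices of proposals containing that point.

    Assumption: containment is the grouping rule (illustrative only).
    """
    groups: Dict[int, List[int]] = {i: [] for i in range(len(points))}
    for p_idx, (x1, y1, x2, y2) in enumerate(proposals):
        for i, (px, py) in enumerate(points):
            if x1 <= px <= x2 and y1 <= py <= y2:
                groups[i].append(p_idx)
    return groups


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def assign_within_group(group: List[int], proposals: List[Box],
                        gt_box: Box, pos_iou: float = 0.5) -> List[int]:
    """Label each proposal in a group against that instance's box only.

    Assumption: 1 = positive if IoU >= pos_iou, else 0 (hypothetical threshold).
    """
    return [1 if iou(proposals[j], gt_box) >= pos_iou else 0 for j in group]


# Toy data: two annotated points, three proposals, one ground-truth box per point.
points = [(10.0, 10.0), (50.0, 50.0)]
proposals = [(0.0, 0.0, 20.0, 20.0), (5.0, 5.0, 60.0, 60.0), (40.0, 40.0, 70.0, 70.0)]
gt_boxes = [(0.0, 0.0, 20.0, 20.0), (40.0, 40.0, 70.0, 70.0)]

groups = group_proposals_by_point(points, proposals)
labels = {i: assign_within_group(g, proposals, gt_boxes[i]) for i, g in groups.items()}
print(groups)  # {0: [0, 1], 1: [1, 2]}
print(labels)  # {0: [1, 0], 1: [0, 1]}
```

In this toy case the large proposal at index 1 lands in both groups, but it is a negative in each because it is scored against one instance at a time. This is only meant to show why per-instance grouping and labeling can help separate nearby objects; the paper's actual grouping and assignment rules may differ.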
