scispace - formally typeset
Open AccessProceedings ArticleDOI

Group R-CNN for Weakly Semi-supervised Object Detection with Points

Reads0
Chats0
TLDR
Group R-CNN as discussed by the authors uses instance-level proposal grouping to generate a group of proposals for each point annotation and thus can obtain a high recall rate. But this method is not suitable for weakly semi-supervised object detection with points.
Abstract
We study the problem of weakly semi-supervised object detection with points (WSSOD-P), where the training data is combined by a small set of fully annotated images with bounding boxes and a large set of weakly-labeled images with only a single point annotated for each instance. The core of this task is to train a point-to-box regressor on well-labeled images that can be used to predict credible bounding boxes for each point annotation. We challenge the prior belief that existing CNN-based detectors are not compatible with this task. Based on the classic R-CNN architecture, we propose an effective point-to-box regressor: Group R-CNN. Group R-CNN first uses instance-level proposal grouping to generate a group of proposals for each point annotation and thus can obtain a high recall rate. To better distinguish different instances and improve precision, we propose instance-level proposal assignment to replace the vanilla assignment strategy adopted in original R-CNN methods. As naive instance-level assignment brings converging difficulty, we propose instance aware representation learning which consists of instance aware feature enhancement and instance-aware parameter generation to overcome this issue. Comprehensive experiments on the MS-COCO benchmark demonstrate the effectiveness of our method. Specifically, Group R-CNN significantly outperforms the prior method Point DETR by 3.9 mAP with 5% well-labeled images, which is the most challenging scenario. The source code can be found at https://github.com/jshilong/GroupRCNN.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Research on YOLOv7-based defect detection method for automotive running lights

TL;DR: Wang et al. as discussed by the authors proposed a YOLOv7-based method for detecting defects in automotive running lights, which achieved an accuracy of 89.7% on the self-built dataset, while the image inference speed was maintained at 61 frames per second.
Journal ArticleDOI

Multi-Object Multi-Camera Tracking Based on Deep Learning for Intelligent Transportation: A Review

Bing Han
- 01 Apr 2023 - 
TL;DR: In this article , a comprehensive review of multi-object multi-camera tracking based on deep learning for intelligent transportation is provided, where the main object detectors for MOMCT are introduced in detail.
Book ChapterDOI

Weakly Semi-supervised Detection in Lung Ultrasound Videos

TL;DR: In this paper , the authors proposed a method for improving object detection in medical videos through weak supervision from video-level labels, and applied this approach to the clinically important task of detecting lung consolidations (seen in respiratory infections such as COVID-19 pneumonia) in medical ultrasound videos.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings ArticleDOI

Feature Pyramid Networks for Object Detection

TL;DR: This paper exploits the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost and achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles.
Journal ArticleDOI

The Pascal Visual Object Classes (VOC) Challenge

TL;DR: The state-of-the-art in evaluated methods for both classification and detection are reviewed, whether the methods are statistically different, what they are learning from the images, and what the methods find easy or confuse.
Proceedings ArticleDOI

Focal Loss for Dense Object Detection

TL;DR: This paper proposes to address the extreme foreground-background class imbalance encountered during training of dense detectors by reshaping the standard cross entropy loss such that it down-weights the loss assigned to well-classified examples, and develops a novel Focal Loss, which focuses training on a sparse set of hard examples and prevents the vast number of easy negatives from overwhelming the detector during training.
Proceedings ArticleDOI

YOLO9000: Better, Faster, Stronger

TL;DR: YOLO9000 as discussed by the authors is a state-of-the-art real-time object detection system that can detect over 9000 object categories in real time using a novel multi-scale training method, offering an easy tradeoff between speed and accuracy.
Related Papers (5)