Open Access Journal Article (DOI)

Focal Loss for Dense Object Detection

TLDR
The focal loss focuses training on a sparse set of hard examples and prevents the vast number of easy negatives from overwhelming the detector during training, improving the accuracy of one-stage detectors.
Abstract
The highest accuracy object detectors to date are based on a two-stage approach popularized by R-CNN, where a classifier is applied to a sparse set of candidate object locations. In contrast, one-stage detectors that are applied over a regular, dense sampling of possible object locations have the potential to be faster and simpler, but have trailed the accuracy of two-stage detectors thus far. In this paper, we investigate why this is the case. We discover that the extreme foreground-background class imbalance encountered during training of dense detectors is the central cause. We propose to address this class imbalance by reshaping the standard cross entropy loss such that it down-weights the loss assigned to well-classified examples. Our novel Focal Loss focuses training on a sparse set of hard examples and prevents the vast number of easy negatives from overwhelming the detector during training. To evaluate the effectiveness of our loss, we design and train a simple dense detector we call RetinaNet. Our results show that when trained with the focal loss, RetinaNet is able to match the speed of previous one-stage detectors while surpassing the accuracy of all existing state-of-the-art two-stage detectors. Code is at: https://github.com/facebookresearch/Detectron .
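The abstract describes reshaping the standard cross-entropy loss to down-weight well-classified examples. A minimal NumPy sketch of the binary focal loss, FL(p_t) = -alpha_t (1 - p_t)^gamma log(p_t), using the paper's published defaults gamma = 2 and alpha = 0.25; the function name and clipping epsilon here are illustrative, not from the authors' released code:

```python
import numpy as np

def focal_loss(p, y, gamma=2.0, alpha=0.25):
    """Binary focal loss: FL(p_t) = -alpha_t * (1 - p_t)**gamma * log(p_t).

    p: predicted foreground probabilities in (0, 1)
    y: binary labels (1 = foreground, 0 = background)
    gamma: focusing parameter; gamma = 0 recovers (alpha-weighted) cross entropy
    alpha: class-balancing weight for the foreground class
    """
    p = np.clip(p, 1e-7, 1 - 1e-7)                # avoid log(0)
    p_t = np.where(y == 1, p, 1 - p)              # probability of the true class
    alpha_t = np.where(y == 1, alpha, 1 - alpha)  # per-class balancing weight
    # (1 - p_t)**gamma shrinks the loss of well-classified (easy) examples
    return -alpha_t * (1 - p_t) ** gamma * np.log(p_t)
```

With gamma = 2, an easy negative with p_t = 0.9 contributes 100x less loss than under plain cross entropy, which is how the dense background locations stop dominating the gradient.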


Citations
Proceedings Article (DOI)

Prime Sample Attention in Object Detection

TL;DR: The notion of Prime Samples, those that play a key role in driving detection performance, is proposed, and a simple yet effective sampling and learning strategy called PrIme Sample Attention (PISA) is developed that directs the focus of the training process toward such samples.
Posted Content

SqueezeSegV2: Improved Model Structure and Unsupervised Domain Adaptation for Road-Object Segmentation from a LiDAR Point Cloud

TL;DR: This work introduces a new model, SqueezeSegV2, which is more robust to dropout noise in LiDAR point clouds and therefore achieves a significant accuracy improvement, together with a domain-adaptation training pipeline consisting of three major components: learned intensity rendering, geodesic correlation alignment, and progressive domain calibration.
Journal Article (DOI)

DeepSTORM3D: dense 3D localization microscopy and PSF design by deep learning

TL;DR: DeepSTORM3D uses deep learning for accurate three-dimensional localization of point emitters in densely labeled samples, enabling volumetric localization microscopy with high temporal resolution as well as optimal point-spread-function design.
Proceedings Article (DOI)

D2Det: Towards High Quality Object Detection and Instance Segmentation

TL;DR: A novel two-stage detection method, D2Det, that collectively addresses both precise localization and accurate classification is proposed, and a discriminative RoI pooling scheme is introduced that samples from various sub-regions of a proposal and performs adaptive weighting to obtain discriminative features.
Proceedings Article (DOI)

AugFPN: Improving Multi-Scale Feature Learning for Object Detection

TL;DR: Guo et al. propose a new feature pyramid architecture named AugFPN, which consists of three components: Consistent Supervision, Residual Feature Augmentation, and Soft RoI Selection.
References
Proceedings Article (DOI)

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors propose a residual learning framework to ease the training of networks that are substantially deeper than those used previously; the resulting model won 1st place on the ILSVRC 2015 classification task.
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The authors achieve state-of-the-art ImageNet classification performance with a deep convolutional neural network consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully connected layers with a final 1000-way softmax.
Proceedings Article (DOI)

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Book Chapter (DOI)

Microsoft COCO: Common Objects in Context

TL;DR: A new dataset is presented with the goal of advancing the state of the art in object recognition by placing it in the context of the broader question of scene understanding, gathering images of complex everyday scenes containing common objects in their natural context.
Proceedings Article (DOI)

Fully convolutional networks for semantic segmentation

TL;DR: The key insight is to build “fully convolutional” networks that take input of arbitrary size and produce correspondingly-sized output with efficient inference and learning.