scispace - formally typeset
Proceedings ArticleDOI

Dot Distance for Tiny Object Detection in Aerial Images

TLDR
DotD as discussed by the authors is defined as normalized Euclidean distance between the center points of two bounding boxes, which is a new metric for tiny object detection where anchor-based and anchor-free detectors are used.
Abstract
Object detection has achieved great progress with the development of anchor-based and anchor-free detectors. However, the detection of tiny objects is still challenging due to the lack of appearance information. In this paper, we observe that Intersection over Union (IoU), the most widely used metric in object detection, is sensitive to slight offsets between predicted bounding boxes and ground truths when detecting tiny objects. Although some new metrics such as GIoU, DIoU and CIoU are proposed, their performance on tiny object detection is still below the expected level by a large margin. In this paper, we propose a simple but effective new metric called Dot Distance (DotD) for tiny object detection where DotD is defined as normalized Euclidean distance between the center points of two bounding boxes. Extensive experiments on tiny object detection dataset show that anchor-based detectors’ performance is highly improved over their baselines with the application of DotD.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Towards Large-Scale Small Object Detection: Survey and Benchmarks

TL;DR: Two large-scale Small Object Detection dAtasets (SODA), SODA-D and S ODA-A, which focus on the Driving and Aerial scenarios respectively are constructed, and the performance of mainstream methods on SOD a is evaluated.
Proceedings ArticleDOI

RFLA: Gaussian Receptive Field based Label Assignment for Tiny Object Detection

TL;DR: A Gaussian Receptive Field based Label Assignment (RFLA) strategy for tiny object detection that outperforms the state-of-the-art competitors with 4.0 AP points on the AI-TOD dataset and designs a Hierarchical Label assignment module based on RFD to achieve balanced learning for tiny objects.
Journal ArticleDOI

Detecting tiny objects in aerial images: A normalized Wasserstein distance and a new benchmark

TL;DR: Zhang et al. as discussed by the authors proposed Normalized Wasserstein Distance (NWD) and RanKing-based Assignment (RKA) strategy for tiny object detection.
Journal ArticleDOI

Automatic Detection and Segmentation of Thrombi in Abdominal Aortic Aneurysms Using a Mask Region-Based Convolutional Neural Network with Optimized Loss Functions

TL;DR: This work proposes a fully automated method for the whole process of the detection and segmentation of thrombi, which is based on a well-established mask region-based convolutional neural network (Mask R-CNN) framework that is improved with optimized loss functions.
Journal ArticleDOI

RFLA: Gaussian Receptive Field Based Label Assignment for Tiny Object Detection

TL;DR: Zhang et al. as discussed by the authors proposed a Gaussian Receptive Field based Label Assignment (RFLA) strategy for tiny object detection, which utilizes the prior information that the feature receptive field follows Gaussian distribution.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.
Book ChapterDOI

Microsoft COCO: Common Objects in Context

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.
Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.
Journal ArticleDOI

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

TL;DR: This work introduces a Region Proposal Network (RPN) that shares full-image convolutional features with the detection network, thus enabling nearly cost-free region proposals and further merge RPN and Fast R-CNN into a single network by sharing their convolutionAL features.
Related Papers (5)