scispace - formally typeset
Book ChapterDOI

VisDrone-DET2018: The Vision Meets Drone Object Detection in Image Challenge Results

Pengfei Zhu, +104 more
- pp 437-468
Reads0
Chats0
TLDR
A large-scale drone-based dataset, including 8, 599 images with rich annotations, including object bounding boxes, object categories, occlusion, truncation ratios, etc, is released, to narrow the gap between current object detection performance and the real-world requirements.
Abstract
Object detection is a hot topic with various applications in computer vision, e.g., image understanding, autonomous driving, and video surveillance. Much of the progresses have been driven by the availability of object detection benchmark datasets, including PASCAL VOC, ImageNet, and MS COCO. However, object detection on the drone platform is still a challenging task, due to various factors such as view point change, occlusion, and scales. To narrow the gap between current object detection performance and the real-world requirements, we organized the Vision Meets Drone (VisDrone2018) Object Detection in Image challenge in conjunction with the 15th European Conference on Computer Vision (ECCV 2018). Specifically, we release a large-scale drone-based dataset, including 8, 599 images (6, 471 for training, 548 for validation, and 1, 580 for testing) with rich annotations, including object bounding boxes, object categories, occlusion, truncation ratios, etc. Featuring a diverse real-world scenarios, the dataset was collected using various drone models, in different scenarios (across 14 different cities spanned over thousands of kilometres), and under various weather and lighting conditions. We mainly focus on ten object categories in object detection, i.e., pedestrian, person, car, van, bus, truck, motor, bicycle, awning-tricycle, and tricycle. Some rarely occurring special vehicles (e.g., machineshop truck, forklift truck, and tanker) are ignored in evaluation. The dataset is extremely challenging due to various factors, including large scale and pose variations, occlusion, and clutter background. We present the evaluation protocol of the VisDrone-DET2018 challenge and the comparison results of 38 detectors on the released dataset, which are publicly available on the challenge website: http://www.aiskyeye.com/. We expect the challenge to largely boost the research and development in object detection in images on drone platforms.

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Clustered Object Detection in Aerial Images

TL;DR: Zhang et al. as discussed by the authors proposed a cluster proposal sub-network (CPNet), a scale estimation sub-networks (ScaleNet), and a dedicated detection network (DetecNet) to detect small objects in aerial images.
Posted Content

Vision Meets Drones: Past, Present and Future

TL;DR: The VisDrone dataset, which is captured over various urban/suburban areas of 14 different cities across China from North to South, is described, being the largest such dataset ever published, and enables extensive evaluation and investigation of visual analysis algorithms on the drone platform.
Proceedings ArticleDOI

Density Map Guided Object Detection in Aerial Images

TL;DR: This paper proposes a Density-Map guided object detection Network (DMNet), which is inspired from the observation that the object density map of an image presents how objects distribute in terms of the pixel intensity of the map.
Proceedings ArticleDOI

SlimYOLOv3: Narrower, Faster and Better for Real-Time UAV Applications

TL;DR: SlimYOLOv3 as mentioned in this paper proposes to learn efficient deep object detectors through channel pruning of convolutional layers, which enforce channel-level sparsity of CNNs by imposing L1 regularization on channel scaling factors and prune less informative feature channels to obtain "slim" object detectors.
Journal ArticleDOI

Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study

TL;DR: The UG2+ challenge Track 2 competition in IEEE CVPR 2019 is launched, aiming to evoke a comprehensive discussion and exploration about whether and how low-level vision techniques can benefit the high-level automatic visual recognition in various scenarios.
References
More filters
Proceedings ArticleDOI

Deep Residual Learning for Image Recognition

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.
Book ChapterDOI

Microsoft COCO: Common Objects in Context

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.
Related Papers (5)