Vision Meets Drones: Past, Present and Future

Open AccessPosted Content

Vision Meets Drones: Past, Present and Future

Pengfei Zhu, +5 more

- 16 Jan 2020 -

arXiv: Computer Vision and Pattern Recog...

Chats0

TLDR

The VisDrone dataset, which is captured over various urban/suburban areas of 14 different cities across China from North to South, is described, being the largest such dataset ever published, and enables extensive evaluation and investigation of visual analysis algorithms on the drone platform.

Abstract:

Drones, or general UAVs, equipped with cameras have been fast deployed with a wide range of applications, including agriculture, aerial photography, and surveillance. Consequently, automatic understanding of visual data collected from drones becomes highly demanding, bringing computer vision and drones more and more closely. To promote and track the evelopments of object detection and tracking algorithms, we have organized two challenge workshops in conjunction with ECCV 2018, and ICCV 2019, attracting more than 100 teams around the world. We provide a large-scale drone captured dataset, VisDrone, which includes four tracks, i.e., (1) image object detection, (2) video object detection, (3) single object tracking, and (4) multi-object tracking. In this paper, we first presents a thorough review of object detection and tracking datasets and benchmarks, and discuss the challenges of collecting large-scale drone-based object detection and tracking datasets with fully manual annotations. After that, we describe our VisDrone dataset, which is captured over various urban/suburban areas of 14 different cities across China from North to South. Being the largest such dataset ever published, VisDrone enables extensive evaluation and investigation of visual analysis algorithms on the drone platform. We provide a detailed analysis of the current state of the field of large-scale object detection and tracking on drones, and conclude the challenge as well as propose future directions. We expect the benchmark largely boost the research and development in video analysis on drone platforms. All the datasets and experimental results can be downloaded from the website: this https URL.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

HOTA: A Higher Order Metric for Evaluating Multi-Object Tracking

Jonathon Luiten, +6 more

- 16 Sep 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a novel MOT evaluation metric, higher order tracking accuracy (HOTA), which explicitly balances the effect of performing accurate detection, association and localization into a single unified metric for comparing trackers.

...read moreread less

Journal ArticleDOI

HOTA: A Higher Order Metric for Evaluating Multi-object Tracking.

Jonathon Luiten, +7 more

- 01 Feb 2021 -

International Journal of Computer Vision

TL;DR: Higher order tracking accuracy (HOTA) as mentioned in this paper is proposed to explicitly balance the effect of performing accurate detection, association and localization into a single unified metric for comparing trackers, which is able to capture important aspects of MOT performance not previously taken into account by established metrics.

...read moreread less

Journal ArticleDOI

A Survey on Cellular-connected UAVs: Design Challenges, Enabling 5G/B5G Innovations, and Experimental Advancements

Debashisha Mishra, +1 more

- 09 Dec 2020 -

Computer Networks

TL;DR: In this article, the authors present an in-depth exploration of integration synergies between 5G/B5G cellular systems and UAV technology, where the UAV is integrated as a new aerial user equipment (UE) to already deployed cellular networks.

...read moreread less

Posted ContentDOI

Deep Learning for UAV-based Object Detection and Tracking: A Survey

Xin Wu, +4 more

- 25 Oct 2021 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A comprehensive survey on the research progress and prospects of DL-based UAV object detection and tracking methods can be found in this article, where the authors outline the challenges, statistics of existing methods, and provide solutions from the perspectives of deep learning-based models in three research topics: object detection from the image and video, and object tracking from the video.

...read moreread less

Journal ArticleDOI

Automatic Person Detection in Search and Rescue Operations Using Deep CNN Detectors

Sasa Sambolek, +1 more

- 04 Mar 2021 -

IEEE Access

TL;DR: In this article, the reliability of existing state-of-the-art detectors such as Faster R-CNN, YOLOv4, RetinaNet, and Cascade RCNN on a VisDrone benchmark and custom-made dataset SARD build to simulate rescue scenes was investigated.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

Proceedings ArticleDOI

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

...read moreread less

Journal ArticleDOI

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015 -

International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

...read moreread less

Book ChapterDOI

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin, +7 more

TL;DR: A new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding by gathering images of complex everyday scenes containing common objects in their natural context.

...read moreread less

Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

Joseph Redmon, +3 more

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.

...read moreread less

Collapse

arXiv: Computer Vision and Pattern Recog...

SSD: Single Shot MultiBox Detector

Wei Liu, +6 more

Feature Pyramid Networks for Object Detection

Tsung-Yi Lin, +5 more

Vision Meets Drones: Past, Present and Future

Citations

HOTA: A Higher Order Metric for Evaluating Multi-Object Tracking

HOTA: A Higher Order Metric for Evaluating Multi-object Tracking.

A Survey on Cellular-connected UAVs: Design Challenges, Enabling 5G/B5G Innovations, and Experimental Advancements

Deep Learning for UAV-based Object Detection and Tracking: A Survey

Automatic Person Detection in Search and Rescue Operations Using Deep CNN Detectors

References

ImageNet: A large-scale hierarchical image database

Histograms of oriented gradients for human detection

ImageNet Large Scale Visual Recognition Challenge

Microsoft COCO: Common Objects in Context

You Only Look Once: Unified, Real-Time Object Detection

Related Papers (5)

Microsoft COCO: Common Objects in Context

Deep Residual Learning for Image Recognition

YOLOv3: An Incremental Improvement.

SSD: Single Shot MultiBox Detector

Feature Pyramid Networks for Object Detection