scispace - formally typeset
Open AccessProceedings ArticleDOI

MV-YOLO: Motion Vector-Aided Tracking by Semantic Object Detection

Reads0
Chats0
TLDR
A hybrid tracker that leverages motion information from the compressed video stream and a general-purpose semantic object detector acting on decoded frames to construct a fast and efficient tracking engine is presented.
Abstract
Object tracking is the cornerstone of many visual analytics systems. While considerable progress has been made in this area in recent years, robust, efficient, and accurate tracking in real-world video remains a challenge. In this paper, we present a hybrid tracker that leverages motion information from the compressed video stream and a general-purpose semantic object detector acting on decoded frames to construct a fast and efficient tracking engine. The proposed approach is compared with several well-known recent trackers on the OTB tracking dataset. The results indicate advantages of the proposed method in terms of speed and/or accuracy. Other desirable features of the proposed method are its simplicity and deployment efficiency, which stems from the fact that it reuses the resources and information that may already exist in the system for other reasons.

read more

Citations
More filters
Journal ArticleDOI

A Fast Learning Method for Accurate and Robust Lane Detection Using Two-Stage Feature Extraction with YOLO v3.

TL;DR: An adaptive lane feature learning algorithm which can automatically learn the features of a lane in various scenarios is proposed, and the accuracy and speed of the second-stage model reached a high level.
Book ChapterDOI

Enabling Pedestrian Safety using Computer Vision Techniques: A Case Study of the 2018 Uber Inc. Self-driving Car Crash

TL;DR: This work analyzes the Uber car crash and sheds light on the question, “Could the Uber Car Crash have been avoided?” and applies state-of-the-art Computer Vision models to this highly practical scenario.
Journal ArticleDOI

A lightweight Tiny-YOLOv3 vehicle detection approach

TL;DR: A lightweight real-time vehicle detection model developed to run on common computing devices using the pre-trained Tiny-YOLOv3 network and subsequently pruned and simplified by training on the BIT-vehicle dataset, and excluding some of the unnecessary layers.
Journal ArticleDOI

Real-Time Online Multi-Object Tracking in Compressed Domain

TL;DR: This work divides the frames into key and non-key frames and track objects in the compressed domain and proposes an appearance convolutional neural network which can be jointly trained with the detector to make data association more reliable.
Book ChapterDOI

Enabling Pedestrian Safety Using Computer Vision Techniques: A Case Study of the 2018 Uber Inc. Self-driving Car Crash

TL;DR: In this article, the authors analyzed the Uber car crash and shed light on the question, "Could the Uber Car Crash have been avoided?" They applied state-of-the-art Computer Vision models to this highly practical scenario.
References
More filters
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Proceedings ArticleDOI

You Only Look Once: Unified, Real-Time Object Detection

TL;DR: Compared to state-of-the-art detection systems, YOLO makes more localization errors but is less likely to predict false positives on background, and outperforms other detection methods, including DPM and R-CNN, when generalizing from natural images to other domains like artwork.
Posted Content

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

TL;DR: Faster R-CNN as discussed by the authors proposes a Region Proposal Network (RPN) to generate high-quality region proposals, which are used by Fast R-NN for detection.
Proceedings ArticleDOI

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

TL;DR: RCNN as discussed by the authors combines CNNs with bottom-up region proposals to localize and segment objects, and when labeled training data is scarce, supervised pre-training for an auxiliary task, followed by domain-specific fine-tuning, yields a significant performance boost.
Related Papers (5)