scispace - formally typeset
Open Access

Pyramidal implementation of the Lucas Kanade feature tracker description of the algorithm

Reads0
Chats0
TLDR
It is essential to define the notion of similarity in a 2D neighborhood sense and the image velocity d is defined as being the vector that minimizes the residual function defined as follows.
Abstract
1 Problem Statement Let I and J be two 2D grayscaled images. The two quantities I(x) = I(x, y) and J(x) = J(x, y) are then the grayscale value of the two images are the location x = [x y] , where x and y are the two pixel coordinates of a generic image point x. The image I will sometimes be referenced as the first image, and the image J as the second image. For practical issues, the images I and J are discret function (or arrays), and the upper left corner pixel coordinate vector is [0 0] . Let nx and ny be the width and height of the two images. Then the lower right pixel coordinate vector is [nx − 1 ny − 1] . Consider an image point u = [ux uy] on the first image I. The goal of feature tracking is to find the location v = u + d = [ux+dx uy +dy] on the second image J such as I(u) and J(v) are “similar”. The vector d = [dx dy] is the image velocity at x, also known as the optical flow at x. Because of the aperture problem, it is essential to define the notion of similarity in a 2D neighborhood sense. Let ωx and ωy two integers. We define the image velocity d as being the vector that minimizes the residual function defined as follows:

read more

Content maybe subject to copyright    Report

Citations
More filters
Proceedings ArticleDOI

Are we ready for autonomous driving? The KITTI vision benchmark suite

TL;DR: The autonomous driving platform is used to develop novel challenging benchmarks for the tasks of stereo, optical flow, visual odometry/SLAM and 3D object detection, revealing that methods ranking high on established datasets such as Middlebury perform below average when being moved outside the laboratory to the real world.
Journal ArticleDOI

Tracking-Learning-Detection

TL;DR: A novel tracking framework (TLD) that explicitly decomposes the long-term tracking task into tracking, learning, and detection, and develops a novel learning method (P-N learning) which estimates the errors by a pair of “experts”: P-expert estimates missed detections, and N-ex Expert estimates false alarms.
Proceedings ArticleDOI

Forward-Backward Error: Automatic Detection of Tracking Failures

TL;DR: It is demonstrated that the proposed error enables reliable detection of tracking failures and selection of reliable trajectories in video sequences and is complementary to commonly used normalized cross-correlation (NCC).
Proceedings ArticleDOI

Learning to Track: Online Multi-object Tracking by Decision Making

TL;DR: This work forms the online MOT problem as decision making in Markov Decision Processes (MDPs), where the lifetime of an object is modeled with a MDP, and a similarity function for data association is equivalent to learning a policy for the MDP.
Journal ArticleDOI

Full-frame video stabilization with motion inpainting

TL;DR: This work proposes a practical and robust approach of video stabilization that produces full-frame stabilized videos with good visual quality and develops a complete video stabilizer which can naturally keep the original image quality in the stabilized videos.
References
More filters
Proceedings ArticleDOI

Good features to track

TL;DR: A feature selection criterion that is optimal by construction because it is based on how the tracker works, and a feature monitoring method that can detect occlusions, disocclusions, and features that do not correspond to points in the world are proposed.