scispace - formally typeset
Topic

Block-matching algorithm

About: Block-matching algorithm is a(n) research topic. Over the lifetime, 9590 publication(s) have been published within this topic receiving 165336 citation(s).
Papers
More filters

Proceedings Article
01 Jan 2004-
TL;DR: A system that estimates the motion of a stereo head or a single moving camera based on video input in real-time with low delay and the motion estimates are used for navigational purposes.
Abstract: We present a system that estimates the motion of a stereo head or a single moving camera based on video input. The system operates in real-time with low delay and the motion estimates are used for navigational purposes. The front end of the system is a feature tracker. Point features are matched between pairs of frames and linked into image trajectories at video rate. Robust estimates of the camera motion are then produced from the feature tracks using a geometric hypothesize-and-test architecture. This generates what we call visual odometry, i.e. motion estimates from visual input alone. No prior knowledge of the scene nor the motion is necessary. The visual odometry can also be used in conjunction with information from other sources such as GPS, inertia sensors, wheel encoders, etc. The pose estimation method has been applied successfully to video from aerial, automotive and handheld platforms. We focus on results with an autonomous ground vehicle. We give examples of camera trajectories estimated purely from images over previously unseen distances and periods of time.

1,657 citations


Book
01 Aug 1995-
TL;DR: Digital Video Processing, Second Edition, reflects important advances in image processing, computer vision, and video compression, including new applications such as digital cinema, ultra-high-resolution video, and 3D video.
Abstract: Over the years, thousands of engineering students and professionals relied on Digital Video Processing as the definitive, in-depth guide to digital image and video processing technology. Now, Dr. A. Murat Tekalp has completely revamped the first edition to reflect todays technologies, techniques, algorithms, and trends. Digital Video Processing, Second Edition, reflects important advances in image processing, computer vision, and video compression, including new applications such as digital cinema, ultra-high-resolution video, and 3D video. This edition offers rigorous, comprehensive, balanced, and quantitative coverage of image filtering, motion estimation, tracking, segmentation, video filtering, and compression. Now organized and presented as a true tutorial, it contains updated problem sets and new MATLAB projects in every chapter. Coverage includes Multi-dimensional signals/systems: transforms, sampling, and lattice conversion Digital images and video: human vision, analog/digital video, and video quality Image filtering: gradient estimation, edge detection, scaling, multi-resolution representations, enhancement, de-noising, and restoration Motion estimation: image formation; motion models; differential, matching, optimization, and transform-domain methods; and 3D motion and shape estimation Video segmentation: color and motion segmentation, change detection, shot boundary detection, video matting, video tracking, and performance evaluation Multi-frame filtering: motion-compensated filtering, multi-frame standards conversion, multi-frame noise filtering, restoration, and super-resolution Image compression: lossless compression, JPEG, wavelets, and JPEG2000 Video compression: early standards, ITU-T H.264/MPEG-4 AVC, HEVC, Scalable Video Compression, and stereo/multi-view approaches

1,349 citations


Journal ArticleDOI
03 Jan 1993-Multimedia Systems
TL;DR: A twin-comparison approach has been developed to solve the problem of detecting transitions implemented by special effects, and a motion analysis algorithm is applied to determine whether an actual transition has occurred.
Abstract: Partitioning a video source into meaningful segments is an important step for video indexing. We present a comprehensive study of a partitioning system that detects segment boundaries. The system is based on a set of difference metrics and it measures the content changes between video frames. A twin-comparison approach has been developed to solve the problem of detecting transitions implemented by special effects. To eliminate the false interpretation of camera movements as transitions, a motion analysis algorithm is applied to determine whether an actual transition has occurred. A technique for determining the threshold for a difference metric and a multi-pass approach to improve the computation speed and accuracy have also been developed.

1,345 citations


Proceedings ArticleDOI
07 Dec 2015-
Abstract: Real-world videos often have complex dynamics, methods for generating open-domain video descriptions should be senstive to temporal structure and allow both input (sequence of frames) and output (sequence of words) of variable length. To approach this problem we propose a novel end-to-end sequence-to-sequence model to generate captions for videos. For this we exploit recurrent neural networks, specifically LSTMs, which have demonstrated state-of-the-art performance in image caption generation. Our LSTM model is trained on video-sentence pairs and learns to associate a sequence of video frames to a sequence of words in order to generate a description of the event in the video clip. Our model naturally is able to learn the temporal structure of the sequence of frames as well as the sequence model of the generated sentences, i.e. a language model. We evaluate several variants of our model that exploit different visual features on a standard set of YouTube videos and two movie description datasets (M-VAD and MPII-MD).

1,149 citations


Journal ArticleDOI
TL;DR: A novel observation model based on motion compensated subsampling is proposed for a video sequence and Bayesian restoration with a discontinuity-preserving prior image model is used to extract a high-resolution video still given a short low-resolution sequence.
Abstract: The human visual system appears to be capable of temporally integrating information in a video sequence in such a way that the perceived spatial resolution of a sequence appears much higher than the spatial resolution of an individual frame. While the mechanisms in the human visual system that do this are unknown, the effect is not too surprising given that temporally adjacent frames in a video sequence contain slightly different, but unique, information. This paper addresses the use of both the spatial and temporal information present in a short image sequence to create a single high-resolution video frame. A novel observation model based on motion compensated subsampling is proposed for a video sequence. Since the reconstruction problem is ill-posed, Bayesian restoration with a discontinuity-preserving prior image model is used to extract a high-resolution video still given a short low-resolution sequence. Estimates computed from a low-resolution image sequence containing a subpixel camera pan show dramatic visual and quantitative improvements over bilinear, cubic B-spline, and Bayesian single frame interpolations. Visual and quantitative improvements are also shown for an image sequence containing objects moving with independent trajectories. Finally, the video frame extraction algorithm is used for the motion-compensated scan conversion of interlaced video data, with a visual comparison to the resolution enhancement obtained from progressively scanned frames.

1,025 citations


Network Information
Related Topics (5)
Multiview Video Coding

9.7K papers, 198.8K citations

93% related
Context-adaptive variable-length coding

3.1K papers, 70.2K citations

92% related
Coding tree unit

7K papers, 133.5K citations

92% related
Quarter-pixel motion

8.4K papers, 146.2K citations

92% related
Context-adaptive binary arithmetic coding

8.2K papers, 170.7K citations

92% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20218
202010
201916
201843
2017145
2016261

Top Attributes

Show by:

Topic's top 5 most impactful authors

Marta Karczewicz

48 papers, 1.7K citations

Wen Gao

29 papers, 584 citations

Oscar C. Au

28 papers, 416 citations

Aggelos K. Katsaggelos

26 papers, 430 citations

Edward J. Delp

25 papers, 315 citations