scispace - formally typeset
Journal ArticleDOI

Motion analysis in 3D DCT domain and its application to video coding

Nikola Božinović, +1 more
- 01 Jul 2005 - 
- Vol. 20, Iss: 6, pp 510-528
Reads0
Chats0
TLDR
It is shown that global, constant-velocity, translational motion in an image sequence induces in the DCT domain spectral occupancy planes, similarly to the FT domain, however, these planes are subject to spectral folding.
Abstract
Global, constant-velocity, translational motion in an image sequence induces a characteristic energy footprint in the Fourier-transform (FT) domain; spectrum is limited to a plane with orientation defined by the direction of motion. By detecting these spectral occupancy planes, methods have been proposed to estimate such global motion. Since the discrete cosine transform (DCT) is a ubiquitous tool of all video compression standards to date, we investigate in this paper properties of motion in the DCT domain. We show that global, constant-velocity, translational motion in an image sequence induces in the DCT domain spectral occupancy planes, similarly to the FT domain. Unlike in the FT case, however, these planes are subject to spectral folding. Based on this analysis, we propose a motion estimation method in the DCT domain, and we show that results comparable to standard block matching can be obtained. Moreover, by realizing that significant energy in the DCT domain concentrates around a folded plane, we propose a new approach to video compression. The approach is based on 3D DCT applied to a group of frames, followed by motion-adaptive scanning of DCT coefficients (akin to “zig-zag” scanning in MPEG coders), their adaptive quantization, and final entropy coding. We discuss the design of the complete 3D DCT coder and we carry out a performance comparison of the new coder with ubiquitous hybrid coders.

read more

Citations
More filters
Journal ArticleDOI

Blind Image Quality Assessment: A Natural Scene Statistics Approach in the DCT Domain

TL;DR: An efficient general-purpose blind/no-reference image quality assessment (IQA) algorithm using a natural scene statistics model of discrete cosine transform (DCT) coefficients, which requires minimal training and adopts a simple probabilistic model for score prediction.
Journal ArticleDOI

Spatiotemporal Statistics for Video Quality Assessment

TL;DR: A new NR-VQA metric based on the spatiotemporal natural video statistics in 3D discrete cosine transform (3D-DCT) domain is proposed, which is universal for multiple types of distortions and robust to different databases.
Proceedings ArticleDOI

Hyperspectral Face Recognition using 3D-DCT and Partial Least Squares

TL;DR: The three dimensional Discrete Cosine Transform (3D-DCT) is proposed for feature extraction and it is shown that compared to other transforms, such as the Fourier transform, the transformed coefficients are real and thus require less data to process.
Proceedings ArticleDOI

New low complexity DCT based video compression method

TL;DR: A new video compression approach which tends to hard exploit the pertinent temporal redundancy in the video frames to improve compression efficiency with minimum processing complexity is presented.
Proceedings ArticleDOI

Distributions of 3D DCT coefficients for video

TL;DR: This study performs two goodness-of-fit tests to determine the distribution that best fits the 3D DCT coefficients of the luminance components of video sequences with low motion or structured motion and indicates that the DC coefficient can be well approximated by a Gaussian distribution and a majority of the high-energy AC coefficients can be approximating by a Gamma distribution.
References
More filters
Book

Discrete-Time Signal Processing

TL;DR: In this paper, the authors provide a thorough treatment of the fundamental theorems and properties of discrete-time linear systems, filtering, sampling, and discrete time Fourier analysis.
Journal ArticleDOI

Discrete Cosine Transform

TL;DR: In this article, a discrete cosine transform (DCT) is defined and an algorithm to compute it using the fast Fourier transform is developed, which can be used in the area of digital processing for the purposes of pattern recognition and Wiener filtering.
Journal ArticleDOI

Context-based adaptive binary arithmetic coding in the H.264/AVC video compression standard

TL;DR: Context-based adaptive binary arithmetic coding (CABAC) as a normative part of the new ITU-T/ISO/IEC standard H.264/AVC for video compression is presented, and significantly outperforms the baseline entropy coding method of H.265.
Related Papers (5)