scispace - formally typeset
Search or ask a question
Topic

Inter frame

About: Inter frame is a research topic. Over the lifetime, 4154 publications have been published within this topic receiving 63549 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: A multiscale video representation using wavelet decomposition and variable-block-size multiresolution motion estimation (MRME) is presented and appears suitable for the broadcast environment where various standards may coexist simultaneously.
Abstract: A multiscale video representation using wavelet decomposition and variable-block-size multiresolution motion estimation (MRME) is presented. The multiresolution/multifrequency nature of the discrete wavelet transform makes it an ideal tool for representing video sources with different resolutions and scan formats. The proposed variable-block-size MRME scheme utilizes motion correlation among different scaled subbands and adapts to their importance at different layers. The algorithm is well suited for interframe HDTV coding applications and facilitates conversions and interactions between different video coding standards. Four scenarios for the proposed motion-compensated coding schemes are compared. A pel-recursive motion estimation scheme is implemented in a multiresolution form. The proposed approach appears suitable for the broadcast environment where various standards may coexist simultaneously. >

115 citations

Journal ArticleDOI
01 Dec 2009
TL;DR: This work builds a complete video resizing framework by incorporating motion-aware constraints with an adaptation of the scale-and-stretch optimization recently proposed by Wang and colleagues, and streaming implementation of the framework allows efficient resizing of long video sequences with low memory cost.
Abstract: Temporal coherence is crucial in content-aware video retargeting. To date, this problem has been addressed by constraining temporally adjacent pixels to be transformed coherently. However, due to the motion-oblivious nature of this simple constraint, the retargeted videos often exhibit flickering or waving artifacts, especially when significant camera or object motions are involved. Since the feature correspondence across frames varies spatially with both camera and object motion, motion-aware treatment of features is required for video resizing. This motivated us to align consecutive frames by estimating interframe camera motion and to constrain relative positions in the aligned frames. To preserve object motion, we detect distinct moving areas of objects across multiple frames and constrain each of them to be resized consistently. We build a complete video resizing framework by incorporating our motion-aware constraints with an adaptation of the scale-and-stretch optimization recently proposed by Wang and colleagues. Our streaming implementation of the framework allows efficient resizing of long video sequences with low memory cost. Experiments demonstrate that our method produces spatiotemporally coherent retargeting results even for challenging examples with complex camera and object motion, which are difficult to handle with previous techniques.

114 citations

Proceedings ArticleDOI
12 Nov 2007
TL;DR: The performance of template matching prediction is further improved and a scheme to predictively encode the decimated version of a target block in flat regions to suppress the prediction errors is proposed.
Abstract: A template matching prediction based on a group of reconstructed pixels surrounding a target block enables prediction of pixels in the target block without motion information. The predictor of a target block is produced by minimizing the matching error of the template. Due to the freedom possessed by the template, the residuals of a target block may become large in flat regions. Our previous paper proposed to predictively encode the decimated version of a target block in flat regions to suppress the prediction errors. In this paper, the performance of template matching prediction is further improved. Multiple candidates are created by template matching at decoder. An average of the multiple candidates then forms the final predictor, which can reduce coding noise residing in the reference frames. Simulation results show that the proposed scheme improves coding efficiency of H.264 up to 7.9%.

113 citations

Patent
Seung K. Pack1, Tae Y. Chung1
18 Dec 1992
TL;DR: In this paper, an intraframe and an interframe process is defined such that the present frame image data is compressed in a variable length compressing manner by way of two-dimensional discrete coding transform.
Abstract: The image signal band compressing method employs a three-dimensional motion compensating technique, an intraframe and an interframe processes which are alternatively executed. The transfer rate of the intraframe to the interframe is set to 4:1 in a unit of fixed length. The intraframe process is defined such that the present frame image data is compressed in a variable length compressing manner by way of two-dimensional discrete coding transform. The interframe process is defined such that motion data is estimated by comparing the present frame and the preceding frame, the present frame is expected on the basis of the motion data and the difference data between the motion compensated image data and the present frame data.

113 citations

Patent
05 Nov 1992
TL;DR: In this paper, an interframe motion prediction method for predicting the motion in a bidirectionally predictive-coded frame from an intra coded frame and a predictive coded frame was proposed.
Abstract: An interframe motion predicting method for prediction of the motion in a bidirectionally predictive-coded frame from an intra-coded frame and a predictive-coded frame, predicts the motion in another bidirectionally predictive-coded frame from the preceding bidirectionally predictive-coded frame and the predictive-coded frame. A picture signal coding apparatus executes orthogonal transformation of a picture signal, then quantizes the transformed data, and codes the data thus quantized. The apparatus includes a local decoder for the quantized data; first and second memories for storing the decoded picture data of an intra-coded or bidirectionally predictive-coded frame, and a predictive-coded frame respectively; a predictive picture generator for generating a predictive picture of a second bidirectionally predictive-coded frame; and a difference calculator for calculating the difference between the predictive picture and the original picture signal corresponding thereto. A picture signal decoding apparatus includes inverse multiplexer for separating the coded data into interframe predictive error data and vector coded data; a decoder for generating decoded picture data on the basis of such error data; first and second memories for storing the decoded picture data of the intra-coded frame and the predictive-coded frame respectively; a predictive picture generator for generating a predicted picture of a second bidirectionally predictive-coded frame; and a frame switching selector for selectively rearranging the decoded picture data in the order of reproduction.

110 citations


Network Information
Related Topics (5)
Feature (computer vision)
128.2K papers, 1.7M citations
86% related
Feature extraction
111.8K papers, 2.1M citations
86% related
Image segmentation
79.6K papers, 1.8M citations
86% related
Convolutional neural network
74.7K papers, 2M citations
83% related
Image processing
229.9K papers, 3.5M citations
82% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202324
202272
202162
202084
2019110
201897