scispace - formally typeset
Search or ask a question
Topic

Inter frame

About: Inter frame is a research topic. Over the lifetime, 4154 publications have been published within this topic receiving 63549 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: The proposed automatic shot detection method, by employing the fast feature descriptor of Oriented FAST and Rotated BRIEF fused with SSIM, can outperform the existing shot detection methods, including the rule-based and learning-based methods, by testing on the video sequences from the Open-video project and RAI dataset.
Abstract: Shots are the basic units for analyzing and retrieving video, and also the essential elements in creating video datasets. The traditional methods of shot detection exhibit unsatisfactory performance for being too sensitive to motion or too much time-consuming. This paper proposes an automatic shot detection method, by employing the fast feature descriptor of Oriented FAST and Rotated BRIEF (ORB) fused with Structural Similarity (SSIM). Firstly, ORB descriptor is used to preselect candidate segments with a high tolerance for rapidly extracting the features of twenty-frame intervals in video sequences. Then, the cut transition is detected by comparing ORB features, fused with SSIM, of consecutive frames in the candidate segment. Finally, the gradual transition is detected by determining the maximum amount of the continuous increasing/decreasing interframe differences in the candidate segment without cut transition. Experimental result indicates that the proposed method can achieve an F1-Score of 92.5% and five times of real-time speed with one CPU on 106049 test frames from the Open-video project, YouTube, and YOUKU. In addition, the proposed method can outperform the existing shot detection methods, including the rule-based and learning-based methods, by testing on the video sequences from the Open-video project and RAI dataset.

16 citations

Patent
19 Sep 2012
TL;DR: In this paper, the authors proposed a video encoding and decoding method which is used in a video communication system, wherein the predictive encoding type and reference frame type of a video encoder and decoder are controlled according to the network state and a notice message of the video decoding device.
Abstract: The invention relates to a video encoding and decoding method which is used in a video communication system, wherein the predictive encoding type and reference frame type of a video encoding device are controlled according to the network state and a notice message of a video decoding device. When a network is congested, the video encoding device utilizes a reference frame as a long-term referenceframe to perform interframe prediction encoding and judges whether the video decoding device receives the corresponding reference frame according to the notice message of the video decoding device, thus the video encoding device and the video decoding device can utilize the same reference frame. The invention also provides the video encoding device and video decoding device which are used in the method in the invention. The video encoding and decoding method, video encoding device and video decoding device provided by the invention are used to encode and decode a video image by determining the long-term reference frame, thus the video encoding device and the video decoding device can utilize the same reference frame, the decoding error phenomenon caused by packet loss can be avoided and the quality of the video image in video communication can be increased.

16 citations

Patent
Andre Zaccarin1
21 Dec 2001
TL;DR: In this article, the authors proposed a method to find a set of blocks that have a line-of-pixels similar to a line of pixels in a target block of another frame.
Abstract: In recent years, it has become increasingly common to transmit sequences of digital images (video data) from one point to another, particularly over computer networks, such as the World-Wide-Web portion of the Internet. To reduce transmission times, computers and other devices that transmit and receive video data often include a video encoder that encodes or compress the data based on the redundancy or similarity between consecutive video frames. Many encoders use motion estimation as a key part of the compression. However, motion estimation itself can be time consuming to perform. Accordingly, the present inventor devised some unique techniques that allow for faster motion estimation. One exemplary technique subsamples a search area of a reference frame to find a set of blocks that have a line of pixels similar to a line of pixels in a target block of another frame. The set of blocks found based on the line similarity are then compared in greater detail to the target block to determine the one best suited for estimating a motion vector for the target block.

16 citations

Proceedings ArticleDOI
07 Nov 2009
TL;DR: Experimental results have shown that the proposed stereoscopic video error concealment method can effectively restore the lost frames at the decoder and deliver attractive performance, in terms of objective measurement (in peak signal-to-noise ratio) and subjective visual quality.
Abstract: At low bit-rate video communications, packet loss may easily cause whole-frame loss that, in return, leads to annoying frame drop phenomenon. In this paper, a novel error concealment algorithm is specifically developed for stereoscopic video, called the disparity-based frame difference projection (DFDP), to recover the lost frames at the decoder. The proposed DFDP contains three key components: 1) change detection, 2) disparity estimation, and 3) frame difference projection, which exploits both the intra-view frame difference from one view and interview correlation to estimate the lost frame in another view. The change region computed on the correctly received frame will be used to predict the change region between current missing frame and its previous frame through the estimated disparity, which is the summation of the estimated global disparity and the estimated local disparity. Experimental results have shown that the proposed stereoscopic video error concealment method can effectively restore the lost frames at the decoder and deliver attractive performance, in terms of objective measurement (in peak signal-to-noise ratio) and subjective visual quality.

16 citations

Patent
24 Dec 2013
TL;DR: In this article, a non-key frame is selected from a plurality of frames, each with a playback timestamp, and the selected frame is displayed as if it were a key frame.
Abstract: Methods and systems for seeking to a non-key frame in a compressed video. The compressed video has a plurality of frames, each with a playback timestamp. At least one of the frames is a key frame. A selection of a non-key frame is received, and a temporally proximate key frame is identified. The set of frames between the identified key frame and the selected frame is played in a reduced time interval, and the selected frame is displayed as if it were a key frame.

16 citations


Network Information
Related Topics (5)
Feature (computer vision)
128.2K papers, 1.7M citations
86% related
Feature extraction
111.8K papers, 2.1M citations
86% related
Image segmentation
79.6K papers, 1.8M citations
86% related
Convolutional neural network
74.7K papers, 2M citations
83% related
Image processing
229.9K papers, 3.5M citations
82% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202324
202272
202162
202084
2019110
201897