Author

James Bankoski

Bio: James Bankoski is an academic researcher from Google. The author has contributed to research on the topics of reference frames and motion vectors. The author has an h-index of 15 and has co-authored 72 publications receiving 1066 citations.


Papers
01 Nov 2011
TL;DR: This document describes the VP8 compressed video data format, together with a discussion of the decoding procedure for the format.
Abstract: This document describes the VP8 compressed video data format, together with a discussion of the decoding procedure for the format.

251 citations

Journal ArticleDOI
26 Feb 2021
TL;DR: A technical overview of the AV1 codec design that enables the compression performance gains with considerations for hardware feasibility is provided.
Abstract: The AV1 video compression format is developed by the Alliance for Open Media consortium. It achieves more than a 30% reduction in bit rate compared to its predecessor VP9 for the same decoded video quality. This article provides a technical overview of the AV1 codec design that enables the compression performance gains with considerations for hardware feasibility.

95 citations

Patent
Eric Ameres, James Bankoski, Adrian Grange, Timothy S. Murphy, Paul Wilkins, Yaowu Xu
14 Sep 2012
TL;DR: A method of compressing video data having at least one frame, each frame having at least one block and each block having an array of pixels, is provided. The method transforms the pixels of each block into coefficients and creates an optimal transmission order of the coefficients.
Abstract: A method of compressing video data having at least one frame having at least one block and each block having an array of pixels is provided. The method transforms the pixels of each block into coefficients and creates an optimal transmission order of the coefficients. The method also optimizes the speed of processing compressed video data by partitioning the data bitstream and coding each partition independently. The method also predicts fractional pixel motion by selecting an interpolation method for each given plurality or block of pixels depending upon at least one metric related to each given block and varies the method from block to block. The method also enhances error recovery for a current frame using a frame prior to the frame immediately before the current frame as the only reference frame for lessening quality loss during data transmission. Enhanced motion vector coding is also provided.

91 citations

Patent
James Bankoski, Yaowu Xu, Paul Wilkins
10 Sep 2009
TL;DR: A method for digital video encoding prediction is presented. It comprises creating a constructed reference frame using an encoder and compressing a series of source video frames using the constructed reference frame to obtain a bitstream that includes a compressed digital video signal for a subsequent decoding process.
Abstract: Disclosed herein is a method for digital video encoding prediction comprising creating a constructed reference frame using an encoder and compressing a series of source video frames using the constructed reference frame to obtain a bitstream including a compressed digital video signal for a subsequent decoding process. The constructed reference frame is omitted from the series of digital video frames during the subsequent viewing process.

77 citations

Patent
12 May 2004
TL;DR: A method of compressing video data having at least one frame, each frame having at least one block and each block having an array of pixels, is provided. The method transforms the pixels of each block into coefficients and creates an optimal transmission order of the coefficients.
Abstract: A method of compressing video data having at least one frame having at least one block and each block having an array of pixels is provided. The method transforms the pixels of each block into coefficients and creates an optimal transmission order of the coefficients. The method also optimizes the speed of processing compressed video data by partitioning the data bitstream and coding each partition independently. The method also predicts fractional pixel motion by selecting an interpolation method for each given plurality or block of pixels depending upon at least one metric related to each given block and varies the method from block to block. The method also enhances error recovery for a current frame using a frame prior to the frame immediately before the current frame as the only reference frame for lessening quality loss during data transmission. Enhanced motion vector coding is also provided.

75 citations


Cited by
Patent
18 Nov 2013
TL;DR: A modular intelligent transportation system is described, comprising an environmentally protected enclosure, a system communications bus, a processor module communicating with said bus and having an image data input and an audio input, at least one available option slot, a power supply, and a communication link for external communications. The processor module analyzes the image data and/or audio input for data patterns represented therein.
Abstract: A modular intelligent transportation system, comprising an environmentally protected enclosure, a system communications bus, a processor module, communicating with said bus, having an image data input and an audio input, the processor module analyzing the image data and/or audio input for data patterns represented therein, having at least one available option slot, a power supply, and a communication link for external communications, in which at least one available option slot can be occupied by a wireless local area network access point, having a communications path between said communications link and said wireless access point, or other modular components.

377 citations

Patent
30 Sep 2013
TL;DR: An image-processing device and method that make it possible to suppress block noise are presented. Depending on whether the value of βLUT_input is 51 or less or larger than 51, β is determined either from a LUT conforming to the HEVC standard or by an expanded-β generator, and is supplied to a filter process determination unit.
Abstract: The present disclosure pertains to an image-processing device and method that make it possible to suppress block noise. A βLUT_input calculator and a clip processor determine βLUT_input, which is a value inputted to an existing-β generator and an expanded-β generator. When the value of βLUT_input qp from the clip processor is 51 or less, the existing-β generator determines β using a LUT conforming to the HEVC standard, and supplies same to a filter process determination unit. When the value of βLUT_input qp from the clip processor is larger than 51, the expanded-β generator determines the expanded β and supplies same to the filter process determination unit. This disclosure, for example, can be applied to an image processing device.

340 citations

Proceedings ArticleDOI
Jim Bankoski, Paul Wilkins, Yaowu Xu
11 Jul 2011
TL;DR: This paper provides a technical overview of the format, with an emphasis on its unique features, and discusses how these features benefit VP8 in achieving high compression efficiency and low decoding complexity at the same time.
Abstract: VP8 is an open source video compression format supported by a consortium of technology companies. This paper provides a technical overview of the format, with an emphasis on its unique features. The paper also discusses how these features benefit VP8 in achieving high compression efficiency and low decoding complexity at the same time.

327 citations

Proceedings ArticleDOI
24 Jun 2018
TL;DR: A brief technical overview of key coding techniques in AV1 is provided along with preliminary compression performance comparison against VP9 and HEVC.
Abstract: AV1 is an emerging open-source and royalty-free video compression format, which is jointly developed and finalized in early 2018 by the Alliance for Open Media (AOMedia) industry consortium. The main goal of AV1 development is to achieve substantial compression gain over state-of-the-art codecs while maintaining practical decoding complexity and hardware feasibility. This paper provides a brief technical overview of key coding techniques in AV1 along with preliminary compression performance comparison against VP9 and HEVC.

260 citations

Patent
20 Aug 2007
TL;DR: An encoder uses an input file of data and a key to produce an output symbol. The output symbols are generally independent of each other, and an unbounded number (subject to the resolution of the key I) can be generated, if needed.
Abstract: An encoder uses an input file of data and a key to produce an output symbol. An output symbol with key I is generated by determining a weight, W(I), for the output symbol to be generated, selecting W(I) of the input symbols associated with the output symbol according to a function of I, and generating the output symbol's value B(I) from a predetermined value function F(I) of the selected W(I) input symbols. An encoder can be called repeatedly to generate multiple output symbols. The output symbols are generally independent of each other, and an unbounded number (subject to the resolution of I) can be generated, if needed. A decoder receives some or all of the output symbols generated. The number of output symbols needed to decode an input file is equal to, or slightly greater than, the number of input symbols comprising the file, assuming that input symbols and output symbols represent the same number of bits of data.

235 citations
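
The keyed encoder described in the last abstract above can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the function names (`weight`, `associates`, `encode_symbol`), the uniform weight distribution, the use of a key-seeded pseudo-random generator to pick the associated input symbols, and XOR as the value function. Input symbols are modeled as small integers.

```python
import random

def weight(key: int, num_inputs: int) -> int:
    """W(I): hypothetical weight function, deterministic in the key I.

    Assumption: a uniform draw from 1..num_inputs; the abstract does not
    specify the distribution.
    """
    return random.Random(key).randint(1, num_inputs)

def associates(key: int, num_inputs: int) -> list[int]:
    """Indices of the W(I) input symbols associated with key I.

    The same key-seeded generator is replayed, so an encoder and a decoder
    that share this function derive the same indices from the key alone.
    """
    rng = random.Random(key)
    w = rng.randint(1, num_inputs)  # same draw as weight(key, num_inputs)
    return sorted(rng.sample(range(num_inputs), w))

def encode_symbol(key: int, input_symbols: list[int]) -> int:
    """B(I): value of the output symbol with key I.

    Assumption: the value function is the XOR of the selected input symbols.
    """
    value = 0
    for index in associates(key, len(input_symbols)):
        value ^= input_symbols[index]
    return value

# Usage: the encoder can be called repeatedly, once per key, to produce
# as many output symbols as needed.
data = [0x12, 0x34, 0x56, 0x78]
output_symbols = {key: encode_symbol(key, data) for key in range(8)}
```

Because each output symbol depends only on its key and the input file, symbols can be generated in any order and in any quantity, which matches the independence property stated in the abstract. Decoding is not shown in this sketch.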