scispace - formally typeset
Search or ask a question
Author

Dmytro Rusanovskyy

Bio: Dmytro Rusanovskyy is an academic researcher from Nokia. The author has contributed to research in topics: Multiview Video Coding & Adaptive filter. The author has an hindex of 18, co-authored 40 publications receiving 1649 citations. Previous affiliations of Dmytro Rusanovskyy include Tampere University of Technology.

Papers
More filters
Patent
25 Sep 2013
TL;DR: In this article, a method for pixel-wise joint filtering of depth maps from a plurality of viewing angles is described, which enables to suppress the noise in the depth map data and provides improved performance for a view synthesis.
Abstract: There is disclosed a method, an apparatus, a server, a client and a non-transitory computer readable medium comprising a computer program stored therein for video coding and decoding. Depth pictures from a plurality of viewing angles are projected into a single viewing angle, making it possible to have pixel-wise joint filtering to be applied to all projected depth values. This approach enables to suppress the noise in the depth map data and provides improved performance for a view synthesis.

354 citations

Patent
25 Apr 2013
TL;DR: In this article, the order of the texture view component and the depth view component in an access unit is determined and at least one indication of the order is encoded, and the decoding of the view component is adapted on the basis of order.
Abstract: There is disclosed a method, apparatus and computer program product in which at least one view component of a first type and at least one view component of a second type are obtained. The order of the texture view component and the depth view component in an access unit is determined and at least one indication of the order is encoded. The coding of the view components is adapted on the basis of the order. There is also disclosed a method, apparatus and computer program product in which at least one encoded view component of a first type and at least one encoded view component of a second type are received. Also at least one encoded indication of the order of the view components is received. The at least one encoded indication is decoded and the decoding of the view component sis adapted on the basis of the order.

319 citations

Patent
16 Apr 2013
TL;DR: In this article, a method, an apparatus, a server, a client and a non-transitory computer readable medium comprising a computer program stored therein for motion compensated video coding and decoding is disclosed.
Abstract: There is disclosed a method, an apparatus, a server, a client and a non-transitory computer readable medium comprising a computer program stored therein for motion compensated video coding and decoding. Texture block motion information is used to derive disparity/depth motion information. Alternatively, disparity/depth motion information is used to derive texture block motion information.

200 citations

Journal ArticleDOI
TL;DR: Coding efficiency improvements are achieved with lower complexity than the H.264/AVC Baseline Profile, particularly suiting the proposal for high resolution, high quality applications in resource-constrained environments.
Abstract: This paper describes a low complexity video codec with high coding efficiency. It was proposed to the high efficiency video coding (HEVC) standardization effort of moving picture experts group and video coding experts group, and has been partially adopted into the initial HEVC test model under consideration design. The proposal utilizes a quadtree-based coding structure with support for macroblocks of size 64 × 64, 32 × 32, and 16 × 16 pixels. Entropy coding is performed using a low complexity variable length coding scheme with improved context adaptation compared to the context adaptive variable length coding design in H.264/AVC. The proposal's interpolation and deblocking filter designs improve coding efficiency, yet have low complexity. Finally, intra-picture coding methods have been improved to provide better subjective quality than H.264/AVC. The subjective quality of the proposed codec has been evaluated extensively within the HEVC project, with results indicating that similar visual quality to H.264/AVC High Profile anchors is achieved, measured by mean opinion score, using significantly fewer bits. Coding efficiency improvements are achieved with lower complexity than the H.264/AVC Baseline Profile, particularly suiting the proposal for high resolution, high quality applications in resource-constrained environments.

156 citations

Patent
08 Jan 2008
TL;DR: In this article, a predefined base filter has fixed coefficient values and a prediction signal representative of the difference between a video frame and a reference image is calculated from the reference image based on a pre-defined base filter and motion estimation performed on the video frame.
Abstract: In digital video image encoding and decoding, a filter type is selected based on symmetrical properties of the images and coefficient values of an interpolation filter are calculated based on the selected filter type. Coefficient values, filter tap-length and selected filter-type are provided in the encoded video data. Coefficient values are also calculated based on a prediction signal representative of the different between a video frame and a reference image. The prediction signal is calculated from the reference image based on a predefined base filter and motion estimation performed on the video frame. The predefined base filter has fixed coefficient values. Coefficient values are selected from interpolation of pixel values in a selected image segment in the video frame. Symmetry properties of images can be a vertical symmetry, a horizontal symmetry and a combination thereof, so that only a portion of the filter coefficients are coded.

95 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: This paper shows how this denoising method is generalized to become a relatively simple super-resolution algorithm with no explicit motion estimation, and results show that the proposed method is very successful in providing super- resolution on general sequences.
Abstract: Super-resolution reconstruction proposes a fusion of several low-quality images into one higher quality result with better optical resolution. Classic super-resolution techniques strongly rely on the availability of accurate motion estimation for this fusion task. When the motion is estimated inaccurately, as often happens for nonglobal motion fields, annoying artifacts appear in the super-resolved outcome. Encouraged by recent developments on the video denoising problem, where state-of-the-art algorithms are formed with no explicit motion estimation, we seek a super-resolution algorithm of similar nature that will allow processing sequences with general motion patterns. In this paper, we base our solution on the Nonlocal-Means (NLM) algorithm. We show how this denoising method is generalized to become a relatively simple super-resolution algorithm with no explicit motion estimation. Results on several test movies show that the proposed method is very successful in providing super-resolution on general sequences.

845 citations

Proceedings ArticleDOI
TL;DR: This work presents a novel approach to still image denoising based on effective filtering in 3D transform domain by combining sliding-window transform processing with block-matching, and shows that the proposed method delivers state-of-art Denoising performance, both in terms of objective criteria and visual quality.
Abstract: We present a novel approach to still image denoising based on effective filtering in 3D transform domain by combining sliding-window transform processing with block-matching. We process blocks within the image in a sliding manner and utilize the block-matching concept by searching for blocks which are similar to the currently processed one. The matched blocks are stacked together to form a 3D array and due to the similarity between them, the data in the array exhibit high level of correlation. We exploit this correlation by applying a 3D decorrelating unitary transform and effectively attenuate the noise by shrinkage of the transform coefficients. The subsequent inverse 3D transform yields estimates of all matched blocks. After repeating this procedure for all image blocks in sliding manner, the final estimate is computed as weighed average of all overlapping blockestimates. A fast and efficient algorithm implementing the proposed approach is developed. The experimental results show that the proposed method delivers state-of-art denoising performance, both in terms of objective criteria and visual quality.

672 citations

Proceedings ArticleDOI
03 Sep 2007
TL;DR: An effective video denoising method based on highly sparse signal representation in local 3D transform domain that achieves state-of-the-art denoised performance in terms of both peak signal-to-noise ratio and subjective visual quality is proposed.
Abstract: We propose an effective video denoising method based on highly sparse signal representation in local 3D transform domain. A noisy video is processed in blockwise manner and for each processed block we form a 3D data array that we call “group” by stacking together blocks found similar to the currently processed one. This grouping is realized as a spatio-temporal predictive-search block-matching, similar to techniques used for motion estimation. Each formed 3D group is filtered by a 3D transform-domain shrinkage (hard-thresholding and Wiener filtering), the result of which are estimates of all grouped blocks. This filtering — that we term “collaborative filtering” — exploits the correlation between grouped blocks and the corresponding highly sparse representation of the true signal in the transform domain. Since, in general, the obtained block estimates are mutually overlapping, we aggregate them by a weighted average in order to form a non-redundant estimate of the video. Significant improvement of this approach is achieved by using a two-step algorithm where an intermediate estimate is produced by grouping and collaborative hard-thresholding and then used both for improving the grouping and for applying collaborative empirical Wiener filtering. We develop an efficient realization of this video denoising algorithm. The experimental results show that at reasonable computational cost it achieves state-of-the-art denoising performance in terms of both peak signal-to-noise ratio and subjective visual quality.

496 citations

Patent
Miska Hannuksela1
23 Dec 2014
TL;DR: In this paper, the authors propose a method to skip decoding of the next decodable access unit based on decoding at least one access unit of the first sequence of access units.
Abstract: A method comprises receiving a first sequence of access units and a second sequence of access units; decoding at least one access unit of the first sequence of access units; decoding a first decodable access unit of the second sequence of access units; determining whether a next decodable access unit in the second sequence of access units can be decoded before an output time of the next decodable access unit in the second sequence of access units; and skipping decoding of the next decodable access unit based on determining that the next decodable access unit cannot be decoded before the at least one of the decoding time and the output time of the next decodable access unit.

490 citations

Patent
18 Nov 2013
TL;DR: In this article, a modular intelligent transportation system, comprising an environmentally protected enclosure, a system communications bus, a processor module, communicating with said bus, having a image data input and an audio input, the processor module analyzing the image data and/or audio input for data patterns represented therein, having at least one available option slot, a power supply, and a communication link for external communications.
Abstract: A modular intelligent transportation system, comprising an environmentally protected enclosure, a system communications bus, a processor module, communicating with said bus, having a image data input and an audio input, the processor module analyzing the image data and/or audio input for data patterns represented therein, having at least one available option slot, a power supply, and a communication link for external communications, in which at least one available option slot can be occupied by a wireless local area network access point, having a communications path between said communications link and said wireless access point, or other modular components.

377 citations