scispace - formally typeset
Search or ask a question
Author

Enming Luo

Bio: Enming Luo is an academic researcher from University of California, San Diego. The author has contributed to research in topics: Video denoising & Non-local means. The author has an hindex of 8, co-authored 25 publications receiving 324 citations. Previous affiliations of Enming Luo include Wilmington University & Facebook.

Papers
More filters
Journal ArticleDOI
TL;DR: A data-dependent denoising procedure to restore noisy images which finds patches from a database that contains relevant patches by considering a localized Bayesian prior and demonstrating the superiority of the new algorithm over existing methods.
Abstract: We propose a data-dependent denoising procedure to restore noisy images. Different from existing denoising algorithms which search for patches from either the noisy image or a generic database, the new algorithm finds patches from a database that contains relevant patches. We formulate the denoising problem as an optimal filter design problem and make two contributions. First, we determine the basis function of the denoising filter by solving a group sparsity minimization problem. The optimization formulation generalizes existing denoising algorithms and offers systematic analysis of the performance. Improvement methods are proposed to enhance the patch search process. Second, we determine the spectral coefficients of the denoising filter by considering a localized Bayesian prior. The localized prior leverages the similarity of the targeted database, alleviates the intensive Bayesian computation, and links the new method to the classical linear minimum mean squared error estimation. We demonstrate applications of the proposed method in a variety of scenarios, including text images, multiview images, and face images. Experimental results show the superiority of the new algorithm over existing methods.

107 citations

Journal ArticleDOI
TL;DR: The experimental results show that the proposed adaptation algorithm yields consistently better denoising results than the one without adaptation and is superior to several state-of-the-art algorithms.
Abstract: We propose an adaptive learning procedure to learn patch-based image priors for image denoising. The new algorithm, called the expectation-maximization (EM) adaptation, takes a generic prior learned from a generic external database and adapts it to the noisy image to generate a specific prior. Different from existing methods that combine internal and external statistics in ad hoc ways, the proposed algorithm is rigorously derived from a Bayesian hyper-prior perspective. There are two contributions of this paper. First, we provide full derivation of the EM adaptation algorithm and demonstrate methods to improve the computational complexity. Second, in the absence of the latent clean image, we show how EM adaptation can be modified based on pre-filtering. The experimental results show that the proposed adaptation algorithm yields consistently better denoising results than the one without adaptation and is superior to several state-of-the-art algorithms.

61 citations

Patent
27 Sep 2013
TL;DR: In this paper, the upsampling filter information and the encoded enhancement layer pictures may be sent in an outpui video bitstream by an encoder, which is performed by an algorithm that determines whether knowledge of a category related to the video sequence exists.
Abstract: Systems, methods, and instrumentalities are disclosed for adaptive upsampling for multi¬ layer video coding. A method of communicating video data may involve applying an upsampling filter to a video sequence to create encoded enhancement layer pictures. The upsampling filter may be applied at a sequence level of the video sequence to create the enhancement layer bitstream. The upsampling filter may be selected from a plurality of candidate upsampling filters, for example, by determining whether knowledge of a category related to the video sequence exists and selecting a candidate upsampling filter that is designed for the category related to the video sequence. Upsampling filter information may be encoded. The encoded upsampling information may comprise a plurality of coefficients of the upsampling filter. The encoded upsampling filter information and the encoded enhancement layer pictures may be sent in an outpui video bitstream. The method may be performed, for example, by an encoder.

30 citations

Proceedings ArticleDOI
11 Jul 2011
TL;DR: A new method was derived from a theoretical model by extracting the low frequency in an image set by creating a low frequency template and using it as a prediction for each image to compute its residue.
Abstract: In advance of the imaging capturing technology, large amount of similar images are created. Instead of compressing each similar image individually, removing the inter-image redundancy would reduce the storage and transmission time. However, only a few set redundancy methods are proposed to deal with the problem. In this paper, a new method was derived from a theoretical model by extracting the low frequency in an image set. For the similar images, the values of their low frequency components are very close to that of their neighboring pixel in the spatial domain. In our model, a low frequency template is created and used as a prediction for each image to compute its residue. This model proves the reduction in the entropy and hence the bit rates. Experiments were conducted and proved there were up to 30% gains over the existing methods.

26 citations

Proceedings ArticleDOI
04 May 2014
TL;DR: This paper proposes to denoise images using targeted external image databases to determine the basis functions of the optimal filter by means of group sparsity, and demonstrates superior denoising results over existing algorithms.
Abstract: Classical image denoising algorithms based on single noisy images and generic image databases will soon reach their performance limits. In this paper, we propose to denoise images using targeted external image databases. Formulating denoising as an optimal filter design problem, we utilize the targeted databases to (1) determine the basis functions of the optimal filter by means of group sparsity; (2) determine the spectral coefficients of the optimal filter by means of localized priors. For a variety of scenarios such as text images, multiview images, and face images, we demonstrate superior denoising results over existing algorithms.

26 citations


Cited by
More filters
Proceedings ArticleDOI
10 Nov 2017
TL;DR: In this article, a spatio-temporal sub-pixel convolution network is proposed to exploit temporal redundancies and improve reconstruction accuracy while maintaining real-time speed, and a novel joint motion compensation and video super-resolution algorithm that is orders of magnitude more efficient than competing methods.
Abstract: Convolutional neural networks have enabled accurate image super-resolution in real-time. However, recent attempts to benefit from temporal correlations in video super-resolution have been limited to naive or inefficient architectures. In this paper, we introduce spatio-temporal sub-pixel convolution networks that effectively exploit temporal redundancies and improve reconstruction accuracy while maintaining real-time speed. Specifically, we discuss the use of early fusion, slow fusion and 3D convolutions for the joint processing of multiple consecutive video frames. We also propose a novel joint motion compensation and video super-resolution algorithm that is orders of magnitude more efficient than competing methods, relying on a fast multi-resolution spatial transformer module that is end-to-end trainable. These contributions provide both higher accuracy and temporally more consistent videos, which we confirm qualitatively and quantitatively. Relative to single-frame models, spatio-temporal networks can either reduce the computational cost by 30% whilst maintaining the same quality or provide a 0.2dB gain for a similar computational cost. Results on publicly available datasets demonstrate that the proposed algorithms surpass current state-of-the-art performance in both accuracy and efficiency.

622 citations

Proceedings ArticleDOI
01 Oct 2019
TL;DR: In this paper, a single-stage blind real image denoising network (RIDNet) was proposed by employing a modular architecture, which uses residual on the residual structure to ease the flow of low-frequency information and apply feature attention to exploit the channel dependencies.
Abstract: Deep convolutional neural networks perform better on images containing spatially invariant noise (synthetic noise); however, its performance is limited on real-noisy photographs and requires multiple stage network modeling. To advance the practicability of the denoising algorithms, this paper proposes a novel single-stage blind real image denoising network (RIDNet) by employing a modular architecture. We use residual on the residual structure to ease the flow of low-frequency information and apply feature attention to exploit the channel dependencies. Furthermore, the evaluation in terms of quantitative metrics and visual quality on three synthetic and four real noisy datasets against 19 state-of-the-art algorithms demonstrate the superiority of our RIDNet.

285 citations

Posted Content
TL;DR: A novel single-stage blind real image denoising network (RIDNet) is proposed by employing a modular architecture that uses residual on the residual structure to ease the flow of low-frequency information and apply feature attention to exploit the channel dependencies.
Abstract: Deep convolutional neural networks perform better on images containing spatially invariant noise (synthetic noise); however, their performance is limited on real-noisy photographs and requires multiple stage network modeling. To advance the practicability of denoising algorithms, this paper proposes a novel single-stage blind real image denoising network (RIDNet) by employing a modular architecture. We use a residual on the residual structure to ease the flow of low-frequency information and apply feature attention to exploit the channel dependencies. Furthermore, the evaluation in terms of quantitative metrics and visual quality on three synthetic and four real noisy datasets against 19 state-of-the-art algorithms demonstrate the superiority of our RIDNet.

243 citations

Journal ArticleDOI
TL;DR: This paper aims to present algorithms to develop counter algorithms to fade away its negative effects to train deep neural networks efficiently and divides them into one of the two subgroups: noise model based and noise model free methods.
Abstract: Image classification systems recently made a giant leap with the advancement of deep neural networks. However, these systems require an excessive amount of labeled data to be adequately trained. Gathering a correctly annotated dataset is not always feasible due to several factors, such as the expensiveness of the labeling process or difficulty of correctly classifying data, even for the experts. Because of these practical challenges, label noise is a common problem in real-world datasets, and numerous methods to train deep neural networks with label noise are proposed in the literature. Although deep neural networks are known to be relatively robust to label noise, their tendency to overfit data makes them vulnerable to memorizing even random noise. Therefore, it is crucial to consider the existence of label noise and develop counter algorithms to fade away its adverse effects to train deep neural networks efficiently. Even though an extensive survey of machine learning techniques under label noise exists, the literature lacks a comprehensive survey of methodologies centered explicitly around deep learning in the presence of noisy labels. This paper aims to present these algorithms while categorizing them into one of the two subgroups: noise model based and noise model free methods. Algorithms in the first group aim to estimate the noise structure and use this information to avoid the adverse effects of noisy labels. Differently, methods in the second group try to come up with inherently noise robust algorithms by using approaches like robust losses, regularizers or other learning paradigms.

213 citations

Journal ArticleDOI
TL;DR: This article focuses on classifying and comparing some of the significant works in the field of denoising and explains why some methods work optimally and others tend to create artefacts and remove fine structural details under general conditions.

211 citations