Robust Video Super-Resolution with Learned Temporal Dynamics

doi:10.1109/ICCV.2017.274

Proceedings ArticleDOI

Robust Video Super-Resolution with Learned Temporal Dynamics

Ding Liu, +6 more

- pp 2526-2534

Chats0

TLDR

This work proposes a temporal adaptive neural network that can adaptively determine the optimal scale of temporal dependency and reduces the complexity of motion between neighboring frames using a spatial alignment network which is much more robust and efficient than competing alignment methods.

Abstract:

Video super-resolution (SR) aims to generate a highresolution (HR) frame from multiple low-resolution (LR) frames in a local temporal window. The inter-frame temporal relation is as crucial as the intra-frame spatial relation for tackling this problem. However, how to utilize temporal information efficiently and effectively remains challenging since complex motion is difficult to model and can introduce adverse effects if not handled properly. We address this problem from two aspects. First, we propose a temporal adaptive neural network that can adaptively determine the optimal scale of temporal dependency. Filters on various temporal scales are applied to the input LR sequence before their responses are adaptively aggregated. Second, we reduce the complexity of motion between neighboring frames using a spatial alignment network which is much more robust and efficient than competing alignment methods and can be jointly trained with the temporal adaptive network in an end-to-end manner. Our proposed models with learned temporal dynamics are systematically evaluated on public video datasets and achieve state-of-the-art SR results compared with other recent video SR approaches. Both of the temporal adaptation and the spatial alignment modules are demonstrated to considerably improve SR quality over their plain counterparts.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Benchmarking Single-Image Dehazing and Beyond

Boyi Li, +6 more

- 01 Jan 2019 -

IEEE Transactions on Image Processing

TL;DR: In this article, the authors present a comprehensive study and evaluation of existing single image dehazing algorithms, using a new large-scale benchmark consisting of both synthetic and real-world hazy images, called Realistic Single-Image DEhazing (RESIDE).

...read moreread less

Journal ArticleDOI

Deep Learning for Image Super-Resolution: A Survey

Zhihao Wang, +2 more

- 01 Oct 2021 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A survey on recent advances of image super-resolution techniques using deep learning approaches in a systematic way, which can roughly group the existing studies of SR techniques into three major categories: supervised SR, unsupervised SR, and domain-specific SR.

...read moreread less

Proceedings ArticleDOI

EDVR: Video Restoration With Enhanced Deformable Convolutional Networks

Xintao Wang, +4 more

TL;DR: This work proposes a novel Video Restoration framework with Enhanced Deformable convolutions, termed EDVR, and proposes a Temporal and Spatial Attention (TSA) fusion module, in which attention is applied both temporally and spatially, so as to emphasize important features for subsequent restoration.

...read moreread less

Proceedings ArticleDOI

DeblurGAN-v2: Deblurring (Orders-of-Magnitude) Faster and Better

Orest Kupyn, +3 more

TL;DR: It is demonstrated that DeblurGAN-V2 has very competitive performance on several popular benchmarks, in terms of deblurring quality (both objective and subjective), as well as efficiency, and is effective for general image restoration tasks too.

...read moreread less

Proceedings ArticleDOI

Gated Fusion Network for Single Image Dehazing

Wenqi Ren, +6 more

TL;DR: An efficient algorithm to directly restore a clear image from a hazy input using an end-to-end trainable neural network that consists of an encoder and a decoder is proposed.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Proceedings ArticleDOI

Going deeper with convolutions

Christian Szegedy, +8 more

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

Proceedings Article

Rectified Linear Units Improve Restricted Boltzmann Machines

Vinod Nair, +1 more

TL;DR: Restricted Boltzmann machines were developed using binary stochastic hidden units that learn features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.

...read moreread less

Posted Content

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

- 20 Jun 2014 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: Caffe as discussed by the authors is a BSD-licensed C++ library with Python and MATLAB bindings for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

...read moreread less

Proceedings ArticleDOI

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

TL;DR: Caffe provides multimedia scientists and practitioners with a clean and modifiable framework for state-of-the-art deep learning algorithms and a collection of reference models for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

...read moreread less

Collapse

Robust Video Super-Resolution with Learned Temporal Dynamics

Citations

Benchmarking Single-Image Dehazing and Beyond

Deep Learning for Image Super-Resolution: A Survey

EDVR: Video Restoration With Enhanced Deformable Convolutional Networks

DeblurGAN-v2: Deblurring (Orders-of-Magnitude) Faster and Better

Gated Fusion Network for Single Image Dehazing

References

ImageNet Classification with Deep Convolutional Neural Networks

Going deeper with convolutions

Rectified Linear Units Improve Restricted Boltzmann Machines

Caffe: Convolutional Architecture for Fast Feature Embedding

Caffe: Convolutional Architecture for Fast Feature Embedding

Related Papers (5)

Accurate Image Super-Resolution Using Very Deep Convolutional Networks

Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network

Learning a Deep Convolutional Network for Image Super-Resolution

Adam: A Method for Stochastic Optimization

Enhanced Deep Residual Networks for Single Image Super-Resolution