Learning Parallax Attention for Stereo Image Super-Resolution

doi:10.1109/CVPR.2019.01253

Open AccessProceedings ArticleDOI

Learning Parallax Attention for Stereo Image Super-Resolution

- pp 12250-12259

TLDR

A parallax-attention mechanism with a global receptive field along the epipolar line to handle different stereo images with large disparity variations is introduced and a new and the largest dataset for stereo image SR is proposed.

Abstract:

Stereo image pairs can be used to improve the performance of super-resolution (SR) since additional information is provided from a second viewpoint. However, it is challenging to incorporate this information for SR since disparities between stereo images vary significantly. In this paper, we propose a parallax-attention stereo superresolution network (PASSRnet) to integrate the information from a stereo image pair for SR. Specifically, we introduce a parallax-attention mechanism with a global receptive field along the epipolar line to handle different stereo images with large disparity variations. We also propose a new and the largest dataset for stereo image SR (namely, Flickr1024). Extensive experiments demonstrate that the parallax-attention mechanism can capture correspondence between stereo images to improve SR performance with a small computational and memory cost. Comparative results show that our PASSRnet achieves the state-of-the-art performance on the Middlebury, KITTI 2012 and KITTI 2015 datasets.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Deep Learning for Image Super-Resolution: A Survey

Zhihao Wang, +2 more

- 01 Oct 2021 -

IEEE Transactions on Pattern Analysis an...

TL;DR: A survey on recent advances of image super-resolution techniques using deep learning approaches in a systematic way, which can roughly group the existing studies of SR techniques into three major categories: supervised SR, unsupervised SR, and domain-specific SR.

...read moreread less

Journal ArticleDOI

Deformable 3D Convolution for Video Super-Resolution

Xinyi Ying, +5 more

- 06 Apr 2020 -

arXiv: Computer Vision and Pattern Recog...

TL;DR: A deformable 3D convolution network (D3Dnet) is proposed to incorporate spatio-temporal information from both spatial and temporal dimensions for video SR, and achieves state-of-the-art SR performance.

...read moreread less

Journal ArticleDOI

ANU-Net: Attention-based nested U-Net to exploit full resolution features for medical image segmentation

Chen Li, +6 more

- 01 Aug 2020 -

Computers & Graphics

TL;DR: An attention-based nested segmentation network, named ANU-Net, which has a deep supervised encoder-decoder architecture and a redesigned dense skip connection and achieved very competitive performance for four kinds of medical image segmentation tasks.

...read moreread less

Proceedings ArticleDOI

Attention Unet++: A Nested Attention-Aware U-Net for Liver CT Image Segmentation

Chen Li, +6 more

TL;DR: A nested attention-aware segmentation network, named Attention UNet++, which has a deep supervised encoder-decoder architecture and a redesigned dense skip connection and achieved very competitive performance on MICCAI 2017 Liver Tumor Segmentation Challenge Dataset.

...read moreread less

Journal Article

VRT: A Video Restoration Transformer

Jingyun Liang, +7 more

- 28 Jan 2022 -

arXiv.org

TL;DR: Experimental results on video super-resolution, video deblurring, video denoising, video frame interpolation and space-time videosuper-resolution demonstrate that VRT outperforms the state-of-the-art methods by large margins.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings Article

Attention is All you Need

Ashish Vaswani, +7 more

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

...read moreread less

Proceedings ArticleDOI

Are we ready for autonomous driving? The KITTI vision benchmark suite

Andreas Geiger, +2 more

TL;DR: The autonomous driving platform is used to develop novel challenging benchmarks for the tasks of stereo, optical flow, visual odometry/SLAM and 3D object detection, revealing that methods ranking high on established datasets such as Middlebury perform below average when being moved outside the laboratory to the real world.

...read moreread less

Proceedings ArticleDOI

Non-local Neural Networks

Xiaolong Wang, +3 more

TL;DR: In this article, the non-local operation computes the response at a position as a weighted sum of the features at all positions, which can be used to capture long-range dependencies.

...read moreread less

Journal ArticleDOI

A taxonomy and evaluation of dense two-frame stereo correspondence algorithms

Daniel Scharstein, +2 more

- 09 Dec 2001 -

International Journal of Computer Vision

TL;DR: This paper has designed a stand-alone, flexible C++ implementation that enables the evaluation of individual components and that can easily be extended to include new algorithms.

...read moreread less

Collapse

IEEE Transactions on Pattern Analysis an...

Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network

Wenzhe Shi, +7 more

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

Learning Parallax Attention for Stereo Image Super-Resolution

Citations

Deep Learning for Image Super-Resolution: A Survey

Deformable 3D Convolution for Video Super-Resolution

ANU-Net: Attention-based nested U-Net to exploit full resolution features for medical image segmentation

Attention Unet++: A Nested Attention-Aware U-Net for Liver CT Image Segmentation

VRT: A Video Restoration Transformer

References

Adam: A Method for Stochastic Optimization

Attention is All you Need

Are we ready for autonomous driving? The KITTI vision benchmark suite

Non-local Neural Networks

A taxonomy and evaluation of dense two-frame stereo correspondence algorithms

Related Papers (5)

Deep Residual Learning for Image Recognition

Learning a Deep Convolutional Network for Image Super-Resolution

Image Super-Resolution Using Deep Convolutional Networks

Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network

Adam: A Method for Stochastic Optimization