AIM 2020: Scene Relighting and Illumination Estimation Challenge

Posted Content•

Majed El Helou, Ruofan Zhou, Sabine Süsstrunk, Radu Timofte, Mahmoud Afifi, Michael S. Brown, Kele Xu, Hengxing Cai, Yuzhong Liu, Li-Wen Wang, Zhi-Song Liu, Chu-Tak Li, Sourya Dipta Das, Nisarg Shah, Akashdeep Jassal, Tongtong Zhao, Shanshan Zhao, Sabari Nathan, M. Parisa Beham, R. Suganya, Qing Wang, Zhongyun Hu, Xin Huang, Yaning Li, Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan, Densen Puthussery, Hrishikesh P S, Melvin Kuriakose, C. V. Jiji, Yu Zhu, Liping Dong, Zhuolong Jiang, Chenghua Li, Cong Leng, Jian Cheng

27 Sep 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: The novel VIDIT dataset used in the AIM 2020 challenge and the different proposed solutions and final evaluation results over the 3 challenge tracks are presented.

Abstract: We review the AIM 2020 challenge on virtual image relighting and illumination estimation. This paper presents the novel VIDIT dataset used in the challenge and the different proposed solutions and final evaluation results over the 3 challenge tracks. The first track considered one-to-one relighting; the objective was to relight an input photo of a scene with a different color temperature and illuminant orientation (i.e., light source position). The goal of the second track was to estimate illumination settings, namely the color temperature and orientation, from a given image. Lastly, the third track dealt with any-to-any relighting, thus a generalization of the first track. The target color temperature and orientation, rather than being pre-determined, are instead given by a guide image. Participants were allowed to make use of their track 1 and 2 solutions for track 3. The tracks had 94, 52, and 56 registered participants, respectively, leading to 20 confirmed submissions in the final competition stage.

Citations

Posted Content•

Efficient Image Super-Resolution Using Pixel Attention

Hengyuan Zhao¹, Xiangtao Kong¹, Jingwen He¹, Yu Qiao¹, Chao Dong¹

Chinese Academy of Sciences¹

02 Oct 2020-arXiv: Image and Video Processing

TL;DR: This work designs a lightweight convolutional neural network for image super resolution with a newly proposed pixel attention scheme; it achieves performance similar to the lightweight networks SRResNet and CARN with only 272K parameters.

Abstract: This work aims at designing a lightweight convolutional neural network for image super resolution (SR). With simplicity in mind, we construct a concise and effective network with a newly proposed pixel attention scheme. Pixel attention (PA) is similar to channel attention and spatial attention in formulation. The difference is that PA produces 3D attention maps instead of a 1D attention vector or a 2D map. This attention scheme introduces fewer additional parameters but generates better SR results. On the basis of PA, we propose two building blocks for the main branch and the reconstruction branch, respectively. The first one, the SC-PA block, has the same structure as the Self-Calibrated convolution but with our PA layer. This block is much more efficient than conventional residual/dense blocks, owing to its two-branch architecture and attention scheme. The second one, the UPA block, combines nearest-neighbor upsampling, convolution, and PA layers. It improves the final reconstruction quality with little parameter cost. Our final model, PAN, achieves performance similar to the lightweight networks SRResNet and CARN, but with only 272K parameters (17.92% of SRResNet and 17.09% of CARN). The effectiveness of each proposed component is also validated by an ablation study. The code is available at this https URL.
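
To make the "3D attention map" idea concrete, here is a minimal pure-Python sketch. It assumes the attention map is produced by a 1x1 convolution followed by a sigmoid and then multiplied element-wise with the input; the abstract does not spell out these details, and the function name `pixel_attention` and the nested-list layout are hypothetical choices for illustration only.

```python
import math


def sigmoid(z):
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def pixel_attention(x, weight, bias):
    """Illustrative pixel attention on a feature map stored as nested lists.

    x:      feature map shaped [C][H][W]
    weight: 1x1-convolution weights shaped [C][C] (assumed layer form)
    bias:   per-channel bias shaped [C]

    Returns (attended features, attention map), both shaped [C][H][W].
    """
    C, H, W = len(x), len(x[0]), len(x[0][0])
    attn = [[[0.0] * W for _ in range(H)] for _ in range(C)]
    out = [[[0.0] * W for _ in range(H)] for _ in range(C)]
    for o in range(C):
        for i in range(H):
            for j in range(W):
                # A 1x1 convolution mixes channels at a single pixel only.
                z = bias[o] + sum(weight[o][c] * x[c][i][j] for c in range(C))
                # One attention weight per channel AND per pixel: a 3D map,
                # unlike a 1D channel vector [C] or a 2D spatial map [H][W].
                attn[o][i][j] = sigmoid(z)
                out[o][i][j] = x[o][i][j] * attn[o][i][j]
    return out, attn
```

With all-zero weights and biases every attention value is 0.5, so the output is simply the input halved; this is a convenient sanity check that the attention map has the same shape as the features.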

128 citations

Proceedings Article•DOI•

NTIRE 2019 Challenge on Real Image Super-Resolution: Methods and Results

Jianrui Cai¹, Shuhang Gu¹, Radu Timofte¹, Lei Zhang¹, Xiao Liu¹, Ding Yukang¹, Dongliang He¹, Chao Li¹, Yi Fu¹, Shilei Wen¹, Ruicheng Feng¹, Jinjin Gu¹, Yu Qiao¹, Chao Dong¹, Dongwon Park¹, Se Young Chun¹, Sanghoon Yoon¹, Junhyung Kwak¹, Donghee Son¹, Syed Waqas Zamir¹, Aditya Arora¹, Salman H. Khan¹, Fahad Shahbaz Khan¹, Ling Shao¹, Zhengping Wei¹, Lei Liu¹, Hong Cai¹, Darui Li¹, Fujie Gao¹, Zheng Hui¹, Xiumei Wang¹, Xinbo Gao¹, Guoan Cheng¹, Ai Matsune¹, Qiuyu Li¹, Leilei Zhu¹, Huaijuan Zang¹, Shu Zhan¹, Yajun Qiu¹, Ruxin Wang¹, Jiawei Li¹, Yongcheng Jing¹, Mingli Song¹, Pengju Liu¹, Kai Zhang¹, Jingdong Liu¹, Jiye Liu¹, Hongzhi Zhang¹, Wangmeng Zuo¹, Wenyi Tang¹, Jing Liu¹, Youngjung Kim¹, Changyeop Shin¹, Minbeom Kim¹, Sungho Kim¹, Pablo Navarrete Michelini¹, Hanwen Liu¹, Dan Zhu¹, Xuan Xu¹, Xin Li¹, Furui Bai¹, Xiaopeng Sun¹, Lin Zha¹, Yuanfei Huang¹, Wen Lu¹, Yanpeng Cao¹, Du Chen¹, Zewei He¹, Sun Anshun¹, Siliang Tang¹, Fan Hongfei¹, Xiang Li¹, Li Guo¹, Zhang Wenjie¹, Zhang Yumei¹, Qingwen He¹, Jinghui Qin¹, Lishan Huang¹, Yukai Shi¹, Pengxu Wei¹, Wushao Wen¹, Liang Lin¹, Jun Yu¹, Guochen Xie¹, Mengyan Li¹, Rong Chen¹, Xiaotong Luo¹, Chen Hong¹, Yanyun Qu¹, Cuihua Li¹, Zhi-Song Liu¹, Li-Wen Wang¹, Chu-Tak Li¹, Can Zhao¹, Bowen Li¹, Chung-Chi Tsai¹, Shang-Chih Chuang¹, Joon-Hee Choi¹, Joon-Soo Kim¹, Xiaoyun Jiang¹, Ze Pan¹, Qunbo Lv¹, Zheng Tan¹, Peidong He¹

Hong Kong Polytechnic University¹

16 Jun 2019

TL;DR: The 3rd NTIRE challenge on single-image super-resolution (restoration of rich details in a low-resolution image) is reviewed, with a focus on the proposed solutions and results, which gauge the state-of-the-art in real-world single image super-resolution.

Abstract: This paper reviewed the 3rd NTIRE challenge on single-image super-resolution (restoration of rich details in a low-resolution image) with a focus on proposed solutions and results. The challenge had 1 track, which was aimed at the real-world single image super-resolution problem with an unknown scaling factor. Participants were mapping low-resolution images captured by a DSLR camera with a shorter focal length to their high-resolution images captured at a longer focal length. With this challenge, we introduced a novel real-world super-resolution dataset (RealSR). The track had 403 registered participants, and 36 teams competed in the final testing phase. They gauge the state-of-the-art in real-world single image super-resolution.

118 citations

Posted Content•

AIM 2020 Challenge on Efficient Super-Resolution: Methods and Results

Kai Zhang, Martin Danelljan, Yawei Li, Radu Timofte, Jie Liu, Jie Tang, Gangshan Wu, Yu Zhu, Xiangyu He, Wenjie Xu, Chenghua Li, Cong Leng, Jian Cheng, Guangyang Wu, Wenyi Wang, Xiaohong Liu, Hengyuan Zhao, Xiangtao Kong, Jingwen He, Yu Qiao, Chao Dong, Xiaotong Luo, Liang Chen, Jiangtao Zhang, Maitreya Suin, Kuldeep Purohit, A. N. Rajagopalan, Xiaochuan Li, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Abdul Muqeet, Jiwon Hwang, Subin Yang, JungHeum Kang, Sung-Ho Bae, Yongwoo Kim, Yanyun Qu, Geun-Woo Jeon, Jun-Ho Choi, Jun-Hyuk Kim, Jong-Seok Lee, Steven Marty, Eric Marty, Dongliang Xiong, Siang Chen, Lin Zha, Jiande Jiang, Xinbo Gao, Wen Lu, Haicheng Wang, Vineeth Bhaskara, Alex Levinshtein, Stavros Tsogkas, Allan D. Jepson, Xiangzhen Kong, Tongtong Zhao, Shanshan Zhao, Hrishikesh P S, Densen Puthussery, C. V. Jiji, Nan Nan, Shuai Liu, Jie Cai, Zibo Meng, Jiaming Ding, Chiu Man Ho, Xuehui Wang, Qiong Yan, Yuzhi Zhao, Long Chen, Long Sun, Wenhao Wang, Zhenbing Liu, Rushi Lan, Rao Muhammad Umer, Christian Micheloni

15 Sep 2020-arXiv: Image and Video Processing

TL;DR: This paper reviews the AIM 2020 challenge on efficient single image super-resolution, with a focus on the proposed solutions and results; the task was to super-resolve an input image with a magnification factor of x4 based on a set of prior examples of low and corresponding high resolution images.

Abstract: This paper reviews the AIM 2020 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The challenge task was to super-resolve an input image with a magnification factor x4 based on a set of prior examples of low and corresponding high resolution images. The goal is to devise a network that reduces one or several aspects such as runtime, parameter count, FLOPs, activations, and memory consumption while at least maintaining PSNR of MSRResNet. The track had 150 registered participants, and 25 teams submitted the final results. They gauge the state-of-the-art in efficient single image super-resolution.

55 citations

Cites background from "AIM 2020: Scene Relighting and Illu..."

...This challenge is one of the AIM 2020 associated challenges on: scene relighting and illumination estimation [11], image extreme inpainting [40], learned image signal processing pipeline [19], rendering realistic bokeh [20], real image super-resolution [51], efficient super-resolution [58], video temporal super-resolution [44] and video extreme super-resolution [12]....

Posted Content•

AIM 2020 Challenge on Learned Image Signal Processing Pipeline

Andrey Ignatov¹, Radu Timofte¹, Zhilu Zhang², Ming Liu², Haolin Wang², Wangmeng Zuo², Jiawei Zhang, Ruimao Zhang, Zhanglin Peng, Sijie Ren, Linhui Dai³, Xiaohong Liu³, Chengqi Li³, Jun Chen³, Yuichi Ito, Bhavya Vasudeva⁴, Puneesh Deora⁴, Umapada Pal⁴, Zhenyu Guo⁵, Yu Zhu⁵, Tian Liang⁵, Chenghua Li⁵, Cong Leng⁵, Zhihong Pan⁶, Baopu Li⁶, Byung-Hoon Kim⁷, Joonyoung Song⁷, Jong Chul Ye⁷, JaeHyun Baek⁸, Magauiya Zhussip, Yeskendir Koishekenov, Hwechul Cho Ye, Xin Liu, Xueying Hu, Jun Jiang, Jinwei Gu, Kai Li⁹, Pengliang Tan⁹, Bingxin Hou¹⁰

ETH Zurich¹, Harbin Institute of Technology², McMaster University³, Indian Statistical Institute⁴, Chinese Academy of Sciences⁵, Baidu⁶, KAIST⁷, Amazon.com⁸, Beijing University of Posts and Telecommunications⁹, Santa Clara University¹⁰

10 Nov 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper reviews the second AIM learned ISP challenge and provides the description of the proposed solutions and results, defining the state-of-the-art for practical image signal processing pipeline modeling.

Abstract: This paper reviews the second AIM learned ISP challenge and provides the description of the proposed solutions and results. The participating teams were solving a real-world RAW-to-RGB mapping problem, where the goal was to map the original low-quality RAW images captured by the Huawei P20 device to the same photos obtained with the Canon 5D DSLR camera. The considered task embraced a number of complex computer vision subtasks, such as image demosaicing, denoising, white balancing, color and contrast correction, demoireing, etc. The target metric used in this challenge combined fidelity scores (PSNR and SSIM) with solutions' perceptual results measured in a user study. The proposed solutions significantly improved the baseline results, defining the state-of-the-art for practical image signal processing pipeline modeling.

44 citations

Book Chapter•DOI•

AIM 2020: Scene Relighting and Illumination Estimation Challenge

Majed El Helou¹, Ruofan Zhou¹, Sabine Süsstrunk¹, Radu Timofte², Mahmoud Afifi³, Michael S. Brown³, Kele Xu⁴, Hengxing Cai⁴, Yuzhong Liu⁴, Li-Wen Wang⁵, Zhi-Song Liu⁵, Chu-Tak Li⁵, Sourya Dipta Das⁶, Nisarg Shah⁷, Akashdeep Jassal⁸, Tongtong Zhao⁹, Shanshan Zhao, Sabari Nathan, M. Parisa Beham¹⁰, R. Suganya¹¹, Qing Wang¹², Zhongyun Hu¹², Xin Huang¹², Yaning Li¹², Maitreya Suin¹³, Kuldeep Purohit¹³, A. N. Rajagopalan¹³, Densen Puthussery¹⁴, P. S. Hrishikesh¹⁴, Melvin Kuriakose¹⁴, C. V. Jiji¹⁴, Yu Zhu¹⁵, Liping Dong¹⁵, Zhuolong Jiang¹⁵, Chenghua Li¹⁵, Cong Leng¹⁵, Jian Cheng¹⁵

École Polytechnique Fédérale de Lausanne¹, ETH Zurich², York University³, National University of Defense Technology⁴, Hong Kong Polytechnic University⁵, Jadavpur University⁶, Indian Institute of Technology, Jodhpur⁷, PEC University of Technology⁸, Dalian Maritime University⁹, Sethu Institute of Technology¹⁰, Thiagarajar College of Engineering¹¹, Northwestern Polytechnical University¹², Indian Institute of Technology Madras¹³, College of Engineering, Trivandrum¹⁴, Chinese Academy of Sciences¹⁵

23 Aug 2020

TL;DR: This paper reviews the AIM 2020 challenge on virtual image relighting and illumination estimation, whose first track considered one-to-one relighting, where the objective was to relight an input photo of a scene with a different color temperature and illuminant orientation.

39 citations

References

Proceedings Article•DOI•

Deep Residual Learning for Image Recognition

Kaiming He¹, Xiangyu Zhang¹, Shaoqing Ren¹, Jian Sun¹

Microsoft¹

27 Jun 2016

TL;DR: This paper proposes a residual learning framework to ease the training of networks that are substantially deeper than those used previously; an ensemble of these residual nets won 1st place on the ILSVRC 2015 classification task.

Abstract: Deeper neural networks are more difficult to train. We present a residual learning framework to ease the training of networks that are substantially deeper than those used previously. We explicitly reformulate the layers as learning residual functions with reference to the layer inputs, instead of learning unreferenced functions. We provide comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth. On the ImageNet dataset we evaluate residual nets with a depth of up to 152 layers, 8× deeper than VGG nets [40] but still having lower complexity. An ensemble of these residual nets achieves 3.57% error on the ImageNet test set. This result won the 1st place on the ILSVRC 2015 classification task. We also present analysis on CIFAR-10 with 100 and 1000 layers. The depth of representations is of central importance for many visual recognition tasks. Solely due to our extremely deep representations, we obtain a 28% relative improvement on the COCO object detection dataset. Deep residual nets are foundations of our submissions to ILSVRC & COCO 2015 competitions, where we also won the 1st places on the tasks of ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation.
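
The phrase "learning residual functions with reference to the layer inputs" can be illustrated with a small sketch: the block outputs F(x) + x, so the layers only need to learn the residual F. The helper name `residual_block` and the use of plain Python lists instead of tensors are assumptions made for illustration; this is not the paper's implementation.

```python
def residual_block(x, residual_fn):
    """Return F(x) + x, where residual_fn plays the role of the learned layers F.

    x:           list of floats (a stand-in for a feature vector)
    residual_fn: callable mapping a list of floats to a list of the same length
    """
    fx = residual_fn(x)
    if len(fx) != len(x):
        raise ValueError("residual branch must preserve the input size")
    # Identity shortcut: the input is added back to the branch output.
    return [f + xi for f, xi in zip(fx, x)]
```

If the residual branch outputs zeros, the block reduces to the identity mapping, which is one intuition for why adding such blocks should not make a deep network harder to fit than a shallower one.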

123,388 citations

Proceedings Article•

Adam: A Method for Stochastic Optimization

Diederik P. Kingma¹, Jimmy Ba²

University of Amsterdam¹, University of Toronto²

01 Jan 2015

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

Abstract: We introduce Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments. The method is straightforward to implement, is computationally efficient, has little memory requirements, is invariant to diagonal rescaling of the gradients, and is well suited for problems that are large in terms of data and/or parameters. The method is also appropriate for non-stationary objectives and problems with very noisy and/or sparse gradients. The hyper-parameters have intuitive interpretations and typically require little tuning. Some connections to related algorithms, on which Adam was inspired, are discussed. We also analyze the theoretical convergence properties of the algorithm and provide a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework. Empirical results demonstrate that Adam works well in practice and compares favorably to other stochastic optimization methods. Finally, we discuss AdaMax, a variant of Adam based on the infinity norm.
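
As a rough illustration of "adaptive estimates of lower-order moments", the sketch below applies the commonly described Adam update to a single scalar parameter. The function name `adam_minimize` and the scalar setting are assumptions for illustration; the default hyper-parameters shown are the widely used ones, and real training code would normally rely on a library optimizer.

```python
import math


def adam_minimize(grad_fn, theta, steps, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """Minimize a scalar function with Adam, given its gradient grad_fn."""
    m = 0.0  # running estimate of the first moment (mean of gradients)
    v = 0.0  # running estimate of the second moment (mean of squared gradients)
    for t in range(1, steps + 1):
        g = grad_fn(theta)
        m = beta1 * m + (1.0 - beta1) * g
        v = beta2 * v + (1.0 - beta2) * g * g
        # Bias correction compensates for m and v being initialized at zero.
        m_hat = m / (1.0 - beta1 ** t)
        v_hat = v / (1.0 - beta2 ** t)
        theta -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return theta
```

For example, `adam_minimize(lambda x: 2.0 * x, 1.0, steps=500, lr=0.1)` minimizes f(x) = x² starting from x = 1. Because the step is normalized by the square root of the second-moment estimate, the first update has magnitude close to `lr` regardless of the gradient's scale, which matches the abstract's claim of invariance to rescaling of the gradients.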

111,197 citations

"AIM 2020: Scene Relighting and Illu..." refers methods in this paper

...The training uses the Adam optimizer [26] with ℓ1 loss....
...The Adam optimizer [26] is used with cross entropy loss....

Book Chapter•DOI•

U-Net: Convolutional Networks for Biomedical Image Segmentation

Olaf Ronneberger¹, Philipp Fischer¹, Thomas Brox¹

University of Freiburg¹

05 Oct 2015

TL;DR: Ronneberger et al. propose a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently; the network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks.

Abstract: There is large consent that successful training of deep networks requires many thousand annotated training samples. In this paper, we present a network and training strategy that relies on the strong use of data augmentation to use the available annotated samples more efficiently. The architecture consists of a contracting path to capture context and a symmetric expanding path that enables precise localization. We show that such a network can be trained end-to-end from very few images and outperforms the prior best method (a sliding-window convolutional network) on the ISBI challenge for segmentation of neuronal structures in electron microscopic stacks. Using the same network trained on transmitted light microscopy images (phase contrast and DIC) we won the ISBI cell tracking challenge 2015 in these categories by a large margin. Moreover, the network is fast. Segmentation of a 512x512 image takes less than a second on a recent GPU. The full implementation (based on Caffe) and the trained networks are available at http://lmb.informatik.uni-freiburg.de/people/ronneber/u-net .

49,590 citations

Journal Article•DOI•

Image quality assessment: from error visibility to structural similarity

Zhou Wang¹, Alan C. Bovik², Hamid R. Sheikh², Eero P. Simoncelli³

Center for Neural Science¹, University of Texas at Austin², Howard Hughes Medical Institute³

01 Apr 2004-IEEE Transactions on Image Processing

TL;DR: A structural similarity index for image quality assessment, based on the degradation of structural information, is proposed and compared against both subjective ratings and state-of-the-art objective methods on a database of images compressed with JPEG and JPEG2000.

Abstract: Objective methods for assessing perceptual image quality traditionally attempted to quantify the visibility of errors (differences) between a distorted image and a reference image using a variety of known properties of the human visual system. Under the assumption that human visual perception is highly adapted for extracting structural information from a scene, we introduce an alternative complementary framework for quality assessment based on the degradation of structural information. As a specific example of this concept, we develop a structural similarity index and demonstrate its promise through a set of intuitive examples, as well as comparison to both subjective ratings and state-of-the-art objective methods on a database of images compressed with JPEG and JPEG2000. A MATLAB implementation of the proposed algorithm is available online at http://www.cns.nyu.edu/~lcv/ssim/.
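
For readers unfamiliar with the index, the sketch below computes a single-window ("global") SSIM between two equally sized lists of pixel intensities, using the commonly cited form with stabilizing constants C1 = (k1·L)² and C2 = (k2·L)². The function name `global_ssim` is hypothetical, and the simplification to one global window is an assumption: practical implementations typically average the index over local windows.

```python
def global_ssim(x, y, dynamic_range=255.0, k1=0.01, k2=0.03):
    """Single-window SSIM between two flat lists of pixel intensities."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("inputs must have the same length (at least 2)")
    n = len(x)
    mu_x = sum(x) / n
    mu_y = sum(y) / n
    # Unbiased sample variances and covariance.
    var_x = sum((a - mu_x) * (a - mu_x) for a in x) / (n - 1)
    var_y = sum((b - mu_y) * (b - mu_y) for b in y) / (n - 1)
    cov_xy = sum((a - mu_x) * (b - mu_y) for a, b in zip(x, y)) / (n - 1)
    # Small constants keep the ratio stable when means or variances are near zero.
    c1 = (k1 * dynamic_range) ** 2
    c2 = (k2 * dynamic_range) ** 2
    numerator = (2.0 * mu_x * mu_y + c1) * (2.0 * cov_xy + c2)
    denominator = (mu_x * mu_x + mu_y * mu_y + c1) * (var_x + var_y + c2)
    return numerator / denominator
```

Identical inputs score 1, while a uniform brightness shift keeps the structure term intact but lowers the luminance term, so the score drops below 1.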

40,609 citations

Proceedings Article•DOI•

Densely Connected Convolutional Networks

Gao Huang¹, Zhuang Liu², Laurens van der Maaten³, Kilian Q. Weinberger¹

Cornell University¹, Tsinghua University², Facebook³

21 Jul 2017

TL;DR: DenseNet connects each layer to every other layer in a feed-forward fashion, which alleviates the vanishing-gradient problem, strengthens feature propagation, encourages feature reuse, and substantially reduces the number of parameters.

Abstract: Recent work has shown that convolutional networks can be substantially deeper, more accurate, and efficient to train if they contain shorter connections between layers close to the input and those close to the output. In this paper, we embrace this observation and introduce the Dense Convolutional Network (DenseNet), which connects each layer to every other layer in a feed-forward fashion. Whereas traditional convolutional networks with L layers have L connections—one between each layer and its subsequent layer—our network has L(L+1)/2 direct connections. For each layer, the feature-maps of all preceding layers are used as inputs, and its own feature-maps are used as inputs into all subsequent layers. DenseNets have several compelling advantages: they alleviate the vanishing-gradient problem, strengthen feature propagation, encourage feature reuse, and substantially reduce the number of parameters. We evaluate our proposed architecture on four highly competitive object recognition benchmark tasks (CIFAR-10, CIFAR-100, SVHN, and ImageNet). DenseNets obtain significant improvements over the state-of-the-art on most of them, whilst requiring less memory and computation to achieve high performance. Code and pre-trained models are available at https://github.com/liuzhuang13/DenseNet.
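
The L(L+1)/2 figure can be checked by enumerating the connections directly. In the sketch below, node 0 stands for the block input and nodes 1..L for the layers; each layer receives the input and the outputs of all preceding layers. The function name `dense_connections` is a hypothetical helper for illustration.

```python
def dense_connections(num_layers):
    """List every (source, destination) connection in a densely connected block.

    Node 0 is the block input; nodes 1..num_layers are the layers.
    Each layer takes the feature maps of all earlier nodes as input.
    """
    return [
        (src, dst)
        for dst in range(1, num_layers + 1)
        for src in range(dst)
    ]
```

For a block with 5 layers this yields 1 + 2 + 3 + 4 + 5 = 15 connections, which agrees with 5·6/2, compared with only 5 connections in a plain sequential network of the same depth.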

27,821 citations