Home
/
Authors
/
Wen Lu

Author

Wen Lu

Other affiliations: Hong Kong Polytechnic University

Bio: Wen Lu is an academic researcher from Xidian University. The author has contributed to research in topics: Watermark & Visual perception. The author has an hindex of 9, co-authored 15 publications receiving 259 citations. Previous affiliations of Wen Lu include Hong Kong Polytechnic University.

Topics: Watermark, Visual perception, Peripheral vision, Hidden Markov model, Foveal ...read more

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

NTIRE 2019 Challenge on Real Image Super-Resolution: Methods and Results

[...]

Jianrui Cai¹, Shuhang Gu¹, Radu Timofte¹, Lei Zhang¹, Xiao Liu¹, Ding Yukang¹, Dongliang He¹, Chao Li¹, Yi Fu¹, Shilei Wen¹, Ruicheng Feng¹, Jinjin Gu¹, Yu Qiao¹, Chao Dong¹, Dongwon Park¹, Se Young Chun¹, Sanghoon Yoon¹, Junhyung Kwak¹, Donghee Son¹, Syed Waqas Zamir¹, Aditya Arora¹, Salman H. Khan¹, Fahad Shahbaz Khan¹, Ling Shao¹, Zhengping Wei¹, Lei Liu¹, Hong Cai¹, Darui Li¹, Fujie Gao¹, Zheng Hui¹, Xiumei Wang¹, Xinbo Gao¹, Guoan Cheng¹, Ai Matsune¹, Qiuyu Li¹, Leilei Zhu¹, Huaijuan Zang¹, Shu Zhan¹, Yajun Qiu¹, Ruxin wang¹, Jiawei Li¹, Yongcheng Jing¹, Mingli Song¹, Pengju Liu¹, Kai Zhang¹, Jingdong Liu¹, Jiye Liu¹, Hongzhi Zhang¹, Wangmeng Zuo¹, Wenyi Tang¹, Jing Liu¹, Youngjung Kim¹, Changyeop Shin¹, Minbeom Kim¹, Sungho Kim¹, Pablo Navarrete Michelini¹, Hanwen Liu¹, Dan Zhu¹, Xuan Xu¹, Xin Li¹, Furui Bai¹, Xiaopeng Sun¹, Lin Zha¹, Yuanfei Huang¹, Wen Lu¹, Yanpeng Cao¹, Du Chen¹, Zewei He¹, Sun Anshun¹, Siliang Tang¹, Fan Hongfei¹, Xiang Li¹, Li Guo¹, Zhang Wenjie¹, Zhang Yumei¹, Qingwen He¹, Jinghui Qin¹, Lishan Huang¹, Yukai Shi¹, Pengxu Wei¹, Wushao Wen¹, Liang Lin¹, Jun Yu¹, Guochen Xie¹, Mengyan Li¹, Rong Chen¹, Xiaotong Luo¹, Chen Hong¹, Yanyun Qu¹, Cuihua Li¹, Zhi-Song Liu¹, Li-Wen Wang¹, Chu-Tak Li¹, Can Zhao¹, Bowen Li¹, Chung-Chi Tsai¹, Shang-Chih Chuang¹, Joon-Hee Choi¹, Joon-Soo Kim¹, Xiaoyun Jiang¹, Ze Pan¹, Qunbo Lv¹, Zheng Tan¹, Peidong He¹ - Show less +100 more•Institutions (1)

Hong Kong Polytechnic University¹

16 Jun 2019

TL;DR: The 3rd NTIRE challenge on single-image super-resolution (restoration of rich details in a low-resolution image) is reviewed with a focus on proposed solutions and results and the state-of-the-art in real-world single image super- resolution.

...read moreread less

Abstract: This paper reviewed the 3rd NTIRE challenge on single-image super-resolution (restoration of rich details in a low-resolution image) with a focus on proposed solutions and results. The challenge had 1 track, which was aimed at the real-world single image super-resolution problem with an unknown scaling factor. Participants were mapping low-resolution images captured by a DSLR camera with a shorter focal length to their high-resolution images captured at a longer focal length. With this challenge, we introduced a novel real-world super-resolution dataset (RealSR). The track had 403 registered participants, and 36 teams competed in the final testing phase. They gauge the state-of-the-art in real-world single image super-resolution.

...read moreread less

118 citations

Journal Article•DOI•

A Gated Peripheral-Foveal Convolutional Neural Network for Unified Image Aesthetic Prediction

[...]

Xiaodan Zhang¹, Xinbo Gao¹, Wen Lu¹, Lihuo He¹•Institutions (1)

Xidian University¹

15 Apr 2019-IEEE Transactions on Multimedia

TL;DR: Zhang et al. as discussed by the authors proposed a gated peripheral-foveal convolutional neural network (GPFCN) to learn fine-grained details for image aesthetic assessment.

...read moreread less

Abstract: Learning fine-grained details is a key issue in image aesthetic assessment. Most of the previous methods extract the fine-grained details via random cropping strategy, which may undermine the integrity of semantic information. Extensive studies show that humans perceive fine-grained details with a mixture of foveal vision and peripheral vision. Fovea has the highest possible visual acuity and is responsible for seeing the details. The peripheral vision is used for perceiving the broad spatial scene and selecting the attended regions for the fovea. Inspired by these observations, we propose a gated peripheral-foveal convolutional neural network. It is a dedicated double-subnet neural network (i.e., a peripheral subnet and a foveal subnet). The former aims to mimic the functions of peripheral vision to encode the holistic information and provide the attended regions. The latter aims to extract fine-grained features on these key regions. Considering that the peripheral vision and foveal vision play different roles in processing different visual stimuli, we further employ a gated information fusion network to weigh their contributions. The weights are determined through the fully connected layers followed by a sigmoid function. We conduct comprehensive experiments on the standard Aesthetic Visual Analysis (AVA) dataset and Photo.net dataset for unified aesthetic prediction tasks: 1) aesthetic quality classification; 2) aesthetic score regression; and 3) aesthetic score distribution prediction. The experimental results demonstrate the effectiveness of the proposed method.

...read moreread less

51 citations

Posted Content•

A Gated Peripheral-Foveal Convolutional Neural Network for Unified Image Aesthetic Prediction

[...]

Xiaodan Zhang¹, Xinbo Gao¹, Wen Lu¹, Lihuo He¹•Institutions (1)

Xidian University¹

19 Dec 2018-arXiv: Computer Vision and Pattern Recognition

TL;DR: A gated peripheral-foveal convolutional neural network that aims to mimic the functions of peripheral vision to encode the holistic information and provide the attended regions for the fovea and a gated information fusion network to weigh their contributions.

...read moreread less

Abstract: Learning fine-grained details is a key issue in image aesthetic assessment. Most of the previous methods extract the fine-grained details via random cropping strategy, which may undermine the integrity of semantic information. Extensive studies show that humans perceive fine-grained details with a mixture of foveal vision and peripheral vision. Fovea has the highest possible visual acuity and is responsible for seeing the details. The peripheral vision is used for perceiving the broad spatial scene and selecting the attended regions for the fovea. Inspired by these observations, we propose a Gated Peripheral-Foveal Convolutional Neural Network (GPF-CNN). It is a dedicated double-subnet neural network, i.e. a peripheral subnet and a foveal subnet. The former aims to mimic the functions of peripheral vision to encode the holistic information and provide the attended regions. The latter aims to extract fine-grained features on these key regions. Considering that the peripheral vision and foveal vision play different roles in processing different visual stimuli, we further employ a gated information fusion (GIF) network to weight their contributions. The weights are determined through the fully connected layers followed by a sigmoid function. We conduct comprehensive experiments on the standard AVA and this http URL datasets for unified aesthetic prediction tasks: (i) aesthetic quality classification; (ii) aesthetic score regression; and (iii) aesthetic score distribution prediction. The experimental results demonstrate the effectiveness of the proposed method.

...read moreread less

41 citations

Journal Article•DOI•

Interpretable Detail-Fidelity Attention Network for Single Image Super-Resolution.

[...]

Yuanfei Huang¹, Jie Li¹, Xinbo Gao¹, Yanting Hu², Wen Lu¹ - Show less +1 more•Institutions (2)

Xidian University¹, Xinjiang Medical University²

28 Sep 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: A purposeful and interpretable detail-fidelity attention network to progressively process these smoothes and details in a divide-and-conquer manner, which is a novel and specific prospect of image super-resolution for the purpose of improving detail fidelity.

...read moreread less

Abstract: Benefiting from the strong capabilities of deep CNNs for feature representation and nonlinear mapping, deep-learning-based methods have achieved excellent performance in single image super-resolution. However, most existing SR methods depend on the high capacity of networks which is initially designed for visual recognition, and rarely consider the initial intention of super-resolution for detail fidelity. Aiming at pursuing this intention, there are two challenging issues to be solved: (1) learning appropriate operators which is adaptive to the diverse characteristics of smoothes and details; (2) improving the ability of model to preserve the low-frequency smoothes and reconstruct the high-frequency details. To solve them, we propose a purposeful and interpretable detail-fidelity attention network to progressively process these smoothes and details in divide-and-conquer manner, which is a novel and specific prospect of image super-resolution for the purpose on improving the detail fidelity, instead of blindly designing or employing the deep CNNs architectures for merely feature representation in local receptive fields. Particularly, we propose a Hessian filtering for interpretable feature representation which is high-profile for detail inference, a dilated encoder-decoder and a distribution alignment cell to improve the inferred Hessian features in morphological manner and statistical manner respectively. Extensive experiments demonstrate that the proposed methods achieve superior performances over the state-of-the-art methods quantitatively and qualitatively. Code is available at this https URL.

...read moreread less

28 citations

Patent•

Improved visual attention model-based method of natural scene object detection

[...]

Gao Xinbo, Han Bing, Li Jie, Deng Cheng, Wen Lu, Tian Chunna, Wang Xiumei, Wang Ying - Show less +4 more

05 Dec 2012

26 citations

Cited by

PDF

Open Access

More filters

Patent•

Information processing apparatus, information processing method and program

[...]

Fuminori Homma¹, Tatsushi Nashida¹•Institutions (1)

Sony Broadcast & Professional Research Laboratories¹

31 Aug 2011

TL;DR: In this article, a method for modifying an image is presented, which consists of displaying an image, the image comprising a portion of an object; determining if an edge of the object is in a location within the portion; and detecting movement in a member direction, of an operating member with respect to the edge.

...read moreread less

Abstract: A method is provided for modifying an image. The method comprises displaying an image, the image comprising a portion of an object; and determining if an edge of the object is in a location within the portion. The method further comprises detecting movement, in a member direction, of an operating member with respect to the edge. The method still further comprises moving, if the edge is not in the location, the object in an object direction corresponding to the detected movement; and modifying, if the edge is in the location, the image in response to the detected movement, the modified image comprising the edge in the location.

...read moreread less

434 citations

Book Chapter•DOI•

Learning Enriched Features for Real Image Restoration and Enhancement

[...]

Syed Waqas Zamir, Aditya Arora, Salman Khan¹, Munawar Hayat¹, Fahad Shahbaz Khan¹, Ming-Hsuan Yang², Ling Shao¹ - Show less +3 more•Institutions (2)

Zayed University¹, University of California, Merced²

23 Aug 2020

TL;DR: MIRNet as mentioned in this paper proposes a multi-scale residual block containing several key elements: (a) parallel multi-resolution convolution streams for extracting mult-scale features, (b) information exchange across the multiresolution streams, (c) spatial and channel attention mechanisms for capturing contextual information, and (d) attention-based multiscale feature aggregation.

...read moreread less

Abstract: With the goal of recovering high-quality image content from its degraded version, image restoration enjoys numerous applications, such as in surveillance, computational photography and medical imaging. Recently, convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration task. Existing CNN-based methods typically operate either on full-resolution or on progressively low-resolution representations. In the former case, spatially precise but contextually less robust results are achieved, while in the latter case, semantically reliable but spatially less accurate outputs are generated. In this paper, we present an architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network and receiving strong contextual information from the low-resolution representations. The core of our approach is a multi-scale residual block containing several key elements: (a) parallel multi-resolution convolution streams for extracting multi-scale features, (b) information exchange across the multi-resolution streams, (c) spatial and channel attention mechanisms for capturing contextual information, and (d) attention based multi-scale feature aggregation. In a nutshell, our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details. Extensive experiments on five real image benchmark datasets demonstrate that our method, named as MIRNet, achieves state-of-the-art results for image denoising, super-resolution, and image enhancement. The source code and pre-trained models are available at https://github.com/swz30/MIRNet.

...read moreread less

357 citations

Proceedings Article•DOI•

NTIRE 2019 Challenge on Video Deblurring and Super-Resolution: Dataset and Study

[...]

Seungjun Nah, Sungyong Baik, Seokil Hong, Gyeongsik Moon, Sanghyun Son, Radu Timofte¹, Kyoung Mu Lee - Show less +3 more•Institutions (1)

ETH Zurich¹

16 Jun 2019

TL;DR: It is found that the NTIRE 2019 challenges push the state-of-the-art in video deblurring and super-resolution, reaching compelling performance on the newly proposed REDS dataset.

...read moreread less

Abstract: This paper introduces a novel large dataset for video deblurring, video super-resolution and studies the state-of-the-art as emerged from the NTIRE 2019 video restoration challenges. The video deblurring and video super-resolution challenges are each the first challenge of its kind, with 4 competitions, hundreds of participants and tens of proposed solutions. Our newly collected REalistic and Diverse Scenes dataset (REDS) was employed by the challenges. In our study, we compare the solutions from the challenges to a set of representative methods from the literature and evaluate them on our proposed REDS dataset. We find that the NTIRE 2019 challenges push the state-of-the-art in video deblurring and super-resolution, reaching compelling performance on our newly proposed REDS dataset.

...read moreread less

328 citations

Proceedings Article•DOI•

Toward Real-World Single Image Super-Resolution: A New Benchmark and a New Model

[...]

Jianrui Cai¹, Hui Zeng¹, Hongwei Yong¹, Zisheng Cao, Lei Zhang¹ - Show less +1 more•Institutions (1)

Hong Kong Polytechnic University¹

01 Oct 2019

TL;DR: Li et al. as mentioned in this paper proposed a Laplacian pyramid based kernel prediction network (LP-KPN), which efficiently learns per-pixel kernels to recover the HR image, which achieved better visual quality with sharper edges and finer textures on real-world scenes.

...read moreread less

Abstract: Most of the existing learning-based single image super-resolution (SISR) methods are trained and evaluated on simulated datasets, where the low-resolution (LR) images are generated by applying a simple and uniform degradation (i.e., bicubic downsampling) to their high-resolution (HR) counterparts. However, the degradations in real-world LR images are far more complicated. As a consequence, the SISR models trained on simulated data become less effective when applied to practical scenarios. In this paper, we build a real-world super-resolution (RealSR) dataset where paired LR-HR images on the same scene are captured by adjusting the focal length of a digital camera. An image registration algorithm is developed to progressively align the image pairs at different resolutions. Considering that the degradation kernels are naturally non-uniform in our dataset, we present a Laplacian pyramid based kernel prediction network (LP-KPN), which efficiently learns per-pixel kernels to recover the HR image. Our extensive experiments demonstrate that SISR models trained on our RealSR dataset deliver better visual quality with sharper edges and finer textures on real-world scenes than those trained on simulated datasets. Though our RealSR dataset is built by using only two cameras (Canon 5D3 and Nikon D810), the trained model generalizes well to other camera devices such as Sony a7II and mobile phones.

...read moreread less

318 citations

Journal Article•DOI•

A Deep Journey into Super-resolution: A Survey

[...]

Saeed Anwar¹, Salman Khan, Nick Barnes¹•Institutions (1)

Australian National University¹

28 May 2020-ACM Computing Surveys

TL;DR: Deep convolutional networks–based super-resolution is a fast-growing field with numerous practical applications and this exposition extensively compare more than 30 state-of-the-art super-resolves.

...read moreread less

Abstract: Deep convolutional networks–based super-resolution is a fast-growing field with numerous practical applications. In this exposition, we extensively compare more than 30 state-of-the-art super-resolution Convolutional Neural Networks (CNNs) over three classical and three recently introduced challenging datasets to benchmark single image super-resolution. We introduce a taxonomy for deep learning–based super-resolution networks that groups existing methods into nine categories including linear, residual, multi-branch, recursive, progressive, attention-based, and adversarial designs. We also provide comparisons between the models in terms of network complexity, memory footprint, model input and output, learning details, the type of network losses, and important architectural differences (e.g., depth, skip-connections, filters). The extensive evaluation performed shows the consistent and rapid growth in the accuracy in the past few years along with a corresponding boost in model complexity and the availability of large-scale datasets. It is also observed that the pioneering methods identified as the benchmarks have been significantly outperformed by the current contenders. Despite the progress in recent years, we identify several shortcomings of existing techniques and provide future research directions towards the solution of these open problems. Datasets and codes for evaluation are publicly available at https://github.com/saeed-anwar/SRsurvey.

...read moreread less

162 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66

Collapse