Author

Wangmeng Zuo

Other affiliations: ETH Zurich, Dalian University of Technology, Harbin Institute of Technology, ...

Bio: Wangmeng Zuo is an academic researcher from Hong Kong Polytechnic University. The author has contributed to research in topics: Image restoration & Real image. The author has an h-index of 7 and has co-authored 12 publications receiving 479 citations. Previous affiliations of Wangmeng Zuo include ETH Zurich and Dalian University of Technology.

Papers

Proceedings Article

Log-Euclidean Kernels for Sparse Representation and Dictionary Learning

Peihua Li¹, Qilong Wang², Wangmeng Zuo³,⁴, Lei Zhang³

Dalian University of Technology¹, Heilongjiang University², Hong Kong Polytechnic University³, Harbin Institute of Technology⁴

01 Dec 2013

TL;DR: This paper proposes a kernel based method for sparse representation (SR) and dictionary learning (DL) of SPD matrices by developing a broad family of kernels that satisfies Mercer's condition and considers the geometric structure in the DL process by updating atom matrices in the Riemannian space.

Abstract: Symmetric positive definite (SPD) matrices have been widely used in image and vision problems. Recently there has been growing interest in studying sparse representation (SR) of SPD matrices, motivated by the great success of SR for vector data. Though the space of SPD matrices is well known to form a Lie group that is a Riemannian manifold, existing work fails to take full advantage of its geometric structure. This paper attempts to tackle this problem by proposing a kernel based method for SR and dictionary learning (DL) of SPD matrices. We disclose that the space of SPD matrices, with the operations of logarithmic multiplication and scalar logarithmic multiplication defined in the Log-Euclidean framework, is a complete inner product space. We can thus develop a broad family of kernels that satisfies Mercer's condition. These kernels characterize the geodesic distance and can be computed efficiently. We also consider the geometric structure in the DL process by updating atom matrices in the Riemannian space instead of in the Euclidean space. The proposed method is evaluated on various vision problems and shows notable performance gains over state-of-the-art methods.
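
As a rough illustration of the idea in this abstract, the sketch below computes one possible kernel on SPD matrices in the Log-Euclidean framework: a Gaussian kernel on the Frobenius distance between matrix logarithms. The Gaussian form, the function names, and the `sigma` parameter are assumptions made for illustration; the abstract does not specify which kernels the paper defines.

```python
import numpy as np

def spd_log(X):
    """Matrix logarithm of a symmetric positive definite matrix.

    Uses the eigendecomposition X = V diag(w) V^T, so log(X) = V diag(log w) V^T.
    """
    w, V = np.linalg.eigh(X)
    return (V * np.log(w)) @ V.T

def log_euclidean_gaussian_kernel(X, Y, sigma=1.0):
    """Gaussian kernel on the Log-Euclidean distance (illustrative choice, an assumption)."""
    d2 = np.linalg.norm(spd_log(X) - spd_log(Y), ord="fro") ** 2
    return np.exp(-d2 / (2.0 * sigma ** 2))

def gram_matrix(mats, sigma=1.0):
    """Kernel (Gram) matrix for a list of SPD matrices."""
    logs = [spd_log(M) for M in mats]  # take each logarithm once
    n = len(logs)
    K = np.empty((n, n))
    for i in range(n):
        for j in range(n):
            d2 = np.linalg.norm(logs[i] - logs[j], ord="fro") ** 2
            K[i, j] = np.exp(-d2 / (2.0 * sigma ** 2))
    return K
```

Because the matrix logarithm maps SPD matrices into the vector space of symmetric matrices, a Gaussian kernel applied there is an ordinary Gaussian kernel in a Euclidean space, which is why the resulting Gram matrix is positive semidefinite.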

139 citations

Proceedings Article

NTIRE 2019 Challenge on Real Image Super-Resolution: Methods and Results

Jianrui Cai¹, Shuhang Gu¹, Radu Timofte¹, Lei Zhang¹, Xiao Liu¹, Ding Yukang¹, Dongliang He¹, Chao Li¹, Yi Fu¹, Shilei Wen¹, Ruicheng Feng¹, Jinjin Gu¹, Yu Qiao¹, Chao Dong¹, Dongwon Park¹, Se Young Chun¹, Sanghoon Yoon¹, Junhyung Kwak¹, Donghee Son¹, Syed Waqas Zamir¹, Aditya Arora¹, Salman H. Khan¹, Fahad Shahbaz Khan¹, Ling Shao¹, Zhengping Wei¹, Lei Liu¹, Hong Cai¹, Darui Li¹, Fujie Gao¹, Zheng Hui¹, Xiumei Wang¹, Xinbo Gao¹, Guoan Cheng¹, Ai Matsune¹, Qiuyu Li¹, Leilei Zhu¹, Huaijuan Zang¹, Shu Zhan¹, Yajun Qiu¹, Ruxin Wang¹, Jiawei Li¹, Yongcheng Jing¹, Mingli Song¹, Pengju Liu¹, Kai Zhang¹, Jingdong Liu¹, Jiye Liu¹, Hongzhi Zhang¹, Wangmeng Zuo¹, Wenyi Tang¹, Jing Liu¹, Youngjung Kim¹, Changyeop Shin¹, Minbeom Kim¹, Sungho Kim¹, Pablo Navarrete Michelini¹, Hanwen Liu¹, Dan Zhu¹, Xuan Xu¹, Xin Li¹, Furui Bai¹, Xiaopeng Sun¹, Lin Zha¹, Yuanfei Huang¹, Wen Lu¹, Yanpeng Cao¹, Du Chen¹, Zewei He¹, Sun Anshun¹, Siliang Tang¹, Fan Hongfei¹, Xiang Li¹, Li Guo¹, Zhang Wenjie¹, Zhang Yumei¹, Qingwen He¹, Jinghui Qin¹, Lishan Huang¹, Yukai Shi¹, Pengxu Wei¹, Wushao Wen¹, Liang Lin¹, Jun Yu¹, Guochen Xie¹, Mengyan Li¹, Rong Chen¹, Xiaotong Luo¹, Chen Hong¹, Yanyun Qu¹, Cuihua Li¹, Zhi-Song Liu¹, Li-Wen Wang¹, Chu-Tak Li¹, Can Zhao¹, Bowen Li¹, Chung-Chi Tsai¹, Shang-Chih Chuang¹, Joon-Hee Choi¹, Joon-Soo Kim¹, Xiaoyun Jiang¹, Ze Pan¹, Qunbo Lv¹, Zheng Tan¹, Peidong He¹

Hong Kong Polytechnic University¹

16 Jun 2019

TL;DR: This paper reviews the 3rd NTIRE challenge on single-image super-resolution (restoration of rich details in a low-resolution image), focusing on the proposed solutions and results, and gauges the state of the art in real-world single-image super-resolution.

Abstract: This paper reviewed the 3rd NTIRE challenge on single-image super-resolution (restoration of rich details in a low-resolution image) with a focus on proposed solutions and results. The challenge had 1 track, which was aimed at the real-world single image super-resolution problem with an unknown scaling factor. Participants were mapping low-resolution images captured by a DSLR camera with a shorter focal length to their high-resolution images captured at a longer focal length. With this challenge, we introduced a novel real-world super-resolution dataset (RealSR). The track had 403 registered participants, and 36 teams competed in the final testing phase. The submitted solutions gauge the state of the art in real-world single image super-resolution.

118 citations

Proceedings Article

Texture Enhanced Image Denoising via Gradient Histogram Preservation

Wangmeng Zuo¹,², Lei Zhang¹, Chunwei Song², David Zhang¹

Hong Kong Polytechnic University¹, Harbin Institute of Technology²

23 Jun 2013

TL;DR: A texture enhanced image denoising (TEID) method is proposed that enforces the gradient distribution of the denoised image to be close to the estimated gradient distribution of the original image, enhancing texture structures while removing noise.

Abstract: Image denoising is a classical yet fundamental problem in low level vision, as well as an ideal test bed to evaluate various statistical image modeling methods. One of the most challenging problems in image denoising is how to preserve the fine scale texture structures while removing noise. Various natural image priors, such as gradient based prior, nonlocal self-similarity prior, and sparsity prior, have been extensively exploited for noise removal. The denoising algorithms based on these priors, however, tend to smooth the detailed image textures, degrading the image visual quality. To address this problem, in this paper we propose a texture enhanced image denoising (TEID) method by enforcing the gradient distribution of the denoised image to be close to the estimated gradient distribution of the original image. A novel gradient histogram preservation (GHP) algorithm is developed to enhance the texture structures while removing noise. Our experimental results demonstrate that the proposed GHP based TEID can well preserve the texture features of the denoised images, making them look more natural.
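
To make the notion of a gradient distribution concrete, the sketch below computes a normalized histogram of gradient magnitudes for a grayscale image and an L1 distance between two such histograms. It only measures the quantity the abstract talks about and is not the GHP algorithm itself; the function names, bin count, and `max_grad` range are assumptions chosen for illustration.

```python
import numpy as np

def gradient_histogram(img, bins=32, max_grad=1.0):
    """Normalized histogram of gradient magnitudes of a 2-D grayscale image."""
    gy, gx = np.gradient(img.astype(float))  # finite differences along rows, columns
    mag = np.hypot(gx, gy)
    mag = np.clip(mag, 0.0, max_grad)        # large gradients fall into the last bin
    hist, _ = np.histogram(mag, bins=bins, range=(0.0, max_grad))
    return hist / hist.sum()

def histogram_distance(h1, h2):
    """L1 distance between two normalized histograms (0 means identical)."""
    return float(np.abs(h1 - h2).sum())
```

An over-smoothed denoising result concentrates its gradient histogram near zero, so its distance to the histogram of a textured reference grows; this is the kind of mismatch that a gradient histogram constraint is meant to penalize.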

102 citations

Proceedings Article

NTIRE 2019 Challenge on Real Image Denoising: Methods and Results

Abdelrahman Abdelhamed¹, Radu Timofte¹, Michael S. Brown¹, Songhyun Yu¹, Bumjun Park¹, Jechang Jeong¹, Seung-Won Jung¹, Dong-Wook Kim¹, Jae-Ryun Chung¹, Jiaming Liu¹, Yuzhi Wang¹, Chi-Hao Wu¹, Qin Xu¹, Chuan Wang¹, Shaofan Cai¹, Yifan Ding¹, Haoqiang Fan¹, Jue Wang¹, Kai Zhang¹, Wangmeng Zuo¹, Magauiya Zhussip¹, Dongwon Park¹, Shakarim Soltanayev¹, Se Young Chun¹, Zhiwei Xiong¹, Chang Chen¹, Muhammad Haris¹, Kazutoshi Akita¹, Tomoki Yoshida¹, Greg Shakhnarovich¹, Norimichi Ukita¹, Syed Waqas Zamir¹, Aditya Arora¹, Salman Khan¹, Fahad Shahbaz Khan¹, Ling Shao¹, Sung-Jea Ko¹, Dong-Pan Lim¹, Seung-Wook Kim¹, Seo-Won Ji¹, Sang-Won Lee¹, Wenyi Tang¹, Yuchen Fan¹, Yuqian Zhou¹, Ding Liu¹, Thomas S. Huang¹, Deyu Meng¹, Lei Zhang¹, Hongwei Yong¹, Yiyun Zhao¹, Pengliang Tang¹, Yue Lu¹, Raimondo Schettini¹, Simone Bianco¹, Simone Zini¹, Chi Li¹, Yang Wang¹, Zhiguo Cao¹

York University¹

16 Jun 2019

TL;DR: The proposed methods by the 15 teams represent the current state-of-the-art performance in image denoising targeting real noisy images.

Abstract: This paper reviews the NTIRE 2019 challenge on real image denoising with a focus on the proposed methods and their results. The challenge has two tracks for quantitatively evaluating image denoising performance in (1) the Bayer-pattern raw-RGB and (2) the standard RGB (sRGB) color spaces. The tracks had 216 and 220 registered participants, respectively. A total of 15 teams, proposing 17 methods, competed in the final phase of the challenge. The proposed methods by the 15 teams represent the current state-of-the-art performance in image denoising targeting real noisy images.

99 citations

Journal Article

Deep CNNs Meet Global Covariance Pooling: Better Representation and Generalization

Qilong Wang¹, Jiangtao Xie², Wangmeng Zuo², Lei Zhang³, Peihua Li²

Tianjin University¹, Dalian University of Technology², Harbin Institute of Technology³

01 Aug 2021-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This paper proposes global Matrix Power Normalized COVariance (MPN-COV) pooling, which yields a robust covariance estimate for deep features of high dimension and small sample size.

Abstract: Compared with global average pooling in existing deep convolutional neural networks (CNNs), global covariance pooling can capture richer statistics of deep features, having potential for improving representation and generalization abilities of deep CNNs. However, integration of global covariance pooling into deep CNNs brings two challenges: (1) robust covariance estimation given deep features of high dimension and small sample size; (2) appropriate usage of geometry of covariances. To address these challenges, we propose a global Matrix Power Normalized COVariance (MPN-COV) Pooling. Our MPN-COV conforms to a robust covariance estimator, well suited to the scenario of high dimension and small sample size. It can also be regarded as a Power-Euclidean metric between covariances, effectively exploiting their geometry. Furthermore, a global Gaussian embedding network is proposed to incorporate first-order statistics into MPN-COV. For fast training of MPN-COV networks, we implement an iterative matrix square root normalization, avoiding the GPU unfriendly eigen-decomposition inherent in MPN-COV. Additionally, progressive $1\times 1$ convolutions and group convolution are introduced to compress covariance representations. The proposed methods are highly modular and readily plugged into existing deep CNNs. Extensive experiments are conducted on large-scale object classification, scene categorization, fine-grained visual recognition and texture classification, showing our methods outperform the counterparts and obtain state-of-the-art performance.
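
The two steps named in this abstract, covariance pooling followed by an iterative matrix square root, can be sketched as follows. This is a minimal NumPy illustration, assuming a coupled Newton-Schulz iteration with trace pre-normalization as the iterative scheme; the abstract does not say which iteration the paper uses, and the function names and iteration count are hypothetical.

```python
import numpy as np

def covariance_pool(features):
    """Sample covariance of n local descriptors.

    features: array of shape (n, d); returns a (d, d) symmetric matrix.
    """
    centered = features - features.mean(axis=0, keepdims=True)
    return centered.T @ centered / features.shape[0]

def newton_schulz_sqrt(A, num_iters=20):
    """Approximate square root of an SPD matrix using only matrix products.

    Assumed scheme (not confirmed by the abstract): coupled Newton-Schulz
    iteration. Dividing by the trace puts all eigenvalues in (0, 1], which
    keeps the iteration inside its convergence region.
    """
    d = A.shape[0]
    trace = np.trace(A)
    Y = A / trace            # converges to (A / trace)^{1/2}
    Z = np.eye(d)            # converges to (A / trace)^{-1/2}
    I = np.eye(d)
    for _ in range(num_iters):
        T = 0.5 * (3.0 * I - Z @ Y)
        Y = Y @ T
        Z = T @ Z
    return Y * np.sqrt(trace)  # undo the pre-normalization
```

The iteration needs no eigendecomposition, only matrix multiplications, which is the property the abstract highlights as GPU friendly. Convergence slows down when the covariance has very small eigenvalues relative to its trace, so the iteration count may need tuning in that case.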

72 citations

Cited by

Posted Content

Deep High-Resolution Representation Learning for Visual Recognition

Jingdong Wang¹, Ke Sun², Tianheng Cheng³, Borui Jiang⁴, Chaorui Deng⁵, Yang Zhao⁶, Dong Liu², Yadong Mu⁴, Mingkui Tan⁵, Xinggang Wang³, Wenyu Liu³, Bin Xiao¹

Microsoft¹, University of Science and Technology of China², Huazhong University of Science and Technology³, Peking University⁴, South China University of Technology⁵, Griffith University⁶

20 Aug 2019-arXiv: Computer Vision and Pattern Recognition

TL;DR: The superiority of the proposed HRNet in a wide range of applications, including human pose estimation, semantic segmentation, and object detection, is shown, suggesting that the HRNet is a stronger backbone for computer vision problems.

Abstract: High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection. Existing state-of-the-art frameworks first encode the input image as a low-resolution representation through a subnetwork that is formed by connecting high-to-low resolution convolutions in series (e.g., ResNet, VGGNet), and then recover the high-resolution representation from the encoded low-resolution representation. Instead, our proposed network, named as High-Resolution Network (HRNet), maintains high-resolution representations through the whole process. There are two key characteristics: (i) Connect the high-to-low resolution convolution streams in parallel; (ii) Repeatedly exchange the information across resolutions. The benefit is that the resulting representation is semantically richer and spatially more precise. We show the superiority of the proposed HRNet in a wide range of applications, including human pose estimation, semantic segmentation, and object detection, suggesting that the HRNet is a stronger backbone for computer vision problems. All the codes are available at this https URL.

1,278 citations

Journal Article

Deep High-Resolution Representation Learning for Visual Recognition

Microsoft¹, University of Science and Technology of China², Huazhong University of Science and Technology³, Peking University⁴, South China University of Technology⁵, Griffith University⁶

01 Oct 2021-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: The High-Resolution Network (HRNet) maintains high-resolution representations through the whole process by connecting the high-to-low resolution convolution streams in parallel and repeatedly exchanging the information across resolutions.

Abstract: High-resolution representations are essential for position-sensitive vision problems, such as human pose estimation, semantic segmentation, and object detection. Existing state-of-the-art frameworks first encode the input image as a low-resolution representation through a subnetwork that is formed by connecting high-to-low resolution convolutions in series (e.g., ResNet, VGGNet), and then recover the high-resolution representation from the encoded low-resolution representation. Instead, our proposed network, named as High-Resolution Network (HRNet), maintains high-resolution representations through the whole process. There are two key characteristics: (i) Connect the high-to-low resolution convolution streams in parallel and (ii) repeatedly exchange the information across resolutions. The benefit is that the resulting representation is semantically richer and spatially more precise. We show the superiority of the proposed HRNet in a wide range of applications, including human pose estimation, semantic segmentation, and object detection, suggesting that the HRNet is a stronger backbone for computer vision problems. All the codes are available at https://github.com/HRNet.

1,162 citations

Proceedings Article

Multi-Stage Progressive Image Restoration

Syed Waqas Zamir, Aditya Arora, Salman Khan¹, Munawar Hayat², Fahad Shahbaz Khan¹, Ming-Hsuan Yang³, Ling Shao

Zayed University¹, Monash University², University of California, Merced³

04 Feb 2021

TL;DR: MPRNet is a multi-stage architecture that progressively learns restoration functions for the degraded inputs, thereby breaking down the overall recovery process into more manageable steps, and introduces a novel per-pixel adaptive design that leverages in-situ supervised attention to reweight the local features.

Abstract: Image restoration tasks demand a complex balance between spatial details and high-level contextualized information while recovering images. In this paper, we propose a novel synergistic design that can optimally balance these competing goals. Our main proposal is a multi-stage architecture, that progressively learns restoration functions for the degraded inputs, thereby breaking down the overall recovery process into more manageable steps. Specifically, our model first learns the contextualized features using encoder-decoder architectures and later combines them with a high-resolution branch that retains local information. At each stage, we introduce a novel per-pixel adaptive design that leverages in-situ supervised attention to reweight the local features. A key ingredient in such a multi-stage architecture is the information exchange between different stages. To this end, we propose a two-faceted approach where the information is not only exchanged sequentially from early to late stages, but lateral connections between feature processing blocks also exist to avoid any loss of information. The resulting tightly interlinked multi-stage architecture, named as MPRNet, delivers strong performance gains on ten datasets across a range of tasks including image deraining, deblurring, and denoising. The source code and pre-trained models are available at https://github.com/swz30/MPRNet.

716 citations

Journal Article

Introduction to Functional Analysis

R. E. Edwards, A. E. Taylor

01 Dec 1959-Biometrika

640 citations

Book Chapter

Learning Enriched Features for Real Image Restoration and Enhancement

Syed Waqas Zamir, Aditya Arora, Salman Khan¹, Munawar Hayat¹, Fahad Shahbaz Khan¹, Ming-Hsuan Yang², Ling Shao¹

Zayed University¹, University of California, Merced²

23 Aug 2020

TL;DR: MIRNet is built around a multi-scale residual block containing several key elements: (a) parallel multi-resolution convolution streams for extracting multi-scale features, (b) information exchange across the multi-resolution streams, (c) spatial and channel attention mechanisms for capturing contextual information, and (d) attention-based multi-scale feature aggregation.

Abstract: With the goal of recovering high-quality image content from its degraded version, image restoration enjoys numerous applications, such as in surveillance, computational photography and medical imaging. Recently, convolutional neural networks (CNNs) have achieved dramatic improvements over conventional approaches for image restoration task. Existing CNN-based methods typically operate either on full-resolution or on progressively low-resolution representations. In the former case, spatially precise but contextually less robust results are achieved, while in the latter case, semantically reliable but spatially less accurate outputs are generated. In this paper, we present an architecture with the collective goals of maintaining spatially-precise high-resolution representations through the entire network and receiving strong contextual information from the low-resolution representations. The core of our approach is a multi-scale residual block containing several key elements: (a) parallel multi-resolution convolution streams for extracting multi-scale features, (b) information exchange across the multi-resolution streams, (c) spatial and channel attention mechanisms for capturing contextual information, and (d) attention based multi-scale feature aggregation. In a nutshell, our approach learns an enriched set of features that combines contextual information from multiple scales, while simultaneously preserving the high-resolution spatial details. Extensive experiments on five real image benchmark datasets demonstrate that our method, named as MIRNet, achieves state-of-the-art results for image denoising, super-resolution, and image enhancement. The source code and pre-trained models are available at https://github.com/swz30/MIRNet.

357 citations
