Author

Xu Jizheng

Bio: Xu Jizheng is an academic researcher from Microsoft. The author has contributed to research topics including video processing and motion vectors. The author has an h-index of 33 and has co-authored 457 publications receiving 5,071 citations.


Papers
Proceedings ArticleDOI
01 Oct 2017
TL;DR: An image dehazing model built with a convolutional neural network (CNN) on top of a re-formulated atmospheric scattering model, called the All-in-One Dehazing Network (AOD-Net), which demonstrates performance superior to the state of the art in terms of PSNR, SSIM, and subjective visual quality.
Abstract: This paper proposes an image dehazing model built with a convolutional neural network (CNN), called the All-in-One Dehazing Network (AOD-Net). It is designed based on a re-formulated atmospheric scattering model. Instead of estimating the transmission matrix and the atmospheric light separately, as most previous models did, AOD-Net directly generates the clean image through a lightweight CNN. This novel end-to-end design makes it easy to embed AOD-Net into other deep models, e.g., Faster R-CNN, to improve high-level tasks on hazy images. Experimental results on both synthesized and natural hazy image datasets demonstrate performance superior to the state of the art in terms of PSNR, SSIM, and subjective visual quality. Furthermore, when concatenating AOD-Net with Faster R-CNN, we observe a large improvement in object detection performance on hazy images.
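For context, the re-formulation behind AOD-Net can be summarized as follows; this is a sketch using the notation commonly cited for the paper, with b denoting a constant bias:

```latex
% Standard atmospheric scattering model: I(x) is the observed hazy image,
% J(x) the clean scene radiance, t(x) the transmission matrix, and A the
% global atmospheric light.
I(x) = J(x)\,t(x) + A\bigl(1 - t(x)\bigr)
% AOD-Net folds t(x) and A into a single variable K(x) (plus constant
% bias b), so the clean image becomes a direct function of the input:
J(x) = K(x)\,I(x) - K(x) + b,
\qquad
K(x) = \frac{\tfrac{1}{t(x)}\bigl(I(x) - A\bigr) + (A - b)}{I(x) - 1}
```

Under this formulation the CNN only has to estimate K(x), which is what makes the end-to-end design possible.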

1,185 citations

Journal ArticleDOI
TL;DR: This paper analyzes the ME structure in HEVC and proposes a parallel framework to decouple ME for different partitions on many-core processors, achieving more than 30× and 40× speedups for 1920 × 1080 and 2560 × 1600 video sequences, respectively.
Abstract: High Efficiency Video Coding (HEVC) provides coding efficiency superior to that of previous video coding standards, at the cost of increased encoding complexity. The complexity increase of the motion estimation (ME) procedure is particularly significant, especially considering the complicated partitioning structure of HEVC. Fully exploiting the coding efficiency offered by HEVC requires a huge amount of computation. In this paper, we analyze the ME structure in HEVC and propose a parallel framework to decouple ME for different partitions on many-core processors. Based on the local parallel method (LPM), we first use a directed acyclic graph (DAG)-based order to parallelize coding tree units (CTUs) and adopt an improved LPM (ILPM) within each CTU (together, DAGILPM), which exploits CTU-level and prediction unit (PU)-level parallelism. We then observe that there exist completely independent PUs (CIPUs) and partially independent PUs (PIPUs); when the degree of parallelism (DP) is smaller than the maximum DP of DAGILPM, we process the CIPUs and PIPUs, which further increases the DP. The data dependencies and coding efficiency stay the same as with LPM. Experiments show that, on a 64-core system, our proposed scheme achieves more than 30× and 40× speedups over serial execution for 1920 × 1080 and 2560 × 1600 video sequences, respectively.
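As a rough illustration of DAG-ordered CTU processing, the sketch below runs CTUs level by level once their dependencies are satisfied. The left/top-right dependency pattern and the helper names are assumptions chosen for illustration (a wavefront-style pattern), not the paper's exact framework:

```python
# Rough illustration of DAG-ordered CTU processing on a many-core system.
# The dependency pattern below is an assumed wavefront-style pattern; the
# paper derives its DAG from the actual ME dependencies among partitions.
from concurrent.futures import ThreadPoolExecutor

ROWS, COLS = 17, 30  # 64x64 CTU grid covering a 1920x1080 frame

def ctu_dependencies(row, col):
    """Assume each CTU waits for its left and top-right neighbors."""
    deps = []
    if col > 0:
        deps.append((row, col - 1))                      # left CTU
    if row > 0:
        deps.append((row - 1, min(col + 1, COLS - 1)))   # top/top-right CTU
    return deps

def process_in_dag_order(encode_ctu, workers=64):
    remaining = {(r, c): set(ctu_dependencies(r, c))
                 for r in range(ROWS) for c in range(COLS)}
    with ThreadPoolExecutor(max_workers=workers) as pool:
        while remaining:
            # Every CTU whose dependencies are satisfied can run in parallel.
            ready = [ctu for ctu, deps in remaining.items() if not deps]
            list(pool.map(encode_ctu, ready))
            for ctu in ready:
                del remaining[ctu]
            for deps in remaining.values():
                deps.difference_update(ready)
```

The available parallelism grows and shrinks as the wavefront sweeps the frame, which is why the paper adds PU-level parallelism on top of the CTU-level ordering.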

366 citations

Journal ArticleDOI
TL;DR: This paper proposes a parallel framework for deciding coding unit trees through an in-depth understanding of the dependencies among different coding units, achieving on average more than 11× and 16× speedups for 1920×1080 and 2560×1600 video sequences, respectively, without any coding efficiency degradation.
Abstract: High Efficiency Video Coding (HEVC) uses a very flexible tree structure to organize coding units, which leads to coding efficiency superior to that of previous video coding standards. However, such a flexible coding unit tree structure also poses a great challenge for encoders: to fully exploit the coding efficiency this structure offers, an encoder must spend a huge amount of computation deciding the optimal coding unit tree for each image block. One way to achieve this is parallel computing enabled by many-core processors. In this paper, we analyze the challenges of using many-core processors to make coding unit tree decisions. Through an in-depth understanding of the dependencies among different coding units, we propose a parallel framework for deciding coding unit trees. Experimental results show that, on the Tile64 platform, our proposed method achieves on average more than 11× and 16× speedups for 1920×1080 and 2560×1600 video sequences, respectively, without any coding efficiency degradation.
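To make the decision problem concrete, here is a generic sketch of the coding-unit quadtree decision: each block is coded whole or split into four sub-blocks, whichever yields the lower rate-distortion cost. The rd_cost_no_split callback is hypothetical, standing in for a full mode decision; the paper's parallel framework itself is not shown:

```python
# Generic sketch of the coding-unit quadtree decision. rd_cost_no_split is
# a hypothetical encoder callback returning the RD cost of coding a block
# without further splitting.

MIN_CU = 8  # smallest CU size in HEVC

def decide_cu_tree(x, y, size, rd_cost_no_split):
    cost_whole = rd_cost_no_split(x, y, size)
    if size == MIN_CU:
        return cost_whole, ("leaf", x, y, size)
    half = size // 2
    children = [decide_cu_tree(cx, cy, half, rd_cost_no_split)
                for cx, cy in ((x, y), (x + half, y),
                               (x, y + half), (x + half, y + half))]
    cost_split = sum(cost for cost, _ in children)
    if cost_split < cost_whole:
        return cost_split, ("split", [tree for _, tree in children])
    return cost_whole, ("leaf", x, y, size)
```

Note that the four recursive sub-block evaluations are mutually independent, which is the kind of dependency insight a many-core framework can exploit.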

342 citations

Journal ArticleDOI
TL;DR: An overview of the technical features and characteristics of the current HEVC-SCC test model and related coding tools, including intra-block copy, palette mode, adaptive color transform, and adaptive motion vector resolution, is provided.
Abstract: A screen content coding (SCC) extension to High Efficiency Video Coding (HEVC) is currently under development by the Joint Collaborative Team on Video Coding, which is a joint effort from the ITU-T Video Coding Experts Group and the ISO/IEC Moving Picture Experts Group. The main goal of the HEVC-SCC standardization effort is to enable significantly improved compression performance for videos containing a substantial amount of still or moving rendered graphics, text, and animation rather than, or in addition to, camera-captured content. This paper provides an overview of the technical features and characteristics of the current HEVC-SCC test model and related coding tools, including intra-block copy, palette mode, adaptive color transform, and adaptive motion vector resolution. The performance of the SCC extension is compared against existing standards in terms of bitrate savings at equal distortion.
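As a toy illustration of the palette-mode idea mentioned above (screen content tends to use few distinct colors), the sketch below builds a small palette for a block and maps each pixel to its nearest entry. The normative HEVC-SCC tool also covers escape pixels, palette prediction, and run-length coding of the indices, all omitted here:

```python
import numpy as np

# Toy illustration of the palette-mode idea: a block is signaled as a small
# palette plus an index map. This is not the normative HEVC-SCC process.

def palette_code_block(block, max_palette_size=8):
    pixels = block.reshape(-1, block.shape[-1])
    colors, counts = np.unique(pixels, axis=0, return_counts=True)
    palette = colors[np.argsort(-counts)][:max_palette_size]  # most frequent colors
    # Map every pixel to its nearest palette entry (L2 distance).
    dists = np.linalg.norm(pixels[:, None, :].astype(int)
                           - palette[None, :, :].astype(int), axis=2)
    indices = dists.argmin(axis=1).reshape(block.shape[:2])
    return palette, indices
```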

247 citations

Journal ArticleDOI
TL;DR: In this article, a three-dimensional embedded subband coding with optimized truncation (3-D ESCOT) algorithm is proposed, in which coefficients in different subbands are independently coded using fractional bit-plane coding, and candidate truncation points are formed at the end of each fractional bit-plane.
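A heavily simplified sketch of what optimized truncation implies: the end of every fractional bit-plane pass yields a candidate (rate, distortion) pair, and one truncation point per subband is chosen by minimizing a Lagrangian cost. This is an assumption about the general scheme, not the paper's exact procedure, and the coding passes themselves are omitted:

```python
# Pick one truncation point per subband by minimizing D + lambda * R,
# given the candidate (rate, distortion) pairs collected at the end of
# each fractional bit-plane pass. A sketch of the general idea only.

def pick_truncation_point(candidates, lam):
    """candidates: (rate, distortion) pairs at each pass boundary."""
    return min(candidates, key=lambda rd: rd[1] + lam * rd[0])
```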

192 citations


Cited by

Journal ArticleDOI
TL;DR: The results of subjective tests for WVGA and HD sequences indicate that HEVC encoders can achieve subjective reproduction quality equivalent to that of encoders conforming to H.264/MPEG-4 AVC while using approximately 50% less bit rate on average.
Abstract: The compression capability of several generations of video coding standards is compared by means of peak signal-to-noise ratio (PSNR) and subjective testing results. A unified approach is applied to the analysis of designs including H.262/MPEG-2 Video, H.263, MPEG-4 Visual, H.264/MPEG-4 Advanced Video Coding (AVC), and High Efficiency Video Coding (HEVC). The results of subjective tests for WVGA and HD sequences indicate that HEVC encoders can achieve subjective reproduction quality equivalent to that of encoders conforming to H.264/MPEG-4 AVC while using approximately 50% less bit rate on average. The HEVC design is shown to be especially effective for low bit rates, high-resolution video content, and low-delay communication applications. The measured subjective improvement somewhat exceeds the improvement measured by the PSNR metric.
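For reference, the PSNR metric used in the objective comparison is standard; a minimal implementation, assuming 8-bit samples with peak value 255:

```python
import numpy as np

# Standard PSNR definition: 10 * log10(MAX^2 / MSE), here with MAX = 255.
def psnr(reference, decoded):
    mse = np.mean((reference.astype(np.float64)
                   - decoded.astype(np.float64)) ** 2)
    return float("inf") if mse == 0 else 10.0 * np.log10(255.0 ** 2 / mse)
```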

1,279 citations

Proceedings ArticleDOI
18 Jun 2018
TL;DR: DeblurGAN achieves state-of-the-art performance in both the structural similarity measure and visual appearance, and is 5× faster than the closest competitor, Deep-Deblur.
Abstract: We present DeblurGAN, an end-to-end learned method for motion deblurring. The learning is based on a conditional GAN and a content loss. DeblurGAN achieves state-of-the-art performance in both the structural similarity measure and visual appearance. The quality of the deblurring model is also evaluated in a novel way on a real-world problem: object detection on (de-)blurred images. The method is five times faster than the closest competitor, Deep-Deblur [25]. We also introduce a novel method for generating synthetic motion-blurred images from sharp ones, allowing realistic dataset augmentation. The model, code, and dataset are available at https://github.com/KupynOrest/DeblurGAN
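A hedged sketch of a generator objective in the style described above: a conditional-GAN critic term plus a perceptual content loss on features of a pretrained network. The critic and feature networks, signatures, and the weighting lam are assumptions, not the repository's actual API:

```python
import torch.nn.functional as F

# Sketch only: adversarial term (critic score) plus perceptual content
# loss between deep features of the restored and sharp images. All names
# and the weighting factor are assumptions for illustration.
def deblurgan_generator_loss(critic, feat_net, restored, sharp, lam=100.0):
    adv_loss = -critic(restored).mean()  # WGAN-style critic score
    content_loss = F.mse_loss(feat_net(restored), feat_net(sharp))
    return adv_loss + lam * content_loss
```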

1,147 citations

Journal ArticleDOI
TL;DR: In this article, the authors present a comprehensive study and evaluation of existing single-image dehazing algorithms, using a new large-scale benchmark consisting of both synthetic and real-world hazy images, called REalistic Single-Image DEhazing (RESIDE).
Abstract: We present a comprehensive study and evaluation of existing single-image dehazing algorithms, using a new large-scale benchmark consisting of both synthetic and real-world hazy images, called REalistic Single-Image DEhazing (RESIDE). RESIDE highlights diverse data sources and image contents, and is divided into five subsets, each serving different training or evaluation purposes. We further provide a rich variety of criteria for dehazing algorithm evaluation, ranging from full-reference and no-reference metrics to subjective evaluation and a novel task-driven evaluation. Experiments on RESIDE shed light on the comparisons and limitations of the state-of-the-art dehazing algorithms, and suggest promising future directions.
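A minimal sketch of the full-reference side of such an evaluation: scoring each dehazed output against its paired ground truth with PSNR and SSIM. The metric functions are from scikit-image; how the (output, ground_truth) pairs are loaded is left out and assumed:

```python
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

# Average PSNR/SSIM over paired (dehazed output, ground truth) images,
# assuming 8-bit color arrays. Pair loading is assumed to happen elsewhere.
def evaluate_pairs(pairs):
    scores = [(peak_signal_noise_ratio(gt, out, data_range=255),
               structural_similarity(gt, out, channel_axis=-1, data_range=255))
              for out, gt in pairs]
    psnrs, ssims = zip(*scores)
    return sum(psnrs) / len(psnrs), sum(ssims) / len(ssims)
```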

922 citations