Author

Yifan Jiang

Other affiliations: Huazhong University of Science and Technology

Bio: Yifan Jiang is an academic researcher at the University of Texas at Austin. The author has contributed to research on topics including deep learning and image restoration. The author has an h-index of 8 and has co-authored 15 publications receiving 640 citations. Previous affiliations of Yifan Jiang include Huazhong University of Science and Technology.

Papers

Journal Article

EnlightenGAN: Deep Light Enhancement Without Paired Supervision

Yifan Jiang¹, Xinyu Gong¹, Ding Liu, Yu Cheng², Chen Fang, Xiaohui Shen, Jianchao Yang, Pan Zhou³, Zhangyang Wang¹

University of Texas at Austin¹, Microsoft², Huazhong University of Science and Technology³

22 Jan 2021-IEEE Transactions on Image Processing

TL;DR: This paper proposes EnlightenGAN, a highly effective unsupervised generative adversarial network that can be trained without low/normal-light image pairs, yet proves to generalize very well on various real-world test images.

Abstract: Deep learning-based methods have achieved remarkable success in image restoration and enhancement, but are they still competitive when there is a lack of paired training data? As one such example, this paper explores the low-light image enhancement problem, where in practice it is extremely challenging to simultaneously take a low-light and a normal-light photo of the same visual scene. We propose a highly effective unsupervised generative adversarial network, dubbed EnlightenGAN, that can be trained without low/normal-light image pairs, yet proves to generalize very well on various real-world test images. Instead of supervising the learning using ground truth data, we propose to regularize the unpaired training using the information extracted from the input itself, and benchmark a series of innovations for the low-light image enhancement problem, including a global-local discriminator structure, a self-regularized perceptual loss fusion, and the attention mechanism. Through extensive experiments, our proposed approach outperforms recent methods under a variety of metrics in terms of visual quality and subjective user study. Thanks to the great flexibility brought by unpaired training, EnlightenGAN is demonstrated to be easily adaptable to enhancing real-world images from various domains. Our code and pre-trained models are available at: https://github.com/VITA-Group/EnlightenGAN.

537 citations

Posted Content

EnlightenGAN: Deep Light Enhancement without Paired Supervision

Yifan Jiang¹, Xinyu Gong¹, Ding Liu, Yu Cheng², Chen Fang, Xiaohui Shen, Jianchao Yang, Pan Zhou³, Zhangyang Wang¹

University of Texas at Austin¹, Microsoft², Huazhong University of Science and Technology³

17 Jun 2019-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper proposes a highly effective unsupervised generative adversarial network, dubbed EnlightenGAN, that can be trained without low/normal-light image pairs, yet proves to generalize very well on various real-world test images.

Abstract: Deep learning-based methods have achieved remarkable success in image restoration and enhancement, but are they still competitive when there is a lack of paired training data? As one such example, this paper explores the low-light image enhancement problem, where in practice it is extremely challenging to simultaneously take a low-light and a normal-light photo of the same visual scene. We propose a highly effective unsupervised generative adversarial network, dubbed EnlightenGAN, that can be trained without low/normal-light image pairs, yet proves to generalize very well on various real-world test images. Instead of supervising the learning using ground truth data, we propose to regularize the unpaired training using the information extracted from the input itself, and benchmark a series of innovations for the low-light image enhancement problem, including a global-local discriminator structure, a self-regularized perceptual loss fusion, and an attention mechanism. Through extensive experiments, our proposed approach outperforms recent methods under a variety of metrics in terms of visual quality and subjective user study. Thanks to the great flexibility brought by unpaired training, EnlightenGAN is demonstrated to be easily adaptable to enhancing real-world images from various domains. The code is publicly available.

520 citations

Proceedings Article

AutoGAN: Neural Architecture Search for Generative Adversarial Networks

Xinyu Gong¹, Shiyu Chang², Yifan Jiang³, Zhangyang Wang¹

Texas A&M University¹, IBM², Huazhong University of Science and Technology³

01 Oct 2019

TL;DR: This paper presents the first preliminary study on introducing the NAS algorithm to generative adversarial networks (GANs), dubbed AutoGAN, and discovers architectures that achieve highly competitive performance compared to current state-of-the-art hand-crafted GANs.

Abstract: Neural architecture search (NAS) has witnessed prevailing success in image classification and (very recently) segmentation tasks. In this paper, we present the first preliminary study on introducing the NAS algorithm to generative adversarial networks (GANs), dubbed AutoGAN. The marriage of NAS and GANs faces its unique challenges. We define the search space for the generator architectural variations and use an RNN controller to guide the search, with parameter sharing and dynamic-resetting to accelerate the process. Inception score is adopted as the reward, and a multi-level search strategy is introduced to perform NAS in a progressive way. Experiments validate the effectiveness of AutoGAN on the task of unconditional image generation. Specifically, our discovered architectures achieve highly competitive performance compared to current state-of-the-art hand-crafted GANs, e.g., setting new state-of-the-art FID scores of 12.42 on CIFAR-10, and 31.01 on STL-10, respectively. We also conclude with a discussion of the current limitations and future potential of AutoGAN. The code is available at https://github.com/TAMU-VITA/AutoGAN

204 citations

Posted Content

Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and Beyond

Xi Ouyang, Yu Cheng, Yifan Jiang, Chun-Liang Li, Pan Zhou

05 Apr 2018-arXiv: Computer Vision and Pattern Recognition

TL;DR: The proposed framework is built on the Generative Adversarial Network with multiple discriminators, trying to synthesize realistic pedestrians and learn the background context simultaneously, and can smoothly synthesize pedestrians on background images of variations and different levels of details.

Abstract: State-of-the-art pedestrian detection models have achieved great success in many benchmarks. However, these models require a lot of annotation information, and the labeling process usually takes much time and effort. In this paper, we propose a method to generate labeled pedestrian data and adapt them to support the training of pedestrian detectors. The proposed framework is built on the Generative Adversarial Network (GAN) with multiple discriminators, trying to synthesize realistic pedestrians and learn the background context simultaneously. To handle pedestrians of different sizes, we adopt the Spatial Pyramid Pooling (SPP) layer in the discriminator. We conduct experiments on two benchmarks. The results show that our framework can smoothly synthesize pedestrians on background images of variations and different levels of details. To quantitatively evaluate our approach, we add the generated samples into the training data of the baseline pedestrian detectors and show that the synthetic images are able to improve the detectors' performance.
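
The abstract mentions a Spatial Pyramid Pooling (SPP) layer for handling pedestrians of different sizes. The following is a minimal, illustrative sketch of the general SPP idea (max-pooling a feature map into fixed grids at several levels so that the output length does not depend on the input size). It is not the authors' code: the function name `spatial_pyramid_pool`, the pyramid levels `(1, 2)`, and the toy inputs are assumptions chosen for illustration.

```python
import math
from typing import List, Sequence


def spatial_pyramid_pool(
    fm: Sequence[Sequence[float]], levels: Sequence[int] = (1, 2)
) -> List[float]:
    """Max-pool a 2-D feature map into n x n bins for each pyramid level n.

    The output length is sum(n * n for n in levels), regardless of the
    height and width of the input map. (Hypothetical helper for illustration.)
    """
    h, w = len(fm), len(fm[0])
    pooled: List[float] = []
    for n in levels:
        for i in range(n):
            # Row range of bin i; neighbouring bins may overlap when h % n != 0.
            r0, r1 = (i * h) // n, math.ceil((i + 1) * h / n)
            for j in range(n):
                c0, c1 = (j * w) // n, math.ceil((j + 1) * w / n)
                pooled.append(
                    max(fm[r][c] for r in range(r0, r1) for c in range(c0, c1))
                )
    return pooled


# Two toy "feature maps" of different sizes (made-up values).
small = [[1.0, 2.0],
         [3.0, 4.0]]
large = [[1.0, 2.0, 3.0, 4.0],
         [5.0, 6.0, 7.0, 8.0],
         [9.0, 10.0, 11.0, 12.0]]

small_vec = spatial_pyramid_pool(small)
large_vec = spatial_pyramid_pool(large)
print(len(small_vec), len(large_vec))  # both vectors have the same length
```

Because both inputs map to vectors of the same length, a classifier head placed after such a layer can accept crops of varying size, which is the property the abstract relies on.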

83 citations

Posted Content

AutoGAN: Neural Architecture Search for Generative Adversarial Networks

Xinyu Gong¹, Shiyu Chang², Yifan Jiang³, Zhangyang Wang¹

Texas A&M University¹, IBM², Huazhong University of Science and Technology³

11 Aug 2019-arXiv: Computer Vision and Pattern Recognition

TL;DR: This paper presents the first preliminary study on introducing the neural architecture search (NAS) algorithm to generative adversarial networks (GANs), dubbed AutoGAN. It defines the search space for the generator architectural variations and uses an RNN controller to guide the search, with parameter sharing and dynamic-resetting to accelerate the process.

73 citations

Cited by

Journal Article

Study Report on “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”

杉山拓海

12 Sep 2017-Computers & Graphics

3,940 citations

Proceedings Article

Training Generative Adversarial Networks with Limited Data

Tero Karras¹, Miika Aittala², Janne Hellsten¹, Samuli Laine¹, Jaakko Lehtinen¹, Timo Aila¹

Nvidia¹, Massachusetts Institute of Technology²

01 Jan 2020

TL;DR: This paper demonstrates, on several datasets, that good results are now possible using only a few thousand training images, often matching StyleGAN2 results with an order of magnitude fewer images, which is expected to open up new application domains for GANs.

Abstract: Training generative adversarial networks (GAN) using too little data typically leads to discriminator overfitting, causing training to diverge. We propose an adaptive discriminator augmentation mechanism that significantly stabilizes training in limited data regimes. The approach does not require changes to loss functions or network architectures, and is applicable both when training from scratch and when fine-tuning an existing GAN on another dataset. We demonstrate, on several datasets, that good results are now possible using only a few thousand training images, often matching StyleGAN2 results with an order of magnitude fewer images. We expect this to open up new application domains for GANs. We also find that the widely used CIFAR-10 is, in fact, a limited data benchmark, and improve the record FID from 5.59 to 2.42.

884 citations

Proceedings Article

GhostNet: More Features From Cheap Operations

Kai Han¹, Yunhe Wang¹, Qi Tian¹, Jianyuan Guo², Chunjing Xu¹, Chang Xu³

Huawei¹, Peking University², University of Sydney³

14 Jun 2020

Abstract: Deploying convolutional neural networks (CNNs) on embedded devices is difficult due to the limited memory and computation resources. The redundancy in feature maps is an important characteristic of those successful CNNs, but has rarely been investigated in neural architecture design. This paper proposes a novel Ghost module to generate more feature maps from cheap operations. Based on a set of intrinsic feature maps, we apply a series of linear transformations with cheap cost to generate many ghost feature maps that could fully reveal information underlying intrinsic features. The proposed Ghost module can be taken as a plug-and-play component to upgrade existing convolutional neural networks. Ghost bottlenecks are designed to stack Ghost modules, and then the lightweight GhostNet can be easily established. Experiments conducted on benchmarks demonstrate that the proposed Ghost module is an impressive alternative of convolution layers in baseline models, and our GhostNet can achieve higher recognition performance (e.g. 75.7% top-1 accuracy) than MobileNetV3 with similar computational cost on the ImageNet ILSVRC-2012 classification dataset. Code is available at https://github.com/huawei-noah/ghostnet.
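
The Ghost module described in this abstract can be illustrated with a small sketch: start from a few intrinsic feature maps, apply cheap linear transformations to each, and concatenate the results. This is only a toy illustration under stated assumptions, not the GhostNet implementation: the names `ghost_module`, `scale`, and `shift_right` are hypothetical, and the two cheap operations (scaling and a one-pixel shift) were picked purely because they are simple linear maps.

```python
from typing import Callable, List

FeatureMap = List[List[float]]          # a 2-D map stored as rows of floats
CheapOp = Callable[[FeatureMap], FeatureMap]


def scale(factor: float) -> CheapOp:
    """Return a cheap linear op that multiplies every value by `factor`."""
    def op(fm: FeatureMap) -> FeatureMap:
        return [[factor * v for v in row] for row in fm]
    return op


def shift_right(fm: FeatureMap) -> FeatureMap:
    """Cheap linear op: shift each row one step right, zero-padding on the left."""
    return [[0.0] + row[:-1] for row in fm]


def ghost_module(intrinsic: List[FeatureMap], cheap_ops: List[CheapOp]) -> List[FeatureMap]:
    """Concatenate the intrinsic maps with the ghost maps derived from them.

    Each intrinsic map yields one ghost map per cheap op, so the output has
    len(intrinsic) * (1 + len(cheap_ops)) maps. (Illustrative assumption.)
    """
    ghosts = [op(fm) for fm in intrinsic for op in cheap_ops]
    return list(intrinsic) + ghosts


# One made-up intrinsic map and two cheap ops give three output maps.
intrinsic_maps = [[[1.0, 2.0],
                   [3.0, 4.0]]]
output_maps = ghost_module(intrinsic_maps, [scale(0.5), shift_right])
print(len(output_maps))
```

The point of the design is that only the intrinsic maps would need an ordinary (expensive) convolution; the remaining maps come from operations that cost far less per output value.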

880 citations

Journal Article

AutoML: A survey of the state-of-the-art

Xin He¹, Kaiyong Zhao¹, Xiaowen Chu¹

Hong Kong Baptist University¹

05 Jan 2021-Knowledge Based Systems

TL;DR: A comprehensive and up-to-date review of the state-of-the-art (SOTA) in AutoML methods according to the pipeline, covering data preparation, feature engineering, hyperparameter optimization, and neural architecture search (NAS).

Abstract: Deep learning (DL) techniques have obtained remarkable achievements on various tasks, such as image recognition, object detection, and language modeling. However, building a high-quality DL system for a specific task highly relies on human expertise, hindering its wide application. Meanwhile, automated machine learning (AutoML) is a promising solution for building a DL system without human assistance and is being extensively studied. This paper presents a comprehensive and up-to-date review of the state-of-the-art (SOTA) in AutoML. According to the DL pipeline, we introduce AutoML methods – covering data preparation, feature engineering, hyperparameter optimization, and neural architecture search (NAS) – with a particular focus on NAS, as it is currently a hot sub-topic of AutoML. We summarize the representative NAS algorithms’ performance on the CIFAR-10 and ImageNet datasets and further discuss the following subjects of NAS methods: one/two-stage NAS, one-shot NAS, joint hyperparameter and architecture optimization, and resource-aware NAS. Finally, we discuss some open problems related to the existing AutoML methods for future research.

809 citations

Posted Content

GhostNet: More Features from Cheap Operations

Kai Han¹, Yunhe Wang¹, Qi Tian¹, Jianyuan Guo¹, Chunjing Xu², Chang Xu³

Huawei¹, Peking University², University of Sydney³

27 Nov 2019-arXiv: Computer Vision and Pattern Recognition

TL;DR: A novel Ghost module is proposed that applies a series of cheap linear transformations to a set of intrinsic feature maps, generating many ghost feature maps that could fully reveal information underlying the intrinsic features.

Abstract: Deploying convolutional neural networks (CNNs) on embedded devices is difficult due to the limited memory and computation resources. The redundancy in feature maps is an important characteristic of those successful CNNs, but has rarely been investigated in neural architecture design. This paper proposes a novel Ghost module to generate more feature maps from cheap operations. Based on a set of intrinsic feature maps, we apply a series of linear transformations with cheap cost to generate many ghost feature maps that could fully reveal information underlying intrinsic features. The proposed Ghost module can be taken as a plug-and-play component to upgrade existing convolutional neural networks. Ghost bottlenecks are designed to stack Ghost modules, and then the lightweight GhostNet can be easily established. Experiments conducted on benchmarks demonstrate that the proposed Ghost module is an impressive alternative to convolution layers in baseline models, and our GhostNet can achieve higher recognition performance (e.g. 75.7% top-1 accuracy) than MobileNetV3 with similar computational cost on the ImageNet ILSVRC-2012 classification dataset. Code is publicly available.

664 citations
