
GGADN: Guided generative adversarial dehazing network

Zhang Jian¹, Wanjuan Song²

Wuhan Sports University¹, Hubei University of Education²

03 Aug 2021-pp 1-11

TL;DR: GGADN adds a guided module to the GAN generator and trains the network with a loss built on pre-trained VGG features and an L1-regularized gradient prior; the authors report that it outperforms state-of-the-art dehazing methods.


Abstract: Image dehazing has always been a challenging topic in image processing. The development of deep learning methods, especially generative adversarial networks (GANs), provides a new way for image dehazing. In recent years, many GAN-based deep learning methods have been applied to image dehazing. However, GANs have two problems in image dehazing. Firstly, haze not only reduces the quality of the image but also blurs its details, and it is difficult for the generator to restore the details of the whole image while removing the haze. Secondly, the GAN model is defined as a minimax problem, which weakens the loss function, so it is difficult to tell whether the GAN is making progress during training. Therefore, we propose a guided generative adversarial dehazing network (GGADN). Different from other generative adversarial networks, GGADN adds a guided module to the generator. The guided module verifies the network of each layer of the generator. At the same time, the details of the map generated by each layer are strengthened. Network training is based on the pre-trained VGG feature model and an L1-regularized gradient prior, which is developed by new loss function parameters. From the dehazing results on synthetic and real images, the proposed method is better than the state-of-the-art dehazing methods.


Summary

1 Introduction

Especially for haze, floating particles in haze lead to the fading and blurring of pictures, and the reduction of contrast and softness.
At present, image dehazing research is mainly divided into two types, feature-based method and learning-based method.
The guided module verifies the network of each layer of the generator.

3 The proposed method

This paper presents a Guided Generative Adversarial Dehazing Network.
The loss function is modified by using the pre-trained VGG feature and L1-regularization gradient.
In the proposed algorithm, an end-to-end dehazing network is trained to avoid the image distortion or artifacts caused by estimating the transmittance and the atmospheric light value.
They are defined from left to right as follows: – Conv1: In-channels are three.
The GGan training method directly uses the adversarial loss function, which is expressed as:

$$L_A = \frac{1}{N}\sum_{i=1}^{N} \log\left[1 - D(I_i, \tilde{J}_i)\right] \tag{3}$$

where $D$ is the discriminator network and $\tilde{J}_i$ is the output of generator $G$. Inputs are fed to the discriminator in minibatch mode.
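The adversarial loss in equation (3) can be illustrated with a minimal NumPy sketch. It assumes the discriminator scores are already sigmoid outputs in [0, 1]; the function name `adversarial_loss` and the clipping constant `eps` are hypothetical and not taken from the paper.

```python
import numpy as np

def adversarial_loss(d_scores, eps=1e-12):
    """Mean of log(1 - D(I_i, J~_i)) over a minibatch of N discriminator scores.

    d_scores: array of shape (N,), each value in [0, 1].
    eps: hypothetical clipping constant that avoids log(0); not from the source.
    """
    d = np.clip(np.asarray(d_scores, dtype=np.float64), 0.0, 1.0 - eps)
    return float(np.mean(np.log(1.0 - d)))

# Scores near 1 (the discriminator is fooled) drive the loss strongly negative;
# scores near 0 leave it close to zero.
fooled = adversarial_loss([0.9, 0.95, 0.99])
caught = adversarial_loss([0.05, 0.1, 0.02])
```

Minimizing this quantity with respect to the generator pushes the discriminator scores of generated images toward 1.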

4 Experimental results and analysis

To evaluate the performance of proposed model, the authors compare it with several advanced single image dehazing algorithms on synthetic datasets and real scene images.
Among them, 3000 synthetic images are randomly selected and 300 test images are extracted.
The Keras deep learning framework is used to train the model, the RMSprop algorithm is used to optimize the model parameters, and the number of training epochs is 100.
Compared with other dehazing methods (DCP, DehazeNet, MSCNN and AODNet), the advantages of the proposed method are that the detail information of the dehazed image is preserved completely, the color recovery is more natural, and the degree of dehazing is moderate.

5 Conclusions

This paper presents a Guided Generative Adversarial Dehazing Network.
In GGADN, the generator and discriminator architectures are modified.
The network is trained end-to-end on the synthetic data set.
A sigmoid function is introduced at the last layer of the discriminator for feature mapping.
The first one is to explore the clear image without reference for training the dehazing network [45,46,51,52].


Figures (5)

Fig. 4 Qualitative comparisons with state-of-the-art dehazing methods for hazy images on real data sets.

Fig. 2 The structure of discriminator model.

Table 1 Quantitative comparison on the synthetic data-set.

Fig. 3 Qualitative comparisons with state-of-the-art dehazing methods for hazy images on synthetic data sets.

Fig. 1 The structure of generator model.


GGADN: Guided Generative Adversarial Dehazing Network

Zhang Jian (  41789332@qq.com )

Wuhan Sports University

Wanjuan Song

Hubei University of Education

Research Article

Keywords: Dehazing, GAN, Guidance, Loss function

Posted Date: April 7th, 2021

DOI: https://doi.org/10.21203/rs.3.rs-386958/v1

License: This work is licensed under a Creative Commons Attribution 4.0 International License.

Version of Record: A version of this preprint was published at Soft Computing on August 3rd, 2021. See the published version at https://doi.org/10.1007/s00500-021-06049-w.



J. Zhang
College of Sport Engineering and Information Technology, Wuhan Sports University, Wuhan, China

W. Song
College of Computer, Hubei University of Education, Wuhan, Hubei, China

E-mail: key

swj@whu.edu.cn


1 Introduction

In computer vision, weather is an important factor affecting image quality [1–6]. In haze especially, floating particles cause pictures to fade and blur and reduce their contrast and softness. These particles absorb and scatter light, resulting in serious color attenuation, poor clarity and contrast, and a poor visual effect, which seriously impacts subsequent computer vision tasks [7–16]. Therefore, it is necessary to remove haze effectively.

In recent years, research on image dehazing algorithms has made great progress. At present, image dehazing research is mainly divided into two types: feature-based methods and learning-based methods. The difficulty of feature-based methods lies in feature extraction and the choice of priors [17–20]. The common dehazing features and priors are as follows:

– Contrast: Tan found that the contrast of haze-free images is high. Thus, image dehazing is performed by maximizing the local contrast of the image.

– Dark channel prior: He found that the value of the dark channel in haze-free images is close to zero, which can then be used to estimate the transmission image.

– Color attenuation prior: Zhu found the relationship between haze concentration and brightness and saturation through statistics. A linear model of scene depth is then created to solve the scene, and the haze-free image is calculated.
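The dark channel prior listed above can be made concrete with a short NumPy sketch. It uses the usual definition of the dark channel (minimum over the color channels, then minimum over a local window); the function name `dark_channel`, the window size, and the synthetic test images are illustrative assumptions rather than details from this paper.

```python
import numpy as np

def dark_channel(image, patch=15):
    """Per-pixel minimum over RGB, followed by a minimum over a patch x patch window.

    image: float array of shape (H, W, 3).
    patch: odd window size (an assumed value, not specified in this paper).
    """
    img = np.asarray(image, dtype=np.float64)
    min_rgb = img.min(axis=2)                    # (H, W) minimum over channels
    pad = patch // 2
    padded = np.pad(min_rgb, pad, mode="edge")   # replicate borders
    h, w = min_rgb.shape
    out = np.empty_like(min_rgb)
    for y in range(h):
        for x in range(w):
            out[y, x] = padded[y:y + patch, x:x + patch].min()
    return out

# Hypothetical data: a "clear" image with scattered dark pixels, and a hazy
# version built with uniform transmittance 0.5 and atmospheric light 0.9.
rng = np.random.default_rng(0)
clear = rng.uniform(0.0, 1.0, size=(32, 32, 3))
clear[::4, ::4, 0] = 0.0
hazy = 0.5 * clear + 0.5 * 0.9
dc_clear = dark_channel(clear, patch=7)
dc_hazy = dark_channel(hazy, patch=7)
```

In this toy case the dark channel of the clear image is zero everywhere, while haze lifts it to a constant offset, which is the cue the prior exploits to estimate transmission.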

Learning-based dehazing algorithms can be divided into two kinds: step-by-step learning algorithms and end-to-end learning algorithms. Step-by-step learning algorithms are similar to the traditional methods, focusing on the prediction of intermediate variables. For example, Cai [21] designed DehazeNet by analyzing artificial prior features to predict the transmission image. Similarly, Ren [22] proposed a multi-scale convolutional neural network, MSCNN, which can accurately predict the transmission image through two network models at different scales. End-to-end learning algorithms can realize image dehazing simply and efficiently through the design of a fully convolutional neural network. For example, considering that the above algorithms ignore the reasonable prediction of the atmospheric light value, Li [23] integrated multiple intermediate variables in the atmospheric scattering model into one using a linear transformation, and proposed AODNet to directly predict haze-free images.

In this paper, we present a Guided Generative Adversarial Dehazing Network (GGADN). Different from other generative adversarial networks, GGADN adds a guided module to the generator. The guided module verifies the network of each layer of the generator. At the same time, the details of the map generated by each layer are strengthened. GGADN is trained and corrected on a synthetic hazy image dataset containing indoor and outdoor images. Network training is based on the pre-trained Visual Geometry Group (VGG) feature model and an L1-regularized gradient prior, which is developed by new loss function parameters. From the dehazing results on synthetic and real images, the proposed method is better than the state-of-the-art dehazing methods, and the dehazed image is clearer in detail.

2 Related Works

2.1 Atmospheric Scattering Model

The purpose of image defogging is to restore a clear image from the blurred image corroded by haze or smoke. In the field of computer vision, in order to overcome the image distortion caused by haze, McCartney [24] proposed an atmospheric scattering model which can be used to describe the formation process of a haze image [25–27]. The equation is as follows:

$$I(x) = J(x)\,t(x) + A\,(1 - t(x)) \tag{1}$$

where $I(x)$ is the haze image and $J(x)$ is the haze-free image. $A$ is the value of atmospheric light, representing the intensity of atmospheric light. $t(x)$ is the transmittance, which indicates the part of light that is not scattered when it reaches the imaging device through the atmospheric medium, and $x$ represents the pixel position. In the formula, the first term on the right is the direct attenuation term, which represents the reflected light of the object after atmospheric attenuation, and the second term is the enhanced atmospheric light obtained by atmospheric scattering.
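As an illustration of equation (1), the following NumPy sketch applies the scattering model to a clean image and then inverts it when the transmittance and atmospheric light are known. The function names `add_haze` and `remove_haze`, the lower bound `t_min`, and the random test data are hypothetical choices for this example.

```python
import numpy as np

def add_haze(J, t, A):
    """Equation (1): I(x) = J(x) t(x) + A (1 - t(x))."""
    t = t[..., np.newaxis]            # broadcast (H, W) transmittance over RGB
    return J * t + A * (1.0 - t)

def remove_haze(I, t, A, t_min=0.1):
    """Invert equation (1) for J when t and A are known.

    t_min is a hypothetical lower bound that avoids dividing by a
    near-zero transmittance; it is not taken from the source.
    """
    t = np.maximum(t, t_min)[..., np.newaxis]
    return (I - A) / t + A

rng = np.random.default_rng(1)
J = rng.uniform(0.0, 1.0, size=(4, 4, 3))     # hypothetical haze-free image
t = rng.uniform(0.2, 0.9, size=(4, 4))        # hypothetical transmittance map
A = 0.8                                       # hypothetical atmospheric light
I = add_haze(J, t, A)
J_rec = remove_haze(I, t, A)
```

The inversion is exact only because `t` and `A` are given here; in single image dehazing both are unknown, which is what makes the problem hard.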

When the composition of the atmosphere is uniform, that is, $A$ is constant, the transmittance can be expressed as:

$$t(x) = e^{-\beta d(x)} \tag{2}$$

where $\beta$ is the attenuation coefficient of the atmosphere and $d(x)$ is the depth of the scene. From the formula, it is not difficult to see that the scene depth and atmospheric light value have a great influence on the image dehazing effect. In single image dehazing, only the haze image is known; the atmospheric light value and scene depth are unknown, and the assumption of uniform atmospheric composition is not necessarily true. Therefore, how to remove haze effectively is a challenging problem.
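A small numeric sketch of equation (2) shows how transmittance falls off with depth. The depth values and the attenuation coefficient below are arbitrary illustrative numbers, and the function name `transmittance` is hypothetical.

```python
import numpy as np

def transmittance(depth, beta=1.0):
    """Equation (2): t(x) = exp(-beta * d(x))."""
    return np.exp(-beta * np.asarray(depth, dtype=np.float64))

depth = np.array([0.0, 1.0, 2.0, 5.0])   # hypothetical scene depths
t = transmittance(depth, beta=0.8)        # beta chosen arbitrarily
```

At zero depth the transmittance is 1 (no haze contribution), and it decays toward 0 as depth grows, so distant pixels are dominated by the atmospheric light term of equation (1).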

2.2 Generative Adversarial Networks

The generative adversarial network (GAN) is a deep learning model designed by Goodfellow [28] and is one of the most promising unsupervised learning methods for complex distributions in recent years [29–35]. The model produces good output through the game learning of two modules in the framework: the generative model and the discriminative model. In the original GAN theory, G and D need not be neural networks; they only need to fit the corresponding generating and discriminating functions. In practice, however, deep neural networks are generally used as G and D. An excellent GAN application needs a good training method; otherwise the output may not be ideal due to the freedom of the neural network model.

However, the network structure of GAN is unstable in the training process, and artifacts such as noise and color shift are often produced in the synthetic image. Sohn [36] introduced conditional information into GAN. The added conditional information variables ensure the stability of the learning process to a certain extent and improve the representation ability of the generator, but the running time is too long. Deniz [37] proposed an end-to-end network called Cycle-Dehaze for single image dehazing. In order to improve the quality of texture information recovery and produce a better visual haze-free image, the CycleGAN formulation is enhanced by combining cycle consistency and perceptual loss. Du [38] also uses an end-to-end learning method to learn the mapping from hazy image to haze-free image directly. This dehazing network is trained adversarially through the GAN model. An adaptive loss function is used in the discriminator, and a post-processing method that removes halo artifacts using guided filtering is proposed.

Although existing GAN networks have achieved some results in haze removal, there are also some problems. Firstly, the GAN model is defined as a minimax problem, which has no loss function. It is difficult to tell whether the GAN is making progress in the training process. The learning process of GAN may suffer from collapse, in which the generator begins to degenerate: it always generates the same sample points and cannot continue learning. When the generative model collapses, the discriminative model will also point in a similar direction for the similar sample points, so the training cannot continue. Secondly, haze not only reduces the quality of the image but also blurs its details. It is difficult for the generator of a GAN to restore the details of the whole image while removing the haze. Especially for images with complex structure, the dehazing effect is not ideal.

In view of the above problems, this paper improves the dehazing GAN and designs a Guided Generative Adversarial Dehazing Network (GGADN).

3 The proposed method

This paper presents a Guided Generative Adversarial Dehazing Network (GGADN). In GGADN, the generator and discriminator architectures are modified. The network is trained end-to-end on the synthetic data set. The loss function is modified by using the pre-trained VGG features and an L1-regularized gradient. A sigmoid function is introduced at the last layer of the discriminator for feature mapping, so that the discriminant results are normalized to [0,1] for probability analysis.
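The role of the sigmoid at the end of the discriminator can be sketched in a few lines. This is a generic sigmoid written in its tanh form for numerical stability, not the authors' Keras layer; the raw scores are hypothetical.

```python
import numpy as np

def sigmoid(z):
    """Map real-valued scores to probabilities using 0.5 * (1 + tanh(z / 2))."""
    z = np.asarray(z, dtype=np.float64)
    return 0.5 * (1.0 + np.tanh(0.5 * z))

logits = np.array([-4.0, 0.0, 4.0])   # hypothetical raw scores from the last layer
probs = sigmoid(logits)
```

Each output can then be read as the probability that the discriminator assigns to its input being a real haze-free image.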


Frequently Asked Questions (17)

Q1. What is the training epoch of the model?

The Keras deep learning framework is used to train the model, the RMSprop algorithm is used to optimize the model parameters, and the number of training epochs is 100.

Q2. What is the training of the network?

Network training is based on the pre-trained VGG feature model and L1-regularized gradient prior which is developed by new loss function parameters.

Q3. What is the purpose of the proposed algorithm?

In the proposed algorithm, an end-to-end dehazing network is trained to avoid the image distortion or artifacts caused by estimating the transmittance and the atmospheric light value.

Q4. How many channels are used in the GGan training method?

The GGan training method directly uses the adversarial loss function, which is expressed as:

$$L_A = \frac{1}{N}\sum_{i=1}^{N} \log\left[1 - D(I_i, \tilde{J}_i)\right] \tag{3}$$

where $D$ is the discriminator network and $\tilde{J}_i$ is the output of generator $G$. Inputs are fed to the discriminator in minibatch mode.

Q5. What is the purpose of the generator?

The generator introduces the skip connections between symmetric layers used in the "ResNet" and "U-Net" models to break through the bottleneck of information redundancy in the process of dehazing.

Q6. How is the cycle haze agent enhanced?

In order to improve the quality of texture information recovery and produce a better visual haze free image, the CycleGan agent is enhanced by combining cycle consistency and perceptual loss.

Q7. What is the main reason for the haze?

Especially for haze, floating particles in haze lead to the fading and blurring of pictures, and the reduction of contrast and softness.

Q8. What is the purpose of the paper?

In order to evaluate the algorithm objectively, this paper uses Peak Signal to Noise Ratio (PSNR) [43] and Structural Similarity index (SSIM) [44] as objective evaluation indexes.
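For reference, PSNR can be computed with the standard definition, 10 log10(MAX² / MSE). The sketch below assumes images scaled to [0, 1]; the function name `psnr` and the test images are illustrative. SSIM involves local means, variances and covariances and is not sketched here.

```python
import numpy as np

def psnr(reference, estimate, max_value=1.0):
    """Peak Signal to Noise Ratio in dB between two images of equal shape."""
    ref = np.asarray(reference, dtype=np.float64)
    est = np.asarray(estimate, dtype=np.float64)
    mse = np.mean((ref - est) ** 2)
    if mse == 0:
        return float("inf")           # identical images
    return float(10.0 * np.log10(max_value ** 2 / mse))

ground_truth = np.zeros((8, 8, 3))    # hypothetical haze-free reference
dehazed = ground_truth + 0.1          # hypothetical result with a uniform error of 0.1
score = psnr(ground_truth, dehazed)
```

A uniform error of 0.1 gives a mean squared error of 0.01 and therefore a PSNR of about 20 dB; higher values indicate a result closer to the reference.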

Q9. What is the value of dark channel in haze free image?

He found that the value of dark channel in haze free image is close to zero, and then it can be used to estimate the transmission image.

Q10. What is the transmittance of the atmosphere?

When the composition of the atmosphere is uniform, that is, A(x) is constant, the transmittance can be expressed as: $t(x) = e^{-\beta d(x)}$

Q11. What is the main problem with the Gan network?

the network structure of Gan is unstable in the training process, and some artifacts such as noise and color shift are often produced in the synthetic image.

Q12. What is the purpose of image defogging?

The purpose of image defogging is to restore a clear image from the blurred image corroded by haze or smoke.

Q13. What are the advantages of proposed method?

Compared with other dehazing methods (DCP, DehazeNet, MSCNN and AODNet ), the advantages of proposed method are that the detail information of dehazing image is preserved completely, the color recovery is more natural, and the degree of dehazing is moderate.

Q14. What are the advantages of the proposed method?

AODNet method can generate a clear image as shown in the fifth image, but compared with the dehazing image generated by proposed algorithm, there are more details of unclear objects, the color of dehazing image is dark, and the sky part of the image has slight color distortion.

Q15. How many training sets are used to train the model?

In image processing, the whole large training set is divided into several small training sets to improve the computational efficiency and help to train the model quickly.
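Splitting a training set into minibatches can be sketched as follows. The sample count of 3000 matches the number of synthetic images mentioned in the summary; the batch size of 64, the shuffling seed, and the function name `minibatches` are assumptions made for this example.

```python
import numpy as np

def minibatches(num_samples, batch_size, seed=0):
    """Yield shuffled index arrays that together cover every sample once."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(num_samples)
    for start in range(0, num_samples, batch_size):
        yield order[start:start + batch_size]

# 3000 training images split into batches of 64 (batch size is hypothetical).
batches = list(minibatches(num_samples=3000, batch_size=64))
```

Each index array selects one small training set; the last batch is shorter when the batch size does not divide the number of samples.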

Q16. Why is there still a gap between the real scene and the synthetic haze image?

Because there is still a gap between the real scene and the synthetic haze image in visual perception, in order to verify the generalization ability of proposed method, the proposed method is compared with DCP, DehazeNet, MSCNN and AODNet in the natural scene images.

Q17. How can the authors predict the transmission image?

Ren [22] proposed a multi-scale convolutional neural network mscnn, which can accurately predict the transmission image through two different scale network models.
