Author

Fan Zhang

Bio: Fan Zhang is an academic researcher from Wuhan University. The author has contributed to research in the topics of feature extraction and feature (computer vision), has an h-index of 9, and has co-authored 12 publications receiving 1232 citations.

Papers
Journal ArticleDOI
TL;DR: The proposed unsupervised-feature-learning-based scene classification method provides more accurate classification results than the other latent-Dirichlet-allocation-based methods and the sparse coding method.
Abstract: Due to the rapid technological development of various satellite sensors, a huge volume of high-resolution image data sets can now be acquired. How to efficiently represent and recognize the scenes from such high-resolution image data has become a critical task. In this paper, we propose an unsupervised feature learning framework for scene classification. By using a saliency detection algorithm, we extract a representative set of patches from the salient regions in the image data set. These unlabeled data patches are exploited by an unsupervised feature learning method to learn a set of feature extractors which are robust and efficient and do not need elaborately designed descriptors such as the scale-invariant feature transform (SIFT). We show that the statistics generated from the learned feature extractors can characterize a complex scene very well and can produce excellent classification accuracy. In order to reduce overfitting in the feature learning step, we further employ a recently developed regularization method called “dropout,” which has proved to be very effective in image classification. In the experiments, the proposed method was applied to two challenging high-resolution data sets: the UC Merced data set, containing 21 different aerial scene categories with a submeter resolution, and the Sydney data set, containing seven land-use categories with a 60-cm spatial resolution. The proposed method obtained results that were equal to or even better than the previous best results with the UC Merced data set, and it also obtained the highest accuracy with the Sydney data set, demonstrating that the proposed unsupervised-feature-learning-based scene classification method provides more accurate classification results than the other latent-Dirichlet-allocation-based methods and the sparse coding method.

477 citations
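
To make the pipeline above concrete, here is a minimal sketch of patch-based unsupervised feature learning. It is not the authors' implementation: k-means stands in for the feature learner, patches are sampled uniformly rather than from salient regions, and the patch size, filter count, and toy image are assumptions.

```python
# Sketch: learn feature extractors from unlabeled patches, then encode with them.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.feature_extraction.image import extract_patches_2d

rng = np.random.default_rng(0)
image = rng.random((128, 128))                   # toy grayscale scene

# 1) Sample unlabeled patches (the paper samples from salient regions instead).
patches = extract_patches_2d(image, (8, 8), max_patches=500, random_state=0)
patches = patches.reshape(len(patches), -1)
patches -= patches.mean(axis=1, keepdims=True)   # per-patch normalization

# 2) Learn a dictionary of feature extractors from the unlabeled patches.
kmeans = KMeans(n_clusters=32, n_init=10, random_state=0).fit(patches)
filters = kmeans.cluster_centers_                # 32 learned 8x8 filters

# 3) Encode new patches against the learned filters (soft-threshold encoding).
test = extract_patches_2d(image, (8, 8), max_patches=100, random_state=1)
test = test.reshape(len(test), -1)
acts = test @ filters.T
features = np.maximum(acts - acts.mean(axis=1, keepdims=True), 0)
print(features.shape)                            # (100, 32) patch descriptors
```

Pooling these per-patch descriptors over an image would give the scene-level statistics the paper feeds to a classifier.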

Journal ArticleDOI
TL;DR: A gradient boosting random convolutional network (GBRCN) framework for scene classification, which can effectively combine many deep neural networks and can provide more accurate classification results than the state-of-the-art methods.
Abstract: Due to the recent advances in satellite sensors, large numbers of high-resolution remote sensing images are now being obtained each day. How to automatically recognize and analyze scenes from these satellite images effectively and efficiently has become a big challenge in the remote sensing field. Recently, much work on scene classification has focused on deep neural networks, which learn hierarchical internal feature representations from image data sets and produce state-of-the-art performance. However, most methods, including the traditional shallow methods and deep neural networks, concentrate on training only a single model. Meanwhile, neural network ensembles have proved to be a powerful and practical tool for a number of different predictive tasks. Can we find a way to combine different deep neural networks effectively and efficiently for scene classification? In this paper, we propose a gradient boosting random convolutional network (GBRCN) framework for scene classification, which can effectively combine many deep neural networks. As far as we know, this is the first time that a deep ensemble framework has been proposed for scene classification. In the experiments, the proposed method was applied to two challenging high-resolution data sets: 1) the UC Merced data set, containing 21 different aerial scene categories with a submeter resolution, and 2) a Sydney data set, containing eight land-use categories with a 1.0-m spatial resolution. The proposed GBRCN framework outperformed the state-of-the-art methods with the UC Merced data set, including the traditional single convolutional network approach. For the Sydney data set, the proposed method again obtained the best accuracy, demonstrating that the proposed framework can provide more accurate classification results than the state-of-the-art methods.

384 citations
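
The boosting idea can be sketched as follows: each new network is fit to the negative gradient of the classification loss with respect to the current ensemble's logits. The tiny CNN, shrinkage factor, and toy data below are illustrative assumptions, not the GBRCN implementation.

```python
# Sketch: boosting-style CNN ensembling in PyTorch.
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)
X = torch.randn(64, 1, 16, 16)           # toy "scenes"
y = torch.randint(0, 4, (64,))           # 4 toy classes

def make_cnn(n_classes=4):
    return nn.Sequential(
        nn.Conv2d(1, 8, 3, padding=1), nn.ReLU(),
        nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(8, n_classes))

ensemble, shrinkage = [], 0.5
logits = torch.zeros(64, 4)              # current additive prediction F_m(x)

for m in range(3):                       # add 3 boosted members
    # Negative gradient of multiclass cross-entropy w.r.t. the logits.
    residual = F.one_hot(y, 4).float() - F.softmax(logits, dim=1)
    net = make_cnn()
    opt = torch.optim.Adam(net.parameters(), lr=1e-2)
    for _ in range(100):                 # fit this member to the residual
        opt.zero_grad()
        loss = F.mse_loss(net(X), residual)
        loss.backward()
        opt.step()
    with torch.no_grad():
        logits = logits + shrinkage * net(X)
    ensemble.append(net)

print((logits.argmax(1) == y).float().mean())   # ensemble training accuracy
```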

Journal ArticleDOI
TL;DR: A band grouping-based long short-term memory model and a multiscale convolutional neural network are proposed as the spectral and spatial feature extractors, respectively, for the hyperspectral image (HSI) classification.
Abstract: In this paper, we propose a spectral–spatial unified network (SSUN) with an end-to-end architecture for the hyperspectral image (HSI) classification. Different from traditional spectral–spatial classification frameworks where the spectral feature extraction (FE), spatial FE, and classifier training are separated, these processes are integrated into a unified network in our model. In this way, both FE and classifier training will share a uniform objective function and all the parameters in the network can be optimized at the same time. In the implementation of the SSUN, we propose a band grouping-based long short-term memory model and a multiscale convolutional neural network as the spectral and spatial feature extractors, respectively. In the experiments, three benchmark HSIs are utilized to evaluate the performance of the proposed method. The experimental results demonstrate that the SSUN can yield a competitive performance compared with existing methods.

259 citations
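
A minimal PyTorch sketch of the unified spectral-spatial idea follows: an LSTM consumes the center pixel's spectrum in band groups while a stacked CNN extracts spatial features at two depths, and one shared head joins them so all parameters are optimized under a single objective. Layer widths, the band-group length, and the patch size are assumptions for illustration.

```python
# Sketch: one network, two branches, one objective.
import torch
import torch.nn as nn

class SpectralSpatialNet(nn.Module):
    def __init__(self, bands=100, group=10, n_classes=9):
        super().__init__()
        self.group = group
        # Spectral branch: each band group is one LSTM time step.
        self.lstm = nn.LSTM(input_size=group, hidden_size=64, batch_first=True)
        # Spatial branch: two depths of convolution over the image patch.
        self.conv1 = nn.Sequential(nn.Conv2d(bands, 16, 3, padding=1), nn.ReLU())
        self.conv2 = nn.Sequential(nn.Conv2d(16, 32, 3, padding=1), nn.ReLU())
        self.pool = nn.AdaptiveAvgPool2d(1)
        # Unified head: both branches share one loss, so FE and the
        # classifier are trained jointly end to end.
        self.head = nn.Linear(64 + 16 + 32, n_classes)

    def forward(self, patch):                     # patch: (B, bands, H, W)
        b, c, h, w = patch.shape
        center = patch[:, :, h // 2, w // 2]      # center-pixel spectrum
        seq = center.view(b, c // self.group, self.group)
        _, (h_n, _) = self.lstm(seq)
        spectral = h_n[-1]                        # (B, 64)
        f1 = self.conv1(patch)
        f2 = self.conv2(f1)
        spatial = torch.cat([self.pool(f1).flatten(1),
                             self.pool(f2).flatten(1)], dim=1)
        return self.head(torch.cat([spectral, spatial], dim=1))

net = SpectralSpatialNet()
out = net(torch.randn(4, 100, 9, 9))              # 4 patches, 100 bands
print(out.shape)                                  # torch.Size([4, 9])
```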

Journal ArticleDOI
TL;DR: A coupled CNN method that combines a candidate region proposal network and a localization network to extract proposals and simultaneously locate aircraft, which is more efficient and accurate, even in large-scale VHR images.
Abstract: Aircraft detection from very high resolution (VHR) remote sensing images has been drawing increasing interest in recent years due to its successful civil and military applications. However, several challenges still exist: 1) extracting the high-level features and hierarchical feature representations of the objects is difficult; 2) manual annotation of the objects in large image sets is generally expensive and sometimes unreliable; and 3) locating objects within such large images is difficult and time consuming. In this paper, we propose a weakly supervised learning framework based on coupled convolutional neural networks (CNNs) for aircraft detection, which can simultaneously solve these problems. We first develop a CNN-based method to extract the high-level features and hierarchical feature representations of the objects. We then employ an iterative weakly supervised learning framework to automatically mine and augment the training data set from the original image. Finally, we propose a coupled CNN method that combines a candidate region proposal network and a localization network to extract proposals and simultaneously locate the aircraft; this approach is more efficient and accurate, even in large-scale VHR images. In the experiments, the proposed method was applied to three challenging high-resolution data sets: the Sydney International Airport data set, the Tokyo Haneda Airport data set, and the Berlin Tegel Airport data set. The extensive experimental results confirm that the proposed method can achieve higher detection accuracy than the other methods.

229 citations
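
The iterative mining loop might be skeletonized as below; the sliding-window proposal generator, stand-in scoring network, and confidence threshold are simplifying assumptions rather than the authors' coupled-CNN implementation.

```python
# Sketch: mine confident windows, grow the training set, repeat.
import torch
import torch.nn as nn

torch.manual_seed(0)
scene = torch.randn(1, 3, 256, 256)               # one toy VHR image
scorer = nn.Sequential(                            # stand-in "localization" CNN
    nn.Conv2d(3, 8, 3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(8, 1))

def proposals(img, size=64, stride=64):
    """Crude sliding-window stand-in for the candidate region proposal network."""
    _, _, h, w = img.shape
    for y in range(0, h - size + 1, stride):
        for x in range(0, w - size + 1, stride):
            yield (y, x), img[:, :, y:y + size, x:x + size]

train_set = []                                     # mined pseudo-labeled windows
for iteration in range(2):                         # a couple of mining rounds
    mined = []
    with torch.no_grad():
        for (y, x), window in proposals(scene):
            score = torch.sigmoid(scorer(window)).item()
            if score > 0.6:                        # keep confident detections
                mined.append(((y, x), window))
    train_set.extend(mined)
    # ... the real method retrains the coupled CNNs on `train_set` and
    # re-scores the image with the updated model before the next round.
print(len(train_set), "windows mined")
```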

Journal ArticleDOI
TL;DR: This study proposes an efficient deep learning based method, namely, Random Patches Network (RPNet) for HSI classification, which directly regards the random patches taken from the image as the convolution kernels without any training.
Abstract: Due to the remarkable achievements obtained by deep learning methods in the field of computer vision, an increasing number of studies have sought to apply these powerful tools to hyperspectral image (HSI) classification. So far, most of these methods utilize a pre-training stage followed by a fine-tuning stage to extract deep features, which is not only tremendously time-consuming but also depends largely on a great deal of training data. In this study, we propose an efficient deep learning based method, namely, the Random Patches Network (RPNet), for HSI classification, which directly regards random patches taken from the image as the convolution kernels, without any training. By combining both shallow and deep convolutional features, RPNet has the advantage of being multi-scale, which makes it better suited to HSI classification, where different objects tend to have different scales. In the experiments, the proposed method and its two variants, RandomNet and RPNet-single, are tested on three benchmark hyperspectral data sets. The experimental results demonstrate that RPNet can yield a competitive performance compared with existing methods.

160 citations
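
The core trick is easy to sketch: the convolution kernels are patches cropped from the image itself, so no training is needed, and concatenating shallow and deep responses gives the multi-scale features. Patch size, kernel counts, and the omission of the paper's whitening step are simplifications here.

```python
# Sketch: random patches from the image serve as untrained convolution kernels.
import torch
import torch.nn.functional as F

torch.manual_seed(0)
hsi = torch.randn(1, 30, 64, 64)          # toy hyperspectral cube (30 bands)

def random_patch_layer(x, n_kernels=8, size=5):
    """Convolve x with `n_kernels` patches sampled from x itself (no training)."""
    _, c, h, w = x.shape
    kernels = []
    for _ in range(n_kernels):
        y0 = torch.randint(0, h - size + 1, (1,)).item()
        x0 = torch.randint(0, w - size + 1, (1,)).item()
        kernels.append(x[0, :, y0:y0 + size, x0:x0 + size])
    weight = torch.stack(kernels)          # (n_kernels, c, size, size)
    return F.relu(F.conv2d(x, weight, padding=size // 2))

# Stack layers and keep both shallow and deep responses (multi-scale features).
f1 = random_patch_layer(hsi)
f2 = random_patch_layer(f1)
features = torch.cat([f1, f2], dim=1)     # feed these to any pixel classifier
print(features.shape)                      # torch.Size([1, 16, 64, 64])
```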


Cited by
Journal ArticleDOI
TL;DR: The challenges of using deep learning for remote-sensing data analysis are analyzed, recent advances are reviewed, and resources are provided that the authors hope will make deep learning in remote sensing seem ridiculously simple.
Abstract: Central to the looming paradigm shift toward data-intensive science, machine-learning techniques are becoming increasingly important. In particular, deep learning has proven to be both a major breakthrough and an extremely powerful tool in many fields. Shall we embrace deep learning as the key to everything? Or should we resist a black-box solution? These are controversial issues within the remote-sensing community. In this article, we analyze the challenges of using deep learning for remote-sensing data analysis, review recent advances, and provide resources we hope will make deep learning in remote sensing seem ridiculously simple. More importantly, we encourage remote-sensing scientists to bring their expertise into deep learning and use it as an implicit general model to tackle unprecedented, large-scale, influential challenges, such as climate change and urbanization.

2,095 citations

Journal ArticleDOI
TL;DR: A general framework of DL for RS data is provided, and the state-of-the-art DL methods in RS are regarded as special cases of input-output data combined with various deep networks and tuning tricks.
Abstract: Deep-learning (DL) algorithms, which learn the representative and discriminative features in a hierarchical manner from the data, have recently become a hotspot in the machine-learning area and have been introduced into the geoscience and remote sensing (RS) community for RS big data analysis. Considering the low-level features (e.g., spectral and texture) as the bottom level, the output feature representation from the top level of the network can be directly fed into a subsequent classifier for pixel-based classification. As a matter of fact, by carefully addressing the practical demands in RS applications and designing the input–output levels of the whole network, we have found that DL is actually everywhere in RS data analysis: from the traditional topics of image preprocessing, pixel-based classification, and target recognition, to the recent challenging tasks of high-level semantic feature extraction and RS scene understanding.

1,625 citations
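
The input-output pattern described, top-level deep features feeding a conventional pixel-based classifier, can be illustrated as follows. The untrained extractor and random data are placeholders for illustration; a real pipeline would use a trained or pretrained network.

```python
# Sketch: hierarchical features at the top of a network -> subsequent classifier.
import torch
import torch.nn as nn
from sklearn.svm import LinearSVC

torch.manual_seed(0)
patches = torch.randn(200, 4, 9, 9)        # 200 pixels as 4-band 9x9 patches
labels = torch.randint(0, 3, (200,)).numpy()

extractor = nn.Sequential(                  # bottom: spectral/texture -> top: deep
    nn.Conv2d(4, 16, 3, padding=1), nn.ReLU(),
    nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten())

with torch.no_grad():
    feats = extractor(patches).numpy()      # top-level feature representation

clf = LinearSVC().fit(feats, labels)        # subsequent pixel-based classifier
print(clf.score(feats, labels))
```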

Proceedings ArticleDOI
01 Jun 2018
TL;DR: The Dataset for Object Detection in Aerial Images (DOTA) as discussed by the authors is a large-scale dataset of aerial images collected from different sensors and platforms and contains objects exhibiting a wide variety of scales, orientations, and shapes.
Abstract: Object detection is an important and challenging problem in computer vision. Although the past decade has witnessed major advances in object detection in natural scenes, such successes have been slow to transfer to aerial imagery, not only because of the huge variation in the scale, orientation, and shape of the object instances on the earth's surface, but also due to the scarcity of well-annotated datasets of objects in aerial scenes. To advance object detection research in Earth Vision, also known as Earth Observation and Remote Sensing, we introduce a large-scale Dataset for Object deTection in Aerial images (DOTA). To this end, we collect 2806 aerial images from different sensors and platforms. Each image is about 4000 × 4000 pixels in size and contains objects exhibiting a wide variety of scales, orientations, and shapes. These DOTA images are then annotated by experts in aerial image interpretation using 15 common object categories. The fully annotated DOTA images contain 188,282 instances, each of which is labeled by an arbitrary (8 d.o.f.) quadrilateral. To build a baseline for object detection in Earth Vision, we evaluate state-of-the-art object detection algorithms on DOTA. Experiments demonstrate that DOTA well represents real Earth Vision applications and is quite challenging.

1,502 citations
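
The 8-d.o.f. quadrilateral label format lends itself to a simple data structure. The helper below is illustrative and not part of the DOTA toolkit, including the conversion to the axis-aligned boxes many detectors expect.

```python
# Sketch: a quadrilateral annotation (8 degrees of freedom) and its
# enclosing axis-aligned bounding box.
from dataclasses import dataclass

@dataclass
class QuadAnnotation:
    points: list       # [(x1, y1), (x2, y2), (x3, y3), (x4, y4)], 8 d.o.f.
    category: str      # one of DOTA's 15 categories, e.g. "plane"

    def to_axis_aligned(self):
        """Enclosing (xmin, ymin, xmax, ymax) box for the quadrilateral."""
        xs = [p[0] for p in self.points]
        ys = [p[1] for p in self.points]
        return min(xs), min(ys), max(xs), max(ys)

ann = QuadAnnotation([(10, 40), (90, 20), (110, 70), (30, 95)], "plane")
print(ann.to_axis_aligned())    # (10, 20, 110, 95)
```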

Journal ArticleDOI
TL;DR: A large-scale data set, termed “NWPU-RESISC45,” is proposed, which is a publicly available benchmark for REmote Sensing Image Scene Classification (RESISC), created by Northwestern Polytechnical University (NWPU).
Abstract: Remote sensing image scene classification plays an important role in a wide range of applications and hence has been receiving remarkable attention. During the past years, significant efforts have been made to develop various datasets and approaches for scene classification from remote sensing images. However, a systematic review of the literature concerning datasets and methods for scene classification is still lacking. In addition, almost all existing datasets have a number of limitations, including the small number of scene classes and images, the lack of image variation and diversity, and the saturation of accuracy. These limitations severely hamper the development of new approaches, especially deep learning-based methods. This paper first provides a comprehensive review of the recent progress. Then, we propose a large-scale dataset, termed "NWPU-RESISC45", which is a publicly available benchmark for REmote Sensing Image Scene Classification (RESISC), created by Northwestern Polytechnical University (NWPU). This dataset contains 31,500 images, covering 45 scene classes with 700 images in each class. The proposed NWPU-RESISC45 (i) is large-scale in the number of scene classes and the total image number, (ii) holds big variations in translation, spatial resolution, viewpoint, object pose, illumination, background, and occlusion, and (iii) has high within-class diversity and between-class similarity. The creation of this dataset will enable the community to develop and evaluate various data-driven algorithms. Finally, several representative methods are evaluated using the proposed dataset and the results are reported as a useful baseline for future research.

1,424 citations
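
Assuming a class-per-folder local copy of the benchmark (45 folders of 700 images each), a split might look like the sketch below; the directory layout, file extension, and the 20% training ratio are assumptions about one common evaluation protocol, not an official loader.

```python
# Sketch: deterministic per-class train/test split over a local copy.
import random
from pathlib import Path

root = Path("NWPU-RESISC45")               # hypothetical local path
train, test = [], []
for class_dir in sorted(p for p in root.iterdir() if p.is_dir()):
    images = sorted(class_dir.glob("*.jpg"))   # 700 images per class
    random.Random(0).shuffle(images)
    split = int(0.2 * len(images))             # assumed 20%-train protocol
    train += [(img, class_dir.name) for img in images[:split]]
    test += [(img, class_dir.name) for img in images[split:]]
print(len(train), "training and", len(test), "test images")
```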

Journal ArticleDOI
TL;DR: A multilevel DL architecture targeting land cover and crop type classification from multitemporal multisource satellite imagery, in which an ensemble of CNNs outperforms MLPs and better discriminates certain summer crop types.
Abstract: Deep learning (DL) is a powerful state-of-the-art technique for image processing, including remote sensing (RS) images. This letter describes a multilevel DL architecture that targets land cover and crop type classification from multitemporal multisource satellite imagery. The pillars of the architecture are an unsupervised neural network (NN), which is used for optical imagery segmentation and restoration of data missing due to clouds and shadows, and an ensemble of supervised NNs. As the basic supervised architectures, we use a traditional fully connected multilayer perceptron (MLP) and the approach most commonly used in the RS community, random forest, and compare them with convolutional NNs (CNNs). Experiments are carried out for the Joint Experiment of Crop Assessment and Monitoring (JECAM) test site in Ukraine, for classification of crops in a heterogeneous environment using nineteen multitemporal scenes acquired by the Landsat-8 and Sentinel-1A RS satellites. The architecture with an ensemble of CNNs outperforms the one with MLPs, allowing us to better discriminate certain summer crop types, in particular maize and soybeans, and yielding target accuracies of more than 85% for all major crops (wheat, maize, sunflower, soybeans, and sugar beet).

1,155 citations
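
A simplified sketch of the supervised stage follows: a small ensemble of CNNs classifies pixels from a stacked multitemporal input, with predictions averaged across members. Scene and band counts, the network size, and the omission of the unsupervised restoration stage are toy assumptions.

```python
# Sketch: ensemble of CNNs over stacked multitemporal imagery.
import torch
import torch.nn as nn

torch.manual_seed(0)
# 19 scenes x 6 bands stacked per pixel, taken as 5x5 spatial neighborhoods.
X = torch.randn(128, 19 * 6, 5, 5)
y = torch.randint(0, 5, (128,))            # 5 toy crop classes

def make_member():
    return nn.Sequential(
        nn.Conv2d(19 * 6, 32, 3, padding=1), nn.ReLU(),
        nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(32, 5))

members = [make_member() for _ in range(3)]
for net in members:                         # train each ensemble member
    opt = torch.optim.Adam(net.parameters(), lr=1e-3)
    for _ in range(50):
        opt.zero_grad()
        nn.functional.cross_entropy(net(X), y).backward()
        opt.step()

with torch.no_grad():                       # average the members' predictions
    probs = torch.stack([net(X).softmax(1) for net in members]).mean(0)
print((probs.argmax(1) == y).float().mean())
```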