
biblio.ugent.be
The UGent Institutional Repository is the electronic archiving and dissemination platform for all
UGent research publications. Ghent University has implemented a mandate stipulating that all
academic publications of UGent researchers should be deposited and archived in this repository.
Except for items where current copyright restrictions apply, these papers are available in Open
Access.
This item is the archived peer-reviewed author-version of:
Hyperspectral Image Classification with Convolutional Neural Networks
Viktor Slavkovikj, Steven Verstockt, Wesley De Neve, Sofie Van Hoecke, and Rik Van de Walle
In: Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, 1159–1162, 2015.
http://doi.acm.org/10.1145/2733373.2806306
To refer to or to cite this work, please use the citation to the published version:
Slavkovikj, V., Verstockt, S., De Neve, W., Van Hoecke, S., and Van de Walle, R. (2015).
Hyperspectral Image Classification with Convolutional Neural Networks. Proceedings of the 23rd
Annual ACM Conference on Multimedia Conference 1159–1162. 10.1145/2733373.2806306

Hyperspectral Image Classification with Convolutional Neural Networks

Viktor Slavkovikj¹, Steven Verstockt¹, Wesley De Neve¹,², Sofie Van Hoecke¹, and Rik Van de Walle¹

¹ Multimedia Lab, Department of Electronics and Information Systems, Ghent University-iMinds, B-9050 Ledeberg-Ghent, Belgium
² Image and Video Systems Lab, Korea Advanced Institute of Science and Technology (KAIST), Yuseong-gu, Daejeon, 305-732, Republic of Korea

{viktor.slavkovikj, steven.verstockt, wesley.deneve, sofie.vanhoecke, rik.vandewalle}@ugent.be
ABSTRACT
Hyperspectral image (HSI) classification is one of the most widely used methods for scene analysis from hyperspectral imagery. In the past, many different engineered features have been proposed for the HSI classification problem. In this paper, however, we propose a feature learning approach for hyperspectral image classification based on convolutional neural networks (CNNs). The proposed CNN model is able to learn structured features, roughly resembling different spectral band-pass filters, directly from the hyperspectral input data. Our experimental results, conducted on a commonly-used remote sensing hyperspectral dataset, show that the proposed method provides classification results that are among the state-of-the-art, without using any prior knowledge or engineered features.
Categories and Subject Descriptors
I.5.1 [Pattern Recognition]: Models—neural nets; I.5.4
[Pattern Recognition]: Applications—computer vision;
I.4.8 [Image Processing and Computer Vision]: Scene
Analysis
Keywords
Classification, convolutional neural networks, deep learning,
hyperspectral imaging
1. INTRODUCTION
Recent developments of imaging spectroscopy sensors have enabled acquisition of hyperspectral images with a high spatial resolution, a characteristic which was previously exclusive to standard electro-optical systems. However, unlike
standard color images, images acquired with hyperspectral sensors have a much higher spectral resolution. This is advantageous for image analysis, because each hyperspectral pixel comprises a large number (on the order of hundreds) of measurements of the electromagnetic spectrum and therefore carries more information than a color pixel, which provides data only from the visible range of the spectrum. As a result, hyperspectral image analysis has found numerous biomedical, forensic, and remote sensing applications [18, 5, 21].
One of the principal techniques in hyperspectral image analysis is image classification, where a label is assigned to each pixel based on its characteristics. Inference of class labels from hyperspectral data is challenging, however, since classification methods are affected by the curse of dimensionality (i.e., the Hughes effect [12]). That is, classification accuracy suffers because the number of training samples available to populate the high-dimensional spectral space is limited. Therefore, many different feature extraction methods (see Section 2) have been proposed to tackle the classification problem in hyperspectral images. The goal of feature extraction is to reduce the dimensionality of the hyperspectral data while preserving as much of the discriminative information as possible, so that in a later stage a classifier can be trained on the extracted features. Since it is difficult to discern potentially relevant features from hyperspectral data, we approach hyperspectral image classification as an end-to-end learning [15, 25] task, where the assignment of class labels from hyperspectral input pixels is a single-stage learning process in which the intermediate feature representations are also learned. The contributions of our paper are twofold:
- We propose convolutional neural networks for hyperspectral image classification.
- We investigate hyperspectral data augmentation as a way of mitigating the problem of limited training samples in hyperspectral image classification.
The remainder of this paper is organized as follows. We discuss related work in hyperspectral image classification in Section 2. In Section 3, we present the architecture of the convolutional neural network that was used as the basis for our experiments. Section 4 describes hyperspectral data augmentation for alleviating the limited training samples problem. In Section 5, we report classification results obtained by the proposed method on a hyperspectral image dataset. Section 6 concludes the paper.
2. RELATED WORK
In the past, many different feature extraction and classification methods have been proposed for hyperspectral images. Some of the well-established feature extraction approaches are based on dimensionality reduction methods, such as principal component analysis (PCA) [11] or independent component analysis (ICA) [24]. These methods are aimed at projecting the hyperspectral data to a subspace in which class separation is performed more effectively. Similarly, to be able to calculate coordinates of data in a lower-dimensional space, manifold learning methods [8, 7] try to estimate the intrinsic geometry of the manifold embedded in the high-dimensional hyperspectral data space. Discriminant analysis methods [1, 3] have been used to learn a projection matrix in order to maximize a separability criterion of the projected data. Morphological features [2], on the other hand, were introduced to take advantage of structural information present in the images. They have been successfully combined with support vector machines [6], which are known for their good generalization properties for high-dimensional data with lower effective dimensionality [19].
Recently, statistical learning models, such as neural networks, have also been investigated for the purpose of hyperspectral image classification. For instance, Li et al. [17] have proposed a deep belief network (DBN) approach for classification of hyperspectral images. The model is a stack of restricted Boltzmann machines, which are trained by greedy layer-wise unsupervised learning [9]. However, because the data are reduced to the first three PCA components, the DBN model does not exploit the spectral characteristics of the images in a principled manner. Our proposed approach, by contrast, fully exploits the available spectral information in a hyperspectral image.
3. CNN ARCHITECTURE
Deep CNNs have been successfully applied in solving challenging tasks, such as image classification [13], speech recognition [16], music information retrieval [4], and text recognition [25]. However, to our knowledge, CNN models have not been studied in the literature for the purpose of hyperspectral image classification.
Due to network generalization issues [10], deep CNNs for image classification tasks require a large number of images to prevent overfitting, and thus appear inadequate for the HSI classification problem, where a dataset typically consists of a single capture of a scene. Furthermore, the large number of bands in hyperspectral images poses a computational challenge for a straightforward application of a CNN model.
We propose a CNN architecture which integrates both spatial and spectral information for simultaneous spatial-spectral classification of hyperspectral images. The proposed architecture is visualized in Figure 1. The input to the network consists of the eight-connected neighborhood of a hyperspectral pixel, to account for the spatial information context. In order to exploit the original spectral information, all convolutional operations are performed across the spectral bands. The network consists of 5 layers: three convolutional layers with width 16, followed by two fully connected layers with 800 units each. Note that the size of the filters in the first convolutional layer is 9 × 16, where the first dimension accounts for the total number of pixels in the spatial neighborhood window of the input pixel, and the second dimension is the width of the filter. This allows for simultaneous learning from both the spatial and spectral domain.
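As a concrete illustration of this input construction, the sketch below extracts the eight-connected neighborhood of a pixel from a hyperspectral cube and stacks it into a 9 × bands array. This is a minimal NumPy sketch under our own assumptions; the function name, the edge-padding at the image border, and the array layout are not specified in the paper.

```python
import numpy as np

def pixel_neighborhood(cube, row, col):
    """Stack the 3x3 (eight-connected) neighborhood of a pixel into a
    (9, n_bands) array, the network input described above.

    cube: hyperspectral image of shape (height, width, n_bands),
          e.g. (145, 145, 220) for Indian Pines.
    Border pixels are handled here by edge-padding; the paper does not
    specify a strategy, so this is an assumption.
    """
    padded = np.pad(cube, ((1, 1), (1, 1), (0, 0)), mode="edge")
    patch = padded[row:row + 3, col:col + 3, :]   # (3, 3, n_bands)
    return patch.reshape(9, -1)                   # (9, n_bands)

# Example: build the input for the pixel at (10, 20)
# cube = np.random.rand(145, 145, 220)            # stand-in for the real data
# x = pixel_neighborhood(cube, 10, 20)            # shape (9, 220)
```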
[Figure 1 diagram: hyperspectral pixel neighborhood → 1: convolution (9×16) #32 → 2: convolution (1×16) #32 → 3: convolution (1×16) #32 → 4: fully connected #800 → 5: fully connected #800 → pixel label prediction]
Figure 1: Diagram of the proposed convolutional neural network architecture for hyperspectral image classification. The sizes of the filters in the convolutional layers are indicated as (h × w), and # denotes the number of convolutional filters, as well as the number of hidden units in the fully connected layers.
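The authors did not publish an implementation; the following PyTorch sketch shows one plausible realization of the architecture in Figure 1. The framework choice, the class name, the 220-band input size (Indian Pines, Section 5), and applying the softmax inside the loss are our assumptions.

```python
import torch
import torch.nn as nn

class HSICNN(nn.Module):
    """Sketch of the architecture in Figure 1: three convolutional layers that
    slide along the spectral dimension, followed by two fully connected layers
    of 800 units each and a softmax over the land-cover classes."""

    def __init__(self, n_bands=220, n_classes=16):
        super().__init__()
        self.features = nn.Sequential(
            # input: (batch, 1, 9, n_bands) -- 9 neighborhood pixels x spectra
            nn.Conv2d(1, 32, kernel_size=(9, 16)),   # 9x16 filters, #32
            nn.Tanh(),
            nn.Conv2d(32, 32, kernel_size=(1, 16)),  # 1x16 filters, #32
            nn.Tanh(),
            nn.Conv2d(32, 32, kernel_size=(1, 16)),  # 1x16 filters, #32
            nn.Tanh(),
        )
        flat = 32 * (n_bands - 3 * (16 - 1))          # 32 * 175 for 220 bands
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(flat, 800), nn.Tanh(),
            nn.Linear(800, 800), nn.Tanh(),
            nn.Linear(800, n_classes),                # softmax applied in the loss
        )

    def forward(self, x):
        return self.classifier(self.features(x))

# x = torch.randn(50, 1, 9, 220)   # a minibatch of 50 pixel neighborhoods
# logits = HSICNN()(x)             # shape (50, 16)
```

Assuming the full 220 bands are used as input, the three valid convolutions shrink the spectral axis from 220 to 205, 190, and 175 samples, so the first fully connected layer sees 32 × 175 = 5600 inputs.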
In order to obtain the CNN architecture from Figure 1, we experimented with the number of layers, the number of hidden units in the fully connected layers, and the number and size of the filters in the convolutional layers. In addition, we tested several modifications of the original network. Namely, we experimented with max-pooling layers after the convolutional layers, and also with varying the stride of the convolutions. This worsened the classification results, which is indicative of non-stationarity of statistics across the spectral bands. Testing the hyperbolic tangent activation function produced slightly better results than rectified linear units [20] activation. As a result, we used hyperbolic tangent activations in all layers except the last layer, where the softmax function was used. We also attempted dropout regularization [22] in the fully connected layers; however, this did not improve the classification results.
We trained the network using minibatch gradient descent and momentum [23], and we set the size of the minibatches to 50 samples. We evaluated the model on a held-out validation set during training, and we report results on a separate test set for the model that achieved the best results on the validation set.
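A training loop along these lines might look as follows. This is only a sketch: the paper specifies minibatch gradient descent with momentum [23] and a batch size of 50, but not the learning rate, momentum coefficient, or number of epochs, so those values (and the helper names) are placeholders of ours.

```python
import copy
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

def train(model, train_set, val_set, epochs=100, lr=0.01, momentum=0.9):
    """Minibatch gradient descent with momentum (batch size 50), keeping the
    weights that score best on the held-out validation set.

    train_set, val_set: TensorDataset objects of (inputs, integer labels).
    lr, momentum, and epochs are illustrative placeholders.
    """
    loader = DataLoader(train_set, batch_size=50, shuffle=True)
    opt = torch.optim.SGD(model.parameters(), lr=lr, momentum=momentum)
    loss_fn = nn.CrossEntropyLoss()                 # softmax + negative log-likelihood
    best_acc, best_state = 0.0, copy.deepcopy(model.state_dict())

    for _ in range(epochs):
        model.train()
        for x, y in loader:
            opt.zero_grad()
            loss_fn(model(x), y).backward()
            opt.step()

        model.eval()
        with torch.no_grad():
            xv, yv = val_set.tensors
            acc = (model(xv).argmax(dim=1) == yv).float().mean().item()
        if acc > best_acc:                          # keep the best model so far
            best_acc, best_state = acc, copy.deepcopy(model.state_dict())

    model.load_state_dict(best_state)
    return model
```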
4. DATA AUGMENTATION
Identifying the classes of pixels in hyperspectral images to produce labeled training data is a manual task, which is expensive and time consuming. Therefore, available training samples for HSI classification are scarce. To try to alleviate this problem, we experimented with simple augmentation for hyperspectral data. For each class in the hyperspectral image dataset, we calculate the per-spectral-band standard deviation of the samples in the training set which belong to the class. Afterwards, we use the calculated vector of standard deviations σ as a parameter of a zero-mean multivariate normal distribution N(0, αΣ), where α is a scale factor, and Σ is a diagonal matrix containing σ along its main diagonal. Finally, the augmented samples for the class are generated by adding noise sampled from the distribution N to the original samples. We tried several values for the scaling factor in the set {1, 0.5, 0.33, 0.25, 0.125}, and fixed α = 0.25 for the experiments. The goal of the proposed hyperspectral data augmentation is to prevent overfitting in cases where only a small number of samples is available to train the network.
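In code, this augmentation step can be sketched as follows (NumPy; the function name and the default of three augmented copies, matching the 3-fold augmentation reported in Section 5, are our choices).

```python
import numpy as np

def augment_class(samples, alpha=0.25, folds=3, rng=None):
    """Generate noisy copies of one class's training spectra.

    samples: array of shape (n_samples, n_bands) holding the class's
             training spectra.
    Noise is drawn from N(0, alpha * Sigma), where Sigma is the diagonal
    matrix with the per-band standard deviations sigma on its diagonal,
    as described above.
    """
    rng = np.random.default_rng() if rng is None else rng
    sigma = samples.std(axis=0)            # per-band standard deviation (vector sigma)
    # Diagonal covariance alpha * diag(sigma): each band's noise variance is
    # alpha * sigma, so its noise standard deviation is sqrt(alpha * sigma).
    scale = np.sqrt(alpha * sigma)
    augmented = [samples]
    for _ in range(folds):
        noise = rng.standard_normal(samples.shape) * scale
        augmented.append(samples + noise)
    return np.concatenate(augmented, axis=0)   # original samples + `folds` noisy copies
```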
5. EXPERIMENTAL RESULTS
We tested our method on the commonly-used Indian Pines hyperspectral image dataset [14]. This dataset was acquired in June 1992 by NASA's Airborne Visible/Infrared Imaging Spectrometer (AVIRIS). The Indian Pines scene is a mixed forest and agricultural site in Northwest Indiana, captured at about 20 km altitude by the AVIRIS sensor. The hyperspectral image of the scene consists of 220 bands in the spectral range from 0.4 µm to 2.5 µm, with a spectral resolution of 10 nm. The whole scene consists of 145 × 145 pixels. There are in total 10,366 labeled samples. With a moderate geometrical resolution of 20 m per pixel, and 16 land cover classes, this dataset poses a challenging classification problem due to the unbalanced number of samples per class and the high inter-class similarity of samples in the dataset.
Table 1: AVIRIS Indian Pines dataset and per-class training sets and corresponding test sets.

                               Train set             Test set
 #  Class                    5%   10%   20%     5%    10%    20%
 1  Alfalfa                   3     6    11     25     24     21
 2  Corn-notil               72   144   287    681    645    573
 3  Corn-min                 42    84   167    396    375    333
 4  Corn                     12    24    47    111    105     93
 5  Grass-pasture            25    50   100    236    223    198
 6  Grass-trees              38    75   150    354    336    298
 7  Grass-pasture-mowed       2     3     6     12     11     10
 8  Hay-windrowed            25    49    98    232    220    195
 9  Oats                      1     2     4      9      9      8
10  Soybeans-notil           49    97   194    459    435    387
11  Soybeans-min            124   247   494  1,172  1,110    987
12  Soybeans-clean           31    62   123    291    276    245
13  Wheat                    11    22    43    100     95     84
14  Woods                    65   130   259    614    582    517
15  Bldg-grass-trees         19    38    76    180    171    152
16  Stone-steel-towers        5    10    19     45     42     38
    Total                   524 1,043 2,078  4,917  4,659  4,139
For our experiments, we evaluated the classification accuracy of the method using a balanced training set per class, with a low number of training samples. We trained the network with 5%, 10%, and 20% of randomly selected labeled samples per class, and equally divided the remaining labeled samples into separate validation and test sets. In each case, we repeated the experiment with and without hyperspectral data augmentation.
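A per-class split of this kind could be implemented as follows. The paper does not give the exact bookkeeping, so the function below (its name, rounding, and tie-breaking) is only an illustrative sketch.

```python
import numpy as np

def split_per_class(labels, train_frac=0.05, rng=None):
    """Return index arrays for the train / validation / test sets.

    For every class, train_frac of the labeled samples are drawn at random
    for training, and the remaining samples are divided equally between a
    validation set and a test set, as in the experiments above.
    """
    rng = np.random.default_rng() if rng is None else rng
    train_idx, val_idx, test_idx = [], [], []
    for c in np.unique(labels):
        idx = rng.permutation(np.flatnonzero(labels == c))
        n_train = max(1, int(round(train_frac * len(idx))))
        rest = idx[n_train:]
        half = len(rest) // 2
        train_idx.append(idx[:n_train])
        val_idx.append(rest[:half])
        test_idx.append(rest[half:])
    return (np.concatenate(train_idx),
            np.concatenate(val_idx),
            np.concatenate(test_idx))

# train, val, test = split_per_class(ground_truth_labels, train_frac=0.05)
```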
Figure 2: A subset of filters learned in the first convolutional layer of the network. Each subplot represents a (9 × 16) filter.
Table 2: Classification results for the Indian Pines image on the test sets.

                            Indian Pines test set
                        5%              10%             20%
Augmented
  OCA (%)         86.54 ± 0.30    92.70 ± 1.00    96.58 ± 0.55
  F1 score         0.86 ± 0.00     0.93 ± 0.01     0.97 ± 0.01
Non-augmented
  OCA (%)         85.46 ± 1.73    92.76 ± 0.93    96.54 ± 0.47
  F1 score         0.85 ± 0.02     0.93 ± 0.01     0.96 ± 0.00
The achieved classification results for each of the experiments are shown in Table 2. We performed 5 Monte Carlo runs, where for each run we selected a training set of 5%, 10%, and 20% of the labeled samples, as explained above, to train our model. In the cases with augmentation, we found 3-fold (per class) augmentation of the training data to give the best results. We report the average and standard error of the 5 Monte Carlo runs in terms of the overall classification accuracy (OCA), i.e., the fraction of correctly classified samples out of the total number of samples in the test set, and the F1 score, which is weighted so that it accounts for the imbalance of the classes. From the results in Table 2, it can be seen that only when a very low number of augmented labeled samples is used for training (5%) is there an improvement in the classification scores over the non-augmented counterpart. However, we have observed that in all cases augmentation reduced the number of training iterations significantly, as compared with training on the corresponding non-augmented data.
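The two reported scores can be computed, for example, with scikit-learn (a sketch; scikit-learn is our choice of tooling, not necessarily the authors').

```python
from sklearn.metrics import accuracy_score, f1_score

def evaluate(y_true, y_pred):
    """Overall classification accuracy (in %) and class-frequency-weighted F1."""
    oca = 100.0 * accuracy_score(y_true, y_pred)
    f1 = f1_score(y_true, y_pred, average="weighted")
    return oca, f1

# oca, f1 = evaluate(test_labels, model_predictions)
```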
We have visualized some of the learned filters from the first convolutional layer of the network in Figure 2. From the visualization, it is clear that the learned filters have a structured shape, and that some of the filters roughly resemble different spectral band-pass filters.
6. CONCLUSIONS
Due to the inherent nature of hyperspectral data, discernment of good features for hyperspectral image classification is difficult. Therefore, in this paper, we have presented a new approach towards hyperspectral image classification based on deep convolutional neural networks. To evaluate the effectiveness of the method, we performed experiments on a commonly-used hyperspectral image dataset. Our experimental results have shown that the neural network model can learn structured features resembling different spectral band-pass filters directly from the input data. These features prove useful for hyperspectral image classification, which makes end-to-end learning applicable to hyperspectral scene understanding.
7. ACKNOWLEDGMENTS
The research activities as described in this paper were
funded by Ghent University and the Interdisciplinary Re-
search Institute iMinds.
8. REFERENCES
[1] T. V. Bandos, L. Bruzzone, and G. Camps-Valls. Classification of hyperspectral images with regularized linear discriminant analysis. IEEE Transactions on Geoscience and Remote Sensing, 47(3):862–873, 2009.
[2] J. A. Benediktsson, J. A. Palmason, and J. R. Sveinsson. Classification of hyperspectral data from urban areas based on extended morphological profiles. IEEE Transactions on Geoscience and Remote Sensing, 43(3):480–491, 2005.
[3] D. Cai, X. He, and J. Han. Semi-supervised discriminant analysis. In Proceedings of the International Conference on Computer Vision (ICCV), pages 1–7, Oct 2007.
[4] S. Dieleman and B. Schrauwen. End-to-end learning for music audio. In Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, pages 6964–6968, May 2014.
[5] G. Edelman, E. Gaston, T. van Leeuwen, P. Cullen, and M. Aalders. Hyperspectral imaging for non-contact analysis of forensic traces. Forensic Science International, 223(1-3):28–39, 2012.
[6] M. Fauvel, J. A. Benediktsson, J. Chanussot, and J. R. Sveinsson. Spectral and spatial classification of hyperspectral data using SVMs and morphological profiles. IEEE Transactions on Geoscience and Remote Sensing, 46(11):3804–3814, 2008.
[7] X. He, D. Cai, S. Yan, and H.-J. Zhang. Neighborhood preserving embedding. In Proceedings of the International Conference on Computer Vision (ICCV), volume 2, pages 1208–1213, Oct 2005.
[8] X. He and P. Niyogi. Locality preserving projections. In NIPS, pages 153–160. MIT Press, 2004.
[9] G. E. Hinton, S. Osindero, and Y.-W. Teh. A fast learning algorithm for deep belief nets. Neural Computation, 18(7):1527–1554, July 2006.
[10] G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Improving neural networks by preventing co-adaptation of feature detectors. CoRR, abs/1207.0580, 2012.
[11] H. Hotelling. Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology, 24:417–441, 1933.
[12] G. Hughes. On the mean accuracy of statistical pattern recognizers. IEEE Transactions on Information Theory, 14(1):55–63, Jan 1968.
[13] A. Krizhevsky, I. Sutskever, and G. E. Hinton. ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems 25, pages 1097–1105, 2012.
[14] D. A. Landgrebe. Signal Theory Methods in Multispectral Remote Sensing. Wiley Series in Remote Sensing. Wiley, Hoboken, N.J., Chichester, 2003.
[15] Y. LeCun, U. Muller, J. Ben, E. Cosatto, and B. Flepp. Off-road obstacle avoidance through end-to-end learning. In Advances in Neural Information Processing Systems 18, pages 739–746, Cambridge, MA, 2006.
[16] H. Lee, P. Pham, Y. Largman, and A. Y. Ng. Unsupervised feature learning for audio classification using convolutional deep belief networks. In Advances in Neural Information Processing Systems 22, pages 1096–1104, 2009.
[17] T. Li, J. Zhang, and Y. Zhang. Classification of hyperspectral image based on deep belief networks. In Image Processing (ICIP), 2014 IEEE International Conference on, pages 5132–5136, Oct 2014.
[18] G. Lu and B. Fei. Medical hyperspectral imaging: a review. Journal of Biomedical Optics, 19(1):010901, 2014.
[19] F. Melgani and L. Bruzzone. Classification of hyperspectral remote sensing images with support vector machines. IEEE Transactions on Geoscience and Remote Sensing, 42(8):1778–1790, Aug 2004.
[20] V. Nair and G. E. Hinton. Rectified linear units improve restricted Boltzmann machines. In Proceedings of the 27th International Conference on Machine Learning (ICML-10), Haifa, Israel, pages 807–814, June 2010.
[21] A. Plaza, J. A. Benediktsson, J. W. Boardman, J. Brazile, L. Bruzzone, G. Camps-Valls, J. Chanussot, M. Fauvel, P. Gamba, A. Gualtieri, M. Marconcini, J. C. Tilton, and G. Trianni. Recent advances in techniques for hyperspectral image processing. Remote Sensing of Environment, 113, Supplement 1:S110–S122, 2009.
[22] N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Dropout: A simple way to prevent neural networks from overfitting. Journal of Machine Learning Research, 15(1):1929–1958, Jan. 2014.
[23] I. Sutskever, J. Martens, G. E. Dahl, and G. E. Hinton. On the importance of initialization and momentum in deep learning. In Proceedings of the 30th International Conference on Machine Learning (ICML-13), volume 28, pages 1139–1147, May 2013.
[24] J. Wang and C.-I. Chang. Independent component analysis-based dimensionality reduction with applications in hyperspectral image analysis. IEEE Transactions on Geoscience and Remote Sensing, 44(6):1586–1600, 2006.
[25] T. Wang, D. Wu, A. Coates, and A. Ng. End-to-end text recognition with convolutional neural networks. In Pattern Recognition (ICPR), 2012 21st International Conference on, pages 3304–3308, Nov 2012.