Hyperspectral Image Classification With Convolutional Neural Network and Active Learning

doi:10.1109/TGRS.2020.2964627

Journal Article•DOI•

Xiangyong Cao¹, Jing Yao¹, Zongben Xu¹, Deyu Meng¹•Institutions (1)

Xi'an Jiaotong University¹

03 Feb 2020-IEEE Transactions on Geoscience and Remote Sensing (IEEE)-Vol. 58, Iss: 7, pp 4604-4616

TL;DR: This article presents an active deep learning approach for HSI classification, which integrates both active learning and deep learning into a unified framework and achieves better performance on three benchmark HSI data sets with significantly fewer labeled samples.

Abstract: Deep neural networks have been extensively applied to hyperspectral image (HSI) classification recently. However, their success is largely attributed to numerous labeled samples, whose acquisition costs a large amount of time and money. To improve classification performance while reducing the labeling cost, this article presents an active deep learning approach for HSI classification, which integrates active learning and deep learning into a unified framework. First, we train a convolutional neural network (CNN) with a limited number of labeled pixels. Next, we actively select the most informative pixels from the candidate pool for labeling. Then, the CNN is fine-tuned with the new training set constructed by incorporating the newly labeled pixels. These two steps (selection and fine-tuning) are conducted iteratively. Finally, a Markov random field (MRF) is utilized to enforce class label smoothness and further boost the classification performance. Compared with other state-of-the-art traditional and deep learning-based HSI classification methods, our proposed approach achieves better performance on three benchmark HSI data sets with significantly fewer labeled samples.
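
The train, select, and fine-tune cycle described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state the selection criterion, so maximum posterior entropy is used here as an assumption, and the names `fit`, `predict_proba`, `oracle`, and `active_learning_loop` are hypothetical placeholders for the CNN training routine, the CNN posterior, and the human annotator. The final MRF smoothing step is omitted.

```python
import math
from typing import Callable, Dict, Hashable, List, Sequence


def entropy(probs: Sequence[float]) -> float:
    """Shannon entropy of one posterior distribution (higher = less certain)."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def active_learning_loop(
    fit: Callable[[Dict[Hashable, int]], None],            # hypothetical: (re)train the model
    predict_proba: Callable[[Hashable], Sequence[float]],  # hypothetical: class posterior of a pixel
    oracle: Callable[[Hashable], int],                     # hypothetical: annotator returning a label
    labeled: Dict[Hashable, int],
    pool: List[Hashable],
    rounds: int,
    batch_size: int,
) -> Dict[Hashable, int]:
    """Return the labeled set after `rounds` query/fine-tune iterations."""
    labeled = dict(labeled)
    pool = [px for px in pool if px not in labeled]
    fit(labeled)  # Step 1: train on the small initial labeled set.
    for _ in range(rounds):
        if not pool:
            break
        # Step 2: rank candidates by uncertainty (assumed criterion: entropy).
        ranked = sorted(pool, key=lambda px: entropy(predict_proba(px)), reverse=True)
        queried = ranked[:batch_size]
        for px in queried:
            labeled[px] = oracle(px)
        queried_set = set(queried)
        pool = [px for px in pool if px not in queried_set]
        fit(labeled)  # Step 3: fine-tune on the enlarged training set.
    return labeled
```

Passing the model as callables keeps the sketch independent of any deep learning framework; in practice `fit` would continue training from the current weights rather than restart from scratch.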

Citations

Journal Article•DOI•

More Diverse Means Better: Multimodal Deep Learning Meets Remote Sensing Imagery Classification

Danfeng Hong¹, Lianru Gao², Naoto Yokoya³, Jing Yao⁴, Jocelyn Chanussot¹, Qian Du⁵, Bing Zhang²•Institutions (5)

University of Grenoble¹, Chinese Academy of Sciences², University of Tokyo³, Xi'an Jiaotong University⁴, Mississippi State University⁵

12 Aug 2020-arXiv: Computer Vision and Pattern Recognition

TL;DR: This work provides a baseline solution to the limited information diversity of single-modality classification by developing a general multimodal deep learning (MDL) framework that is not limited to pixel-wise classification tasks but is also applicable to spatial information modeling with convolutional neural networks (CNNs).

Abstract: Classification and identification of the materials lying over or beneath the Earth's surface have long been a fundamental but challenging research topic in geoscience and remote sensing (RS) and have garnered growing attention owing to the recent advancements of deep learning techniques. Although deep networks have been successfully applied in single-modality-dominated classification tasks, their performance inevitably meets a bottleneck in complex scenes that need to be finely classified, due to the limitation of information diversity. In this work, we provide a baseline solution to the aforementioned difficulty by developing a general multimodal deep learning (MDL) framework. In particular, we also investigate a special case of multi-modality learning (MML), namely cross-modality learning (CML), which exists widely in RS image classification applications. By focusing on "what", "where", and "how" to fuse, we show different fusion strategies as well as how to train deep networks and build the network architecture. Specifically, five fusion architectures are introduced and developed, further being unified in our MDL framework. More significantly, our framework is not limited to pixel-wise classification tasks but is also applicable to spatial information modeling with convolutional neural networks (CNNs). To validate the effectiveness and superiority of the MDL framework, extensive experiments related to the settings of MML and CML are conducted on two different multimodal RS datasets. Furthermore, the codes and datasets will be available at https://github.com/danfenghong/IEEE_TGRS_MDL-RS, contributing to the RS community.

582 citations

Journal Article•DOI•

A survey: Deep learning for hyperspectral image classification with few labeled samples

Sen Jia¹, Shuguo Jiang¹, Zhijie Lin¹, Nanying Li¹, Meng Xu¹, Shiqi Yu²•Institutions (2)

Shenzhen University¹, Southern University of Science and Technology²

11 Aug 2021-Neurocomputing

TL;DR: Although there is a vast gap between deep learning models (that usually need sufficient labeled samples) and the HSI scenario with few labeled samples, the issues of small-sample sets can be well characterized by fusion of deep learning methods and related techniques, such as transfer learning and a lightweight model.

170 citations

Cites methods from "Hyperspectral Image Classification ..."

...In contrast, the active learning method based on posterior probability [98, 99, 100] is more widely used....
[...]
...[100] use convolutional neural networks to generate the posterior probability....
[...]

Journal Article•DOI•

More Diverse Means Better: Multimodal Deep Learning Meets Remote-Sensing Imagery Classification

Danfeng Hong¹, Lianru Gao², Naoto Yokoya³, Jing Yao⁴, Jocelyn Chanussot¹, Qian Du⁵, Bing Zhang²•Institutions (5)

University of Grenoble¹, Chinese Academy of Sciences², University of Tokyo³, Xi'an Jiaotong University⁴, Mississippi State University⁵

01 May 2021-IEEE Transactions on Geoscience and Remote Sensing

TL;DR: In this article, a general multimodal deep learning (MDL) framework is proposed for geoscience and remote sensing (RS) applications; it is not limited to pixel-wise classification tasks but is also applicable to spatial information modeling with CNNs.

Abstract: Classification and identification of the materials lying over or beneath the earth’s surface have long been a fundamental but challenging research topic in geoscience and remote sensing (RS), and have garnered growing attention owing to the recent advancements of deep learning techniques. Although deep networks have been successfully applied in single-modality-dominated classification tasks, their performance inevitably meets a bottleneck in complex scenes that need to be finely classified, due to the limitation of information diversity. In this work, we provide a baseline solution to the aforementioned difficulty by developing a general multimodal deep learning (MDL) framework. In particular, we also investigate a special case of multi-modality learning (MML), namely cross-modality learning (CML), which exists widely in RS image classification applications. By focusing on “what,” “where,” and “how” to fuse, we show different fusion strategies as well as how to train deep networks and build the network architecture. Specifically, five fusion architectures are introduced and developed, further being unified in our MDL framework. More significantly, our framework is not limited to pixel-wise classification tasks but is also applicable to spatial information modeling with convolutional neural networks (CNNs). To validate the effectiveness and superiority of the MDL framework, extensive experiments related to the settings of MML and CML are conducted on two different multimodal RS data sets. Furthermore, the codes and data sets will be available at https://github.com/danfenghong/IEEE_TGRS_MDL-RS, contributing to the RS community.

165 citations

Journal Article•DOI•

X-ModalNet: A semi-supervised deep cross-modal network for classification of remote sensing data

Danfeng Hong¹,², Naoto Yokoya³, Gui-Song Xia⁴, Jocelyn Chanussot⁵,⁶, Xiao Xiang Zhu¹,²•Institutions (6)

Technische Universität München¹, German Aerospace Center², University of Tokyo³, Wuhan University⁴, University of Grenoble⁵, Chinese Academy of Sciences⁶

01 Sep 2020-ISPRS Journal of Photogrammetry and Remote Sensing

TL;DR: This paper proposes a novel cross-modal deep-learning framework, called X-ModalNet, with three well-designed modules: self-adversarial module, interactive learning module, and label propagation module, aimed at learning to transfer more discriminative information from a small-scale hyperspectral image (HSI) into the classification task using a large-scale MSI or SAR data.

Abstract: This paper addresses the problem of semi-supervised transfer learning with limited cross-modality data in remote sensing. A large amount of multi-modal earth observation images, such as multispectral imagery (MSI) or synthetic aperture radar (SAR) data, are openly available on a global scale, enabling parsing global urban scenes through remote sensing imagery. However, their ability in identifying materials (pixel-wise classification) remains limited, due to the noisy collection environment and poor discriminative information as well as limited number of well-annotated training images. To this end, we propose a novel cross-modal deep-learning framework, called X-ModalNet, with three well-designed modules: self-adversarial module, interactive learning module, and label propagation module, by learning to transfer more discriminative information from a small-scale hyperspectral image (HSI) into the classification task using a large-scale MSI or SAR data. Significantly, X-ModalNet generalizes well, owing to propagating labels on an updatable graph constructed by high-level features on the top of the network, yielding semi-supervised cross-modality learning. We evaluate X-ModalNet on two multi-modal remote sensing datasets (HSI-MSI and HSI-SAR) and achieve a significant improvement in comparison with several state-of-the-art methods.

159 citations

Cites background from "Hyperspectral Image Classification ..."

...…which is of great benefit to many potential applications such as image classification (Tuia et al., 2015; Han et al., 2018; Srivastava et al., 2019; Cao et al., 2020a), object and change detection (Zhang et al., 2018b, 2019b; Wu et al., 2019; Wu et al., 2020), mineral exploration (Gao et al.,…...
[...]
...Cao et al. (2020b) integrated CNNs and active learning to better utilize the unlabeled samples for hyperspectral image classification....
[...]
...[10] integrated CNNs and active learning to better utilize the unlabeled samples for hyperspectral image classification....
[...]

Journal Article•DOI•

Spectral Superresolution of Multispectral Imagery With Joint Sparse and Low-Rank Learning

Lianru Gao¹, Danfeng Hong², Jing Yao³, Bing Zhang¹, Paolo Gamba⁴, Jocelyn Chanussot¹•Institutions (4)

Chinese Academy of Sciences¹, University of Grenoble², Xi'an Jiaotong University³, University of Pavia⁴

01 Mar 2021-IEEE Transactions on Geoscience and Remote Sensing

TL;DR: In this article, a joint sparse and low-rank learning (J-SLoL) method is proposed to spectrally enhance multispectral (MS) images by jointly learning low-rank HS–MS dictionary pairs from overlapped regions.

Abstract: Extensive attention has been paid to enhancing the spatial resolution of hyperspectral (HS) images with the aid of multispectral (MS) images in remote sensing. However, the ability in the fusion of HS and MS images remains to be improved, particularly in large-scale scenes, due to the limited acquisition of HS images. Alternatively, we super-resolve MS images in the spectral domain by means of partially overlapped HS images, yielding a novel and promising topic: spectral superresolution (SSR) of MS imagery. This is a challenging and less investigated task due to its high ill-posedness in inverse imaging. To this end, we develop a simple but effective method, called joint sparse and low-rank learning (J-SLoL), to spectrally enhance MS images by jointly learning low-rank HS–MS dictionary pairs from overlapped regions. J-SLoL infers and recovers the unknown HS signals over a larger coverage by sparse coding on the learned dictionary pair. Furthermore, we validate the SSR performance on three HS–MS data sets (two for classification and one for unmixing) in terms of reconstruction, classification, and unmixing by comparing with several existing state-of-the-art baselines, showing the effectiveness and superiority of the proposed J-SLoL algorithm. The codes and data sets will be available at https://github.com/danfenghong/IEEE_TGRS_J-SLoL, contributing to the remote sensing (RS) community.

94 citations

References

Proceedings Article•

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe¹, Christian Szegedy¹•Institutions (1)

Google¹

06 Jul 2015

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.

Abstract: Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates and careful parameter initialization, and makes it notoriously hard to train models with saturating nonlinearities. We refer to this phenomenon as internal covariate shift, and address the problem by normalizing layer inputs. Our method draws its strength from making normalization a part of the model architecture and performing the normalization for each training mini-batch. Batch Normalization allows us to use much higher learning rates and be less careful about initialization, and in some cases eliminates the need for Dropout. Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin. Using an ensemble of batch-normalized networks, we improve upon the best published result on ImageNet classification: reaching 4.82% top-5 test error, exceeding the accuracy of human raters.
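
The per-mini-batch normalization described in this abstract can be illustrated with a small dependency-free sketch. It covers only the training-time forward pass on a batch of feature vectors, using the biased mini-batch variance and a learnable scale and shift; the function name `batch_norm` and the default `eps` are illustrative assumptions, and the running statistics used at inference time are left out.

```python
import math
from typing import List, Sequence


def batch_norm(
    batch: Sequence[Sequence[float]],  # shape: (batch size, number of features)
    gamma: Sequence[float],            # learnable per-feature scale
    beta: Sequence[float],             # learnable per-feature shift
    eps: float = 1e-5,                 # small constant for numerical stability
) -> List[List[float]]:
    """Normalize each feature over the mini-batch, then scale and shift."""
    n = len(batch)
    d = len(batch[0])
    # Per-feature mean and (biased) variance computed over the mini-batch.
    means = [sum(row[j] for row in batch) / n for j in range(d)]
    variances = [sum((row[j] - means[j]) ** 2 for row in batch) / n for j in range(d)]
    return [
        [
            gamma[j] * (row[j] - means[j]) / math.sqrt(variances[j] + eps) + beta[j]
            for j in range(d)
        ]
        for row in batch
    ]
```

With `gamma` set to ones and `beta` to zeros, every feature of the output has approximately zero mean and unit variance over the batch, regardless of the input scale.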

30,843 citations

"Hyperspectral Image Classification ..." refers to methods in this paper

...Also, the batch normalization (BN) [44] training strategy is used to help train the CNN....
[...]

Posted Content•

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe¹, Christian Szegedy¹•Institutions (1)

Google¹

11 Feb 2015-arXiv: Learning

TL;DR: Batch Normalization normalizes layer inputs for each training mini-batch to reduce internal covariate shift in deep neural networks; an ensemble of batch-normalized networks improves upon the best published result on ImageNet classification.

Abstract: Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates and careful parameter initialization, and makes it notoriously hard to train models with saturating nonlinearities. We refer to this phenomenon as internal covariate shift, and address the problem by normalizing layer inputs. Our method draws its strength from making normalization a part of the model architecture and performing the normalization for each training mini-batch. Batch Normalization allows us to use much higher learning rates and be less careful about initialization. It also acts as a regularizer, in some cases eliminating the need for Dropout. Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin. Using an ensemble of batch-normalized networks, we improve upon the best published result on ImageNet classification: reaching 4.9% top-5 validation error (and 4.8% test error), exceeding the accuracy of human raters.

17,184 citations

Journal Article•DOI•

Fast approximate energy minimization via graph cuts

Yuri Boykov¹, Olga Veksler¹, Ramin Zabih²•Institutions (2)

Princeton University¹, Cornell University²

01 Nov 2001-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This work presents two algorithms based on graph cuts that efficiently find a local minimum with respect to two types of large moves, namely expansion moves and swap moves; both algorithms allow important cases of discontinuity-preserving energies.

Abstract: Many tasks in computer vision involve assigning a label (such as disparity) to every pixel. A common constraint is that the labels should vary smoothly almost everywhere while preserving sharp discontinuities that may exist, e.g., at object boundaries. These tasks are naturally stated in terms of energy minimization. The authors consider a wide class of energies with various smoothness constraints. Global minimization of these energy functions is NP-hard even in the simplest discontinuity-preserving case. Therefore, our focus is on efficient approximation algorithms. We present two algorithms based on graph cuts that efficiently find a local minimum with respect to two types of large moves, namely expansion moves and swap moves. These moves can simultaneously change the labels of arbitrarily large sets of pixels. In contrast, many standard algorithms (including simulated annealing) use small moves where only one pixel changes its label at a time. Our expansion algorithm finds a labeling within a known factor of the global minimum, while our swap algorithm handles more general energy functions. Both of these algorithms allow important cases of discontinuity preserving energies. We experimentally demonstrate the effectiveness of our approach for image restoration, stereo and motion. On real data with ground truth, we achieve 98 percent accuracy.
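
To make the energy-minimization framing concrete, the sketch below evaluates a labeling energy made of a per-pixel data cost plus a Potts smoothness penalty on a 4-connected grid. It only computes the energy; it does not implement the expansion or swap algorithms. The function name `potts_energy`, the `unary` cost layout, and the `smoothness` weight are illustrative assumptions (in a classification setting the data cost is often taken as a negative log posterior).

```python
from typing import Sequence


def potts_energy(
    labels: Sequence[Sequence[int]],               # labels[r][c]: label of pixel (r, c)
    unary: Sequence[Sequence[Sequence[float]]],    # unary[r][c][k]: cost of label k at (r, c)
    smoothness: float,                             # penalty per pair of differing neighbors
) -> float:
    """Data cost plus Potts pairwise cost over right and down neighbors."""
    rows, cols = len(labels), len(labels[0])
    energy = 0.0
    for r in range(rows):
        for c in range(cols):
            energy += unary[r][c][labels[r][c]]
            # Visit each neighboring pair once via the down and right neighbors.
            if r + 1 < rows and labels[r][c] != labels[r + 1][c]:
                energy += smoothness
            if c + 1 < cols and labels[r][c] != labels[r][c + 1]:
                energy += smoothness
    return energy
```

A single pixel that disagrees with all four neighbors pays four smoothness penalties, so a labeling that absorbs it into its surroundings has lower energy whenever that pixel's data-cost advantage is smaller than the total penalty.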

7,413 citations

Journal Article•DOI•

Image Super-Resolution Using Deep Convolutional Networks

Chao Dong¹, Chen Change Loy¹, Kaiming He², Xiaoou Tang¹•Institutions (2)

The Chinese University of Hong Kong¹, Microsoft²

01 Feb 2016-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This article proposes a deep learning method for single image super-resolution (SR) that directly learns an end-to-end mapping between low- and high-resolution images.

Abstract: We propose a deep learning method for single image super-resolution (SR). Our method directly learns an end-to-end mapping between the low/high-resolution images. The mapping is represented as a deep convolutional neural network (CNN) that takes the low-resolution image as the input and outputs the high-resolution one. We further show that traditional sparse-coding-based SR methods can also be viewed as a deep convolutional network. But unlike traditional methods that handle each component separately, our method jointly optimizes all layers. Our deep CNN has a lightweight structure, yet demonstrates state-of-the-art restoration quality, and achieves fast speed for practical on-line usage. We explore different network structures and parameter settings to achieve trade-offs between performance and speed. Moreover, we extend our network to cope with three color channels simultaneously, and show better overall reconstruction quality.

6,122 citations

Active Learning Literature Survey

Burr Settles

01 Jan 2009

TL;DR: This report provides a general introduction to active learning and a survey of the literature, including a discussion of the scenarios in which queries can be formulated, and an overview of the query strategy frameworks proposed in the literature to date.

Abstract: The key idea behind active learning is that a machine learning algorithm can achieve greater accuracy with fewer training labels if it is allowed to choose the data from which it learns. An active learner may pose queries, usually in the form of unlabeled data instances to be labeled by an oracle (e.g., a human annotator). Active learning is well-motivated in many modern machine learning problems, where unlabeled data may be abundant or easily obtained, but labels are difficult, time-consuming, or expensive to obtain. This report provides a general introduction to active learning and a survey of the literature. This includes a discussion of the scenarios in which queries can be formulated, and an overview of the query strategy frameworks proposed in the literature to date. An analysis of the empirical and theoretical evidence for successful active learning, a summary of problem setting variants and practical issues, and a discussion of related topics in machine learning research are also presented.

5,227 citations

"Hyperspectral Image Classification ..." refers to methods in this paper

...A variety of heuristic AL strategies have been proposed in the machine learning field, such as uncertainty sampling [57], expected model change [58], variance reduction [59], estimated error reduction [60], and density-weighted methods [59]....
[...]
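
Of the strategies listed in the excerpt above, uncertainty sampling is the simplest to illustrate. The sketch below shows two common uncertainty scores computed from a classifier's posterior probabilities, least confidence and margin. The function names are illustrative, and nothing here is taken from the cited paper's implementation.

```python
from typing import Sequence


def least_confidence(probs: Sequence[float]) -> float:
    """One minus the top posterior; larger values mean a less certain prediction."""
    return 1.0 - max(probs)


def margin(probs: Sequence[float]) -> float:
    """Gap between the two most probable classes; smaller values mean less certain."""
    top_two = sorted(probs, reverse=True)[:2]
    return top_two[0] - top_two[1]
```

A sample with posterior `[0.4, 0.35, 0.25]` would be queried before one with `[0.9, 0.05, 0.05]` under either score, since its top class is both less probable and closer to the runner-up.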