Journal ArticleDOI

Regularized CNN Feature Hierarchy for Hyperspectral Image Classification

10 Jun 2021 - Remote Sensing (Multidisciplinary Digital Publishing Institute) - Vol. 13, Iss. 12, p. 2275
TL;DR: In this paper, the authors propose an idea to enhance the generalization performance of CNNs for Hyperspectral Image Classification (HSIC) using soft labels that are a weighted average of the hard labels and a uniform distribution over labels.
Abstract: Convolutional Neural Networks (CNNs) have been rigorously studied for Hyperspectral Image Classification (HSIC) and are known to be effective in exploiting joint spatial-spectral information, at the expense of lower generalization performance and learning speed due to hard labels and the non-uniform distribution over labels. Therefore, this paper proposes an idea to enhance the generalization performance of CNNs for HSIC using soft labels that are a weighted average of the hard labels and a uniform distribution over labels. The proposed method helps to prevent the CNN from becoming over-confident. We empirically show that, in addition to improving generalization performance, this regularization also improves model calibration, which significantly improves beam-search. The experimental evaluation on several publicly available hyperspectral datasets reveals improved performance compared to state-of-the-art models, with overall accuracies of 99.29%, 99.97%, and 100.0% on the Indian Pines, Pavia University, and Salinas datasets, respectively.
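To make the soft-label construction concrete, here is a minimal Python sketch of the weighted average the abstract describes (commonly known as label smoothing); the smoothing weight alpha and the function name are illustrative assumptions, not the authors' code.

```python
import numpy as np

def smooth_labels(hard_labels: np.ndarray, num_classes: int, alpha: float = 0.1) -> np.ndarray:
    """Weighted average of one-hot (hard) labels and a uniform distribution.

    alpha is the smoothing weight (a hypothetical value; the paper's
    setting may differ). alpha = 0 recovers the original hard labels.
    """
    one_hot = np.eye(num_classes)[hard_labels]            # (N, C) one-hot targets
    uniform = np.full((1, num_classes), 1.0 / num_classes)
    return (1.0 - alpha) * one_hot + alpha * uniform      # soft targets

# Example: three samples, four classes -> 0.925 on the true class, 0.025 elsewhere
soft = smooth_labels(np.array([0, 2, 3]), num_classes=4, alpha=0.1)
```

Training against such soft targets penalizes over-confident predictions, which is what drives the calibration improvement the abstract reports.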


Citations
Posted Content
TL;DR: In this paper, a survey of state-of-the-art DL frameworks for hyperspectral imaging (HSI) classification is presented, covering spectral features, spatial features, and joint spectral-spatial features.
Abstract: Hyperspectral Imaging (HSI) has been extensively utilized in many real-life applications because it benefits from the detailed spectral information contained in each pixel. Notably, the complex characteristics of HSI data, i.e., the nonlinear relation between the captured spectral information and the corresponding object, make accurate classification challenging for traditional methods. In the last few years, deep learning (DL) has been substantiated as a powerful feature extractor that effectively addresses the nonlinear problems arising in a number of computer vision tasks. This has prompted the deployment of DL for HSI classification (HSIC), which has revealed good performance. This survey presents a systematic overview of DL for HSIC and compares state-of-the-art strategies on the topic. We first summarize the main challenges of traditional machine learning for HSIC and then explain how DL addresses these problems. The survey breaks down state-of-the-art DL frameworks into spectral-feature, spatial-feature, and joint spectral-spatial-feature approaches to systematically analyze their achievements (and future directions) for HSIC. Moreover, we consider the fact that DL requires a large number of labeled training examples, whereas acquiring such a number for HSIC is challenging in terms of time and cost. Therefore, this survey discusses some strategies to improve the generalization performance of DL methods, which can provide some future guidelines.

68 citations

Journal ArticleDOI
TL;DR: In this article, a survey of state-of-the-art DL frameworks for hyperspectral image classification (HSIC) is presented, and the authors discuss some strategies to improve the generalization performance of DL methods and provide some future guidelines.
Abstract: Hyperspectral imaging (HSI) has been extensively utilized in many real-life applications because it benefits from the detailed spectral information contained in each pixel. Notably, the complex characteristics of HSI data, i.e., the nonlinear relation between the captured spectral information and the corresponding object, make accurate classification challenging for traditional methods. In the last few years, deep learning (DL) has been substantiated as a powerful feature extractor that effectively addresses the nonlinear problems arising in a number of computer vision tasks. This has prompted the deployment of DL for HSI classification (HSIC), which has revealed good performance. This survey presents a systematic overview of DL for HSIC and compares state-of-the-art strategies on the topic. We first summarize the main challenges of traditional machine learning (TML) for HSIC and then explain how DL addresses these problems. This article breaks down state-of-the-art DL frameworks into spectral-feature, spatial-feature, and joint spectral-spatial-feature approaches to systematically analyze their achievements (and future research directions) for HSIC. Moreover, we consider the fact that DL requires a large number of labeled training examples, whereas acquiring such a number for HSIC is challenging in terms of time and cost. Therefore, this survey discusses some strategies to improve the generalization performance of DL methods, which can provide some future guidelines.

63 citations

Journal ArticleDOI
TL;DR: In this article, a hybrid 3D/2D CNN method, consisting of a combination of 3D CNN, 2D CNN, and depthwise separable convolution, is used together with dimensionality reduction methods to improve HSI classification performance (see the sketch below).
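As a rough illustration of this family of architectures, here is a hedged PyTorch sketch: a 3-D convolution extracts joint spectral-spatial features, the spectral axis is then folded into the channel dimension, and a depthwise separable 2-D convolution refines spatial features cheaply. All layer sizes, names, and the patch shape are illustrative assumptions, not the paper's configuration.

```python
import torch
import torch.nn as nn

class Hybrid3D2D(nn.Module):
    """Sketch of a hybrid 3-D/2-D pipeline (illustrative sizes throughout)."""

    def __init__(self, bands: int = 30, num_classes: int = 16):
        super().__init__()
        self.conv3d = nn.Conv3d(1, 8, kernel_size=(7, 3, 3), padding=(0, 1, 1))
        depth = bands - 7 + 1                     # spectral size after conv3d
        ch = 8 * depth                            # fold spectral axis into channels
        self.depthwise = nn.Conv2d(ch, ch, 3, padding=1, groups=ch)  # per-channel
        self.pointwise = nn.Conv2d(ch, 64, 1)                        # channel mixing
        self.head = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                                  nn.Linear(64, num_classes))

    def forward(self, x):                         # x: (N, 1, bands, H, W)
        f = torch.relu(self.conv3d(x))            # (N, 8, depth, H, W)
        f = f.flatten(1, 2)                       # (N, 8*depth, H, W) -> 2-D features
        f = torch.relu(self.pointwise(torch.relu(self.depthwise(f))))
        return self.head(f)

logits = Hybrid3D2D()(torch.randn(2, 1, 30, 9, 9))   # (2, 16)
```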

22 citations

Journal ArticleDOI
01 Nov 2021 - Optik
TL;DR: This work presents a compact hybrid CNN model which overcomes the aforementioned challenges by distributing spatial-spectral feature extraction across 3D and 2D layers, and shows that the proposed pipeline outperforms competing models in terms of generalization performance and statistical significance.

20 citations

Journal ArticleDOI
TL;DR: This work progressively modifies a baseline model into an end-to-end trained DCNN that has fewer parameters and better recognition accuracy than existing models (i.e., ResNet, SqueezeNet, and MiniVGGNet); comparative results confirm the superiority of the proposed model.
Abstract: Diseases in apple orchards (rot, scab, and blotch) cause substantial losses in the agricultural industry worldwide. Traditional hand-picking inspection methods are subjective and depend on human effort. Conventional machine learning methods for apple disease classification depend on hand-crafted features that are complex and not robust. Advanced methods such as Convolutional Neural Networks (CNNs) have become a promising way to achieve higher accuracy, although they need a high volume of samples. This work investigates different Deep CNN (DCNN) applications to apple disease classification using deep generative images to obtain higher accuracy. To achieve this, our work progressively modifies a baseline model into an end-to-end trained DCNN that has fewer parameters and better recognition accuracy than existing models (i.e., ResNet, SqueezeNet, and MiniVGGNet). We have performed a comparative study with state-of-the-art CNNs as well as conventional methods proposed in the literature, and the comparative results confirm the superiority of our proposed model.

19 citations

References
Journal ArticleDOI
01 Jan 1988 - Nature
TL;DR: Back-propagation repeatedly adjusts the weights of the connections in the network so as to minimize a measure of the difference between the actual output vector of the net and the desired output vector, which helps to represent important features of the task domain.
Abstract: We describe a new learning procedure, back-propagation, for networks of neurone-like units. The procedure repeatedly adjusts the weights of the connections in the network so as to minimize a measure of the difference between the actual output vector of the net and the desired output vector. As a result of the weight adjustments, internal ‘hidden’ units which are not part of the input or output come to represent important features of the task domain, and the regularities in the task are captured by the interactions of these units. The ability to create useful new features distinguishes back-propagation from earlier, simpler methods such as the perceptron-convergence procedure.
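To make the procedure concrete, here is a minimal NumPy sketch of back-propagation for a single hidden layer under a squared-error loss; the toy sizes and learning rate are illustrative, not from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy network: 2 inputs -> 3 hidden units (sigmoid) -> 1 linear output.
W1, W2 = rng.normal(size=(2, 3)), rng.normal(size=(3, 1))
x = rng.normal(size=(4, 2))    # four input vectors
y = rng.normal(size=(4, 1))    # desired output vectors

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

for _ in range(100):
    h = sigmoid(x @ W1)                   # forward pass
    out = h @ W2
    err = out - y                         # actual minus desired output
    # Backward pass: propagate the error and adjust weights downhill
    # on the squared-error surface (loss = 0.5 * sum(err**2)).
    grad_W2 = h.T @ err
    grad_W1 = x.T @ ((err @ W2.T) * h * (1.0 - h))   # chain rule through sigmoid
    W2 -= 0.1 * grad_W2
    W1 -= 0.1 * grad_W1
```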

23,814 citations

Proceedings ArticleDOI
27 Jun 2016
TL;DR: In this paper, the authors explore ways to scale up networks that aim to utilize the added computation as efficiently as possible, through suitably factorized convolutions and aggressive regularization.
Abstract: Convolutional networks are at the core of most state-of-the-art computer vision solutions for a wide variety of tasks. Since 2014 very deep convolutional networks have started to become mainstream, yielding substantial gains in various benchmarks. Although increased model size and computational cost tend to translate to immediate quality gains for most tasks (as long as enough labeled data is provided for training), computational efficiency and low parameter count are still enabling factors for various use cases such as mobile vision and big-data scenarios. Here we explore ways to scale up networks that aim to utilize the added computation as efficiently as possible, through suitably factorized convolutions and aggressive regularization. We benchmark our methods on the ILSVRC 2012 classification challenge validation set and demonstrate substantial gains over the state of the art: 21.2% top-1 and 5.6% top-5 error for single-frame evaluation using a network with a computational cost of 5 billion multiply-adds per inference and fewer than 25 million parameters. With an ensemble of 4 models and multi-crop evaluation, we report 3.5% top-5 error and 17.3% top-1 error on the validation set and 3.6% top-5 error on the official test set.
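As a brief illustration of the factorization idea, here is a hedged PyTorch sketch of one of the tricks the abstract alludes to: replacing a 3x3 convolution with a 1x3 convolution followed by a 3x1 convolution. Channel counts and names are illustrative, not the paper's code.

```python
import torch
import torch.nn as nn

# Asymmetric factorization: a 3x3 convolution becomes 1x3 followed by 3x1.
# Same 3x3 receptive field, fewer parameters: 6*C*C vs 9*C*C weights.
def factorized_3x3(channels: int) -> nn.Sequential:
    return nn.Sequential(
        nn.Conv2d(channels, channels, kernel_size=(1, 3), padding=(0, 1)),
        nn.ReLU(inplace=True),
        nn.Conv2d(channels, channels, kernel_size=(3, 1), padding=(1, 0)),
        nn.ReLU(inplace=True),
    )

x = torch.randn(1, 64, 17, 17)
assert factorized_3x3(64)(x).shape == x.shape   # spatial size preserved
```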

16,962 citations

Journal ArticleDOI
17 Jul 2019
TL;DR: AmoebaNet-A, presented in this paper, modifies the tournament selection evolutionary algorithm by introducing an age property that favors younger genotypes, and achieves state-of-the-art performance.
Abstract: The effort devoted to hand-crafting neural network image classifiers has motivated the use of architecture search to discover them automatically. Although evolutionary algorithms have been repeatedly applied to neural network topologies, the image classifiers thus discovered have remained inferior to human-crafted ones. Here, we evolve an image classifier, AmoebaNet-A, that surpasses hand-designs for the first time. To do this, we modify the tournament selection evolutionary algorithm by introducing an age property to favor the younger genotypes. At matching size, AmoebaNet-A has accuracy comparable to current state-of-the-art ImageNet models discovered with more complex architecture-search methods. Scaled to larger size, AmoebaNet-A sets a new state-of-the-art 83.9% top-1 / 96.6% top-5 ImageNet accuracy. In a controlled comparison against a well-known reinforcement learning algorithm, we give evidence that evolution can obtain results faster with the same hardware, especially at the earlier stages of the search. This is relevant when fewer compute resources are available. Evolution is, thus, a simple method to effectively discover high-quality architectures.
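The aging mechanism is easy to sketch. Below is a hedged Python outline of aging evolution: plain tournament selection, except that each cycle removes the oldest individual rather than the worst one. All function names (eval_fn, mutate_fn, random_arch_fn) are placeholders, not the paper's API.

```python
import random
from collections import deque

def aging_evolution(eval_fn, mutate_fn, random_arch_fn,
                    pop_size=20, sample_size=5, cycles=200):
    """Tournament selection with an age property: the oldest genotype,
    not the worst, dies each cycle, which favors younger genotypes.
    """
    population = deque()                           # FIFO: leftmost = oldest
    for _ in range(pop_size):
        arch = random_arch_fn()
        population.append((arch, eval_fn(arch)))
    best = max(population, key=lambda p: p[1])
    for _ in range(cycles):
        sample = random.sample(list(population), sample_size)
        parent = max(sample, key=lambda p: p[1])   # tournament winner
        child = mutate_fn(parent[0])
        child_entry = (child, eval_fn(child))
        population.append(child_entry)             # youngest joins on the right
        population.popleft()                       # oldest dies, regardless of fitness
        best = max(best, child_entry, key=lambda p: p[1])
    return best
```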

2,076 citations

Journal ArticleDOI
TL;DR: This paper proposes a 3-D CNN-based FE model with combined regularization to extract effective spectral-spatial features of hyperspectral imagery and reveals that the proposed models with sparse constraints provide competitive results to state-of-the-art methods.
Abstract: Due to the advantages of deep learning, in this paper, a regularized deep feature extraction (FE) method is presented for hyperspectral image (HSI) classification using a convolutional neural network (CNN). The proposed approach employs several convolutional and pooling layers to extract deep features from HSIs, which are nonlinear, discriminant, and invariant. These features are useful for image classification and target detection. Furthermore, in order to address the common issue of imbalance between high dimensionality and limited availability of training samples for the classification of HSI, a few strategies such as L2 regularization and dropout are investigated to avoid overfitting in class data modeling. More importantly, we propose a 3-D CNN-based FE model with combined regularization to extract effective spectral-spatial features of hyperspectral imagery. Finally, in order to further improve the performance, a virtual sample enhanced method is proposed. The proposed approaches are carried out on three widely used hyperspectral data sets: Indian Pines, University of Pavia, and Kennedy Space Center. The obtained results reveal that the proposed models with sparse constraints provide competitive results to state-of-the-art methods. In addition, the proposed deep FE opens a new window for further research.
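For orientation, here is a minimal PyTorch sketch in the spirit of the described model: 3-D convolutions over (spectral, height, width) patches with dropout, and L2 regularization supplied through the optimizer's weight decay. The patch size, channel counts, and class count are illustrative assumptions, not the paper's configuration.

```python
import torch
import torch.nn as nn

class Spectral3DCNN(nn.Module):
    """3-D CNN feature extractor with dropout; L2 comes from weight_decay."""

    def __init__(self, num_classes: int, bands: int = 30):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv3d(1, 8, kernel_size=(7, 3, 3)), nn.ReLU(),
            nn.Conv3d(8, 16, kernel_size=(5, 3, 3)), nn.ReLU(),
            nn.Dropout3d(0.4),                    # regularization against overfitting
        )
        self.classifier = nn.LazyLinear(num_classes)  # infers the flattened size

    def forward(self, x):                         # x: (N, 1, bands, H, W)
        return self.classifier(self.features(x).flatten(1))

model = Spectral3DCNN(num_classes=16)             # e.g., 16 classes in Indian Pines
opt = torch.optim.SGD(model.parameters(), lr=1e-2, weight_decay=1e-4)  # L2 term
logits = model(torch.randn(2, 1, 30, 11, 11))
```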

2,059 citations

Posted Content
TL;DR: This paper proposes to search for an architectural building block on a small dataset and then transfer the block to a larger dataset and introduces a new regularization technique called ScheduledDropPath that significantly improves generalization in the NASNet models.
Abstract: Developing neural network image classification models often requires significant architecture engineering. In this paper, we study a method to learn the model architectures directly on the dataset of interest. As this approach is expensive when the dataset is large, we propose to search for an architectural building block on a small dataset and then transfer the block to a larger dataset. The key contribution of this work is the design of a new search space (the "NASNet search space") which enables transferability. In our experiments, we search for the best convolutional layer (or "cell") on the CIFAR-10 dataset and then apply this cell to the ImageNet dataset by stacking together more copies of this cell, each with their own parameters to design a convolutional architecture, named "NASNet architecture". We also introduce a new regularization technique called ScheduledDropPath that significantly improves generalization in the NASNet models. On CIFAR-10 itself, NASNet achieves 2.4% error rate, which is state-of-the-art. On ImageNet, NASNet achieves, among the published works, state-of-the-art accuracy of 82.7% top-1 and 96.2% top-5 on ImageNet. Our model is 1.2% better in top-1 accuracy than the best human-invented architectures while having 9 billion fewer FLOPS - a reduction of 28% in computational demand from the previous state-of-the-art model. When evaluated at different levels of computational cost, accuracies of NASNets exceed those of the state-of-the-art human-designed models. For instance, a small version of NASNet also achieves 74% top-1 accuracy, which is 3.1% better than equivalently-sized, state-of-the-art models for mobile platforms. Finally, the learned features by NASNet used with the Faster-RCNN framework surpass state-of-the-art by 4.0% achieving 43.1% mAP on the COCO dataset.
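The general drop-path idea behind ScheduledDropPath can be sketched as follows: during training, entire branches are zeroed per sample with a probability that is ramped up linearly over the course of training. This is a hedged, generic variant; the exact NASNet implementation drops paths inside cells and differs in detail.

```python
import torch
import torch.nn as nn

class DropPath(nn.Module):
    """Randomly zero an entire branch per sample during training,
    rescaling survivors so the expected activation is unchanged.
    drop_prob is updated externally following a schedule.
    """
    def __init__(self, drop_prob: float = 0.0):
        super().__init__()
        self.drop_prob = drop_prob

    def forward(self, x):
        if not self.training or self.drop_prob == 0.0:
            return x
        keep = 1.0 - self.drop_prob
        # One Bernoulli draw per sample, broadcast over remaining dims
        shape = (x.shape[0],) + (1,) * (x.dim() - 1)
        mask = (torch.rand(shape, device=x.device) < keep).float()
        return x * mask / keep

# Linear schedule: ramp drop_prob from 0 to its final value over training
def scheduled_drop_prob(step: int, total_steps: int, final_prob: float = 0.3) -> float:
    return final_prob * min(step / total_steps, 1.0)
```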

1,342 citations

Trending Questions (1)
Why are CNNs used for classification of HGG and LGG?

The provided paper is about using CNNs for Hyperspectral Image Classification (HSIC), not for the classification of HGG and LGG (high-grade and low-grade gliomas). Therefore, the paper contains no information about why CNNs are used for HGG and LGG classification.