Journal ArticleDOI

Learning Deep Hierarchical Spatial-Spectral Features for Hyperspectral Image Classification Based on Residual 3D-2D CNN.

29 Nov 2019-Sensors (Basel)-Vol. 19, Iss: 23, pp 5276
TL;DR: Focusing on hyperspectral classification with limited samples, an 11-layer CNN model called R-HybridSN (Residual-HybridSN) is designed with an organic combination of 3D-2D-CNN, residual learning, and depth-separable convolutions to better learn deep hierarchical spatial–spectral features from very few training data.
Abstract: Every pixel in a hyperspectral image contains detailed spectral information in hundreds of narrow bands captured by hyperspectral sensors. Pixel-wise classification of a hyperspectral image is the cornerstone of various hyperspectral applications. Nowadays, deep learning models, represented by the convolutional neural network (CNN), provide an ideal solution for feature extraction and have made remarkable achievements in supervised hyperspectral classification. However, hyperspectral image annotation is time-consuming and laborious, and the available training data are usually limited. Due to this "small-sample problem", CNN-based hyperspectral classification remains challenging. Focusing on hyperspectral classification with limited samples, we designed an 11-layer CNN model called R-HybridSN (Residual-HybridSN) from the perspective of network optimization. Through an organic combination of 3D-2D-CNN, residual learning, and depth-separable convolutions, R-HybridSN can better learn deep hierarchical spatial-spectral features from very few training data. The performance of R-HybridSN is evaluated on three publicly available hyperspectral datasets with different amounts of training samples. Using only 5%, 1%, and 1% of the labeled data for training on Indian Pines, Salinas, and University of Pavia, respectively, R-HybridSN achieves classification accuracies of 96.46%, 98.25%, and 96.59%, respectively, far better than the contrast models.
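As a rough illustration of the ingredients named in the abstract (3D-2D convolutions, residual learning, depth-separable convolutions), a minimal PyTorch sketch follows. It is not the authors' 11-layer R-HybridSN: the layer sizes are hypothetical, and it uses a plain identity shortcut where the paper describes non-identity residual connections.

```python
import torch
import torch.nn as nn

class Hybrid3D2DBlock(nn.Module):
    def __init__(self, bands=30, mid_channels=8):
        super().__init__()
        # 3D convolution over (spectral, height, width)
        self.conv3d = nn.Conv3d(1, mid_channels, kernel_size=(7, 3, 3),
                                padding=(3, 1, 1))
        folded = mid_channels * bands          # spectral axis folded into channels
        # depth-separable 2D convolution: per-channel 3x3 + pointwise 1x1
        self.depthwise = nn.Conv2d(folded, folded, 3, padding=1, groups=folded)
        self.pointwise = nn.Conv2d(folded, folded, 1)
        self.relu = nn.ReLU()

    def forward(self, x):                      # x: (batch, 1, bands, H, W)
        f = self.relu(self.conv3d(x))
        b, c, d, h, w = f.shape
        f = f.reshape(b, c * d, h, w)          # 3D -> 2D feature maps
        out = self.pointwise(self.depthwise(f))
        return self.relu(out + f)              # residual (identity) shortcut

patch = torch.randn(2, 1, 30, 11, 11)          # hypothetical 11x11 patch, 30 bands
print(Hybrid3D2DBlock()(patch).shape)          # torch.Size([2, 240, 11, 11])
```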
Citations
Journal Article
TL;DR: This result is proved here for a class of nodes termed "semi-algebraic gates", which includes the common choices of ReLU, maximum, indicator, and piecewise polynomial functions, therefore establishing benefits of depth against not just standard networks with ReLU gates, but also convolutional networks with ReLU and maximization gates, sum-product networks, and boosted decision trees.
Abstract: For any positive integer $k$, there exist neural networks with $\Theta(k^3)$ layers, $\Theta(1)$ nodes per layer, and $\Theta(1)$ distinct parameters which cannot be approximated by networks with $\mathcal{O}(k)$ layers unless they are exponentially large: they must possess $\Omega(2^k)$ nodes. This result is proved here for a class of nodes termed "semi-algebraic gates", which includes the common choices of ReLU, maximum, indicator, and piecewise polynomial functions, therefore establishing benefits of depth against not just standard networks with ReLU gates, but also convolutional networks with ReLU and maximization gates, sum-product networks, and boosted decision trees (in this last case with a stronger separation: $\Omega(2^{k^3})$ total tree nodes are required).

288 citations
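A hedged numerical illustration of why depth helps, in the spirit of the result above (not the paper's exact construction): composing the tent map k times, which takes only O(k) ReLU layers, yields a sawtooth with 2^k linear pieces, which a shallow ReLU network can only match with exponentially many units.

```python
import numpy as np

def tent(x):
    # t(x) = 2*min(x, 1-x); expressible with O(1) ReLU units
    return 2 * np.minimum(x, 1 - x)

def tent_k(x, k):
    for _ in range(k):                 # k compositions -> 2**k linear pieces
        x = tent(x)
    return x

x = np.linspace(0.0, 1.0, 1_000_001)
for k in (1, 3, 5):
    slopes = np.sign(np.diff(tent_k(x, k)))
    pieces = 1 + np.count_nonzero(np.diff(slopes))   # count slope sign changes
    print(f"k={k}: {pieces} linear pieces (expected {2**k})")
```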

Journal ArticleDOI
TL;DR: A novel CD framework based on the convolutional neural network (CNN) is proposed to not only address the aforementioned problems but also to considerably improve the level of accuracy.
Abstract: The diversity of change detection (CD) methods and the limitations in generalizing these techniques across different types of remote sensing datasets and study areas have been a challenge for CD applications. Additionally, most CD methods have been implemented in two intensive and time-consuming steps: (a) predicting change areas, and (b) deciding on the predicted areas. In this study, a novel CD framework based on the convolutional neural network (CNN) is proposed to not only address the aforementioned problems but also to considerably improve the level of accuracy. The proposed CNN-based CD network contains three parallel channels: the first and second channels extract deep features from the original first- and second-time imagery, respectively, and the third channel focuses on the extraction of change deep features based on differencing and stacking deep features. Additionally, each channel includes three types of convolution kernels: 1D-, 2D-, and 3D-dilated convolutions. The effectiveness and reliability of the proposed CD method are evaluated using three different types of remote sensing benchmark datasets (i.e., multispectral, hyperspectral, and Polarimetric Synthetic Aperture Radar (PolSAR)). The resulting CD maps are also evaluated both visually and statistically by calculating nine different accuracy indices. Moreover, the results of the CD using the proposed method are compared to those of several state-of-the-art CD algorithms. All the results show that the proposed method outperforms the other remote sensing CD techniques. For instance, considering different scenarios, the Overall Accuracies (OAs) and Kappa Coefficients (KCs) of the proposed CD method are better than 95.89% and 0.805, respectively, and the Miss Detection (MD) and False Alarm (FA) rates are lower than 12% and 3%, respectively.

58 citations
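A minimal sketch of one element described above: a channel stage built from dilated convolutions, which widen the receptive field without adding parameters. Only the 3D-dilated case is shown; the kernel size and dilation rate are assumptions, not the paper's configuration.

```python
import torch
import torch.nn as nn

# one hypothetical 3D-dilated stage: dilation=2 doubles the effective
# kernel reach over (bands, height, width)
stage = nn.Sequential(
    nn.Conv3d(1, 4, kernel_size=3, padding=2, dilation=2),
    nn.ReLU(),
)
x = torch.randn(1, 1, 20, 32, 32)   # hypothetical patch: 20 bands, 32x32 pixels
print(stage(x).shape)               # torch.Size([1, 4, 20, 32, 32])
```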


Cites background from "Learning Deep Hierarchical Spatial-..."

  • ...The main purpose of the 3D convolution layer is to investigate the relationship between spectral bands so that all of the content of the spectral information is fully used [46,47]....


  • ...The 2D convolution layer is concentrated on spatial dimensions and is unable to handle spectral information [47,50]....


Journal ArticleDOI
TL;DR: In this article, the authors provided the first narrative deep learning review considering all facets of image classification using AI, employing a PRISMA search strategy across Google Scholar, PubMed, IEEE, and Elsevier Science Direct, through which 127 relevant HDL studies were identified.

50 citations

Journal ArticleDOI
11 Sep 2020-Sensors
TL;DR: This work proposes a novel 3D-2D convolutional neural network (CNN) model named AD-HybridSN (Attention-Dense-HybridSN), which can learn more discriminative spatial–spectral features from very few training data and far outperforms all the contrast models.
Abstract: Convolutional neural networks provide an ideal solution for hyperspectral image (HSI) classification. However, the classification effect is not satisfactory when only limited training samples are available. Focusing on "small-sample" hyperspectral classification, we proposed a novel 3D-2D convolutional neural network (CNN) model named AD-HybridSN (Attention-Dense-HybridSN). In our proposed model, a dense block was used to reuse shallow features, aimed at better exploiting hierarchical spatial-spectral features. Subsequent depth-separable convolutional layers were used to discriminate the spatial information. Further refinement of spatial-spectral features was realized by the channel attention method and the spatial attention method, performed after every 3D convolutional layer and every 2D convolutional layer, respectively. Experimental results indicate that our proposed model can learn more discriminative spatial-spectral features from very few training data. On Indian Pines, Salinas, and University of Pavia, AD-HybridSN obtains 97.02%, 99.59%, and 98.32% overall accuracy using only 5%, 1%, and 1% of the labeled data for training, respectively, which is far better than all the contrast models.

25 citations
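A hedged sketch of the channel-attention step described above, in squeeze-and-excitation style: global-average-pool the feature map, pass it through a small bottleneck, and rescale the channels. The reduction ratio and placement are assumptions, not the paper's exact design.

```python
import torch
import torch.nn as nn

class ChannelAttention(nn.Module):
    def __init__(self, channels, reduction=8):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction), nn.ReLU(),
            nn.Linear(channels // reduction, channels), nn.Sigmoid(),
        )

    def forward(self, x):                        # x: (batch, C, H, W)
        w = x.mean(dim=(2, 3))                   # squeeze: global average pool
        w = self.fc(w).unsqueeze(-1).unsqueeze(-1)
        return x * w                             # excite: rescale each channel

feat = torch.randn(2, 64, 11, 11)
print(ChannelAttention(64)(feat).shape)          # torch.Size([2, 64, 11, 11])
```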


Cites background or methods from "Learning Deep Hierarchical Spatial-..."

  • ...Compared with traditional 2D convolutional layers, depth-separable convolutional layers have fewer parameters and a lighter computational burden, which makes them more suitable for hyperspectral data processing [34] (a parameter-count sketch follows this list)....


  • ...[34] conducted extensive experiments using different amounts of training samples and found that degradation of the CNN model is very common when the sample size decreases....


  • ...[34] proposed R-HybridSN (Residual-HybridSN), making rational use of non-identity residual connections to enrich the feature-learning paths and enhance the flow of spectral information in the network....


  • ...Compared with the simple pipelined network, the well-designed model, which is more like a directed acyclic graph of layers, usually has a better classification effect [34]....


  • ...The main strategies for small sample hyperspectral classification include generative adversarial networks [39,40], semi-supervised learning [41,42] and network optimization [33,34]....

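A quick parameter-count check for the depth-separable point in the first bullet above: a depthwise 3x3 convolution followed by a pointwise 1x1 convolution uses far fewer parameters than a standard 3x3 convolution of the same shape (channel sizes here are hypothetical).

```python
import torch.nn as nn

c_in, c_out, k = 64, 64, 3
standard = nn.Conv2d(c_in, c_out, k, padding=1)
separable = nn.Sequential(
    nn.Conv2d(c_in, c_in, k, padding=1, groups=c_in),  # depthwise 3x3
    nn.Conv2d(c_in, c_out, 1),                         # pointwise 1x1
)
count = lambda m: sum(p.numel() for p in m.parameters())
print(count(standard), count(separable))  # 36928 vs 4800
```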

Journal ArticleDOI
TL;DR: In this paper, a lightweight multiscale squeeze-and-excitation pyramid pooling network (MSPN) was proposed to tackle the small-sample problem, applying semantic segmentation to pixel-level hyperspectral classification given the comparability of the two tasks.
Abstract: Hyperspectral images are widely used for classification due to their rich spectral information along with spatial information. To process the high dimensionality and high nonlinearity of hyperspectral images, deep learning methods based on the convolutional neural network (CNN) are widely used in hyperspectral classification applications. However, most CNN structures are stacked vertically and use convolutional kernels or pooling layers of a single size, which cannot fully mine the multiscale information in hyperspectral images. When such networks meet the practical challenge of a limited labeled hyperspectral image dataset (i.e., the "small sample problem"), the classification accuracy and generalization ability are limited. In this paper, to tackle the small sample problem, we apply semantic segmentation to pixel-level hyperspectral classification, given the comparability of the two tasks. A lightweight, multiscale squeeze-and-excitation pyramid pooling network (MSPN) is proposed. It consists of a multiscale 3D CNN module, a squeeze-and-excitation module, and a pyramid pooling module with 2D CNN. Such a hybrid 2D-3D-CNN MSPN framework can learn and fuse deeper hierarchical spatial–spectral features with fewer training samples. The proposed MSPN was tested on three publicly available hyperspectral classification datasets: Indian Pines, Salinas, and Pavia University. Using 5%, 0.5%, and 0.5% training samples of the three datasets, the classification accuracies of the MSPN were 96.09%, 97%, and 96.56%, respectively. In addition, we also selected the latest dataset with higher spatial resolution, named WHU-Hi-LongKou, as the challenge object. Using only 0.1% of the training samples, we could achieve a 97.31% classification accuracy, which is far superior to the state-of-the-art hyperspectral classification methods.

19 citations
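A minimal sketch of the pyramid pooling idea in the MSPN abstract above: pool the feature map at several grid sizes, upsample back, and concatenate with the input to fuse multiscale context. The bin sizes and channel counts are assumptions for illustration.

```python
import torch
import torch.nn.functional as F

def pyramid_pool(x, bins=(1, 2, 4)):          # x: (batch, C, H, W)
    h, w = x.shape[2:]
    pooled = [F.interpolate(F.adaptive_avg_pool2d(x, b), size=(h, w),
                            mode="bilinear", align_corners=False)
              for b in bins]
    return torch.cat([x] + pooled, dim=1)     # fuse multiscale context

feat = torch.randn(1, 32, 16, 16)
print(pyramid_pool(feat).shape)               # torch.Size([1, 128, 16, 16])
```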

References
Proceedings ArticleDOI
27 Jun 2016
TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.
Abstract: Deeper neural networks are more difficult to train. We present a residual learning framework to ease the training of networks that are substantially deeper than those used previously. We explicitly reformulate the layers as learning residual functions with reference to the layer inputs, instead of learning unreferenced functions. We provide comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth. On the ImageNet dataset we evaluate residual nets with a depth of up to 152 layers, 8× deeper than VGG nets [40] but still having lower complexity. An ensemble of these residual nets achieves 3.57% error on the ImageNet test set. This result won the 1st place on the ILSVRC 2015 classification task. We also present analysis on CIFAR-10 with 100 and 1000 layers. The depth of representations is of central importance for many visual recognition tasks. Solely due to our extremely deep representations, we obtain a 28% relative improvement on the COCO object detection dataset. Deep residual nets are foundations of our submissions to the ILSVRC & COCO 2015 competitions, where we also won the 1st places on the tasks of ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation.

123,388 citations
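The core reformulation in the abstract above, as a minimal sketch: the block learns a residual function F(x) and adds the input back, so y = F(x) + x. The batch normalization and projection shortcuts of the full architecture are omitted, and the layer sizes are illustrative.

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    def __init__(self, channels=64):
        super().__init__()
        self.f = nn.Sequential(                # the residual function F
            nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(),
            nn.Conv2d(channels, channels, 3, padding=1),
        )
        self.relu = nn.ReLU()

    def forward(self, x):
        return self.relu(self.f(x) + x)        # y = F(x) + x, identity shortcut

x = torch.randn(1, 64, 8, 8)
print(ResidualBlock()(x).shape)                # torch.Size([1, 64, 8, 8])
```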

Proceedings Article
03 Dec 2012
TL;DR: A large, deep convolutional neural network (DCNN) achieved state-of-the-art performance on ImageNet, as discussed by the authors; it consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Abstract: We trained a large, deep convolutional neural network to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes. On the test data, we achieved top-1 and top-5 error rates of 37.5% and 17.0% which is considerably better than the previous state-of-the-art. The neural network, which has 60 million parameters and 650,000 neurons, consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax. To make training faster, we used non-saturating neurons and a very efficient GPU implementation of the convolution operation. To reduce overfitting in the fully-connected layers we employed a recently-developed regularization method called "dropout" that proved to be very effective. We also entered a variant of this model in the ILSVRC-2012 competition and achieved a winning top-5 test error rate of 15.3%, compared to 26.2% achieved by the second-best entry.

73,978 citations
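The "dropout" regularizer mentioned above, in a short modern-PyTorch sketch: during training each activation is zeroed with probability p (0.5 in the paper's fully-connected layers) and the survivors are rescaled by 1/(1-p); at inference the layer is the identity.

```python
import torch
import torch.nn as nn

drop = nn.Dropout(p=0.5)
drop.train()
x = torch.ones(8)
print(drop(x))   # about half the entries zeroed, survivors scaled to 2.0
drop.eval()
print(drop(x))   # identity at inference time
```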

Proceedings Article
04 Sep 2014
TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.
Abstract: In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.

55,235 citations
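A hedged check of the design point above: two stacked 3x3 convolutions cover the same 5x5 receptive field as one 5x5 convolution while using fewer parameters (the channel count here is hypothetical).

```python
import torch.nn as nn

c = 64
one_5x5 = nn.Conv2d(c, c, 5, padding=2)
two_3x3 = nn.Sequential(nn.Conv2d(c, c, 3, padding=1), nn.ReLU(),
                        nn.Conv2d(c, c, 3, padding=1))
count = lambda m: sum(p.numel() for p in m.parameters())
print(count(one_5x5), count(two_3x3))  # 102464 vs 73856
```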


"Learning Deep Hierarchical Spatial-..." refers background in this paper

  • ...Since then, VGG [32], GoogLeNet [33], ResNet [34], and other networks with excellent performance in the ILSVRC competition have overcome one milestone after another in CNN model research....



Journal ArticleDOI
28 May 2015-Nature
TL;DR: Deep learning is making major advances in solving problems that have resisted the best attempts of the artificial intelligence community for many years, and will have many more successes in the near future because it requires very little engineering by hand and can easily take advantage of increases in the amount of available computation and data.
Abstract: Deep learning allows computational models that are composed of multiple processing layers to learn representations of data with multiple levels of abstraction. These methods have dramatically improved the state-of-the-art in speech recognition, visual object recognition, object detection and many other domains such as drug discovery and genomics. Deep learning discovers intricate structure in large data sets by using the backpropagation algorithm to indicate how a machine should change its internal parameters that are used to compute the representation in each layer from the representation in the previous layer. Deep convolutional nets have brought about breakthroughs in processing images, video, speech and audio, whereas recurrent nets have shone light on sequential data such as text and speech.

46,982 citations