scispace - formally typeset
Search or ask a question
Author

Tianyu Zhang

Bio: Tianyu Zhang is an academic researcher from Qiqihar University. The author has contributed to research in topics: Hyperspectral imaging & Feature extraction. The author has co-authored 3 publications.

Papers
More filters
Journal ArticleDOI
TL;DR: Wang et al. as discussed by the authors proposed a spectral-spatial attention fusion with a deformable convolution residual network (SSAF-DCR) for hyperspectral image classification.
Abstract: Convolutional neural networks (CNNs) have exhibited excellent performance in hyperspectral image classification. However, due to the lack of labeled hyperspectral data, it is difficult to achieve high classification accuracy of hyperspectral images with fewer training samples. In addition, although some deep learning techniques have been used in hyperspectral image classification, due to the abundant information of hyperspectral images, the problem of insufficient spatial spectral feature extraction still exists. To address the aforementioned issues, a spectral–spatial attention fusion with a deformable convolution residual network (SSAF-DCR) is proposed for hyperspectral image classification. The proposed network is composed of three parts, and each part is connected sequentially to extract features. In the first part, a dense spectral block is utilized to reuse spectral features as much as possible, and a spectral attention block that can refine and optimize the spectral features follows. In the second part, spatial features are extracted and selected by a dense spatial block and attention block, respectively. Then, the results of the first two parts are fused and sent to the third part, and deep spatial features are extracted by the DCR block. The above three parts realize the effective extraction of spectral–spatial features, and the experimental results for four commonly used hyperspectral datasets demonstrate that the proposed SSAF-DCR method is superior to some state-of-the-art methods with very few training samples.

7 citations

Journal ArticleDOI
TL;DR: Wang et al. as mentioned in this paper proposed a deep spectral spatial inverted residuals network (DSSIRNet) to solve the problem of limited labeled samples by data augmentation of small spatial blocks.
Abstract: Convolutional neural networks (CNNs) have been widely used in hyperspectral image classification in recent years. The training of CNNs relies on a large amount of labeled sample data. However, the number of labeled samples of hyperspectral data is relatively small. Moreover, for hyperspectral images, fully extracting spectral and spatial feature information is the key to achieve high classification performance. To solve the above issues, a deep spectral spatial inverted residuals network (DSSIRNet) is proposed. In this network, a data block random erasing strategy is introduced to alleviate the problem of limited labeled samples by data augmentation of small spatial blocks. In addition, a deep inverted residuals (DIR) module for spectral spatial feature extraction is proposed, which locks the effective features of each layer while avoiding network degradation. Furthermore, a global 3D attention module is proposed, which can realize the fine extraction of spectral and spatial global context information under the condition of the same number of input and output feature maps. Experiments are carried out on four commonly used hyperspectral datasets. A large number of experimental results show that compared with some state-of-the-art classification methods, the proposed method can provide higher classification accuracy for hyperspectral images.

6 citations

Journal ArticleDOI
TL;DR: Wang et al. as discussed by the authors proposed an improved dense block based on a multiscale spectral pyramid (MSSP), which can fully extract spectral information from hyperspectral images, and a short connection with nonlinear transformation is introduced to enhance the representation ability of the model.
Abstract: In recent years, convolutional neural networks (CNNs) have been widely used in hyperspectral image classification and have achieved good performance. However, the high dimensions and few samples of hyperspectral remote sensing images tend to be the main factors restricting improvements in classification performance. At present, most advanced classification methods are based on the joint extraction of spatial and spectral features. In this article, an improved dense block based on a multiscale spectral pyramid (MSSP) is proposed. This method uses the idea of multiscale and group convolution of the convolution kernel, which can fully extract spectral information from hyperspectral images. The designed MSSP is the main unit of the spectral dense block (called MSSP Block). Additionally, a short connection with nonlinear transformation is introduced to enhance the representation ability of the model. To demonstrate the effectiveness of the proposed dual-branch multiscale spectral attention network, some experiments are conducted on five commonly used datasets. The experimental results show that, compared with some state-of-the-art methods, the proposed method can provide better classification performance and has strong generalization ability.

2 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: The experimental results proved that the proposed model can provide the optimal trade-off between accuracy and computational time compared to other related methods using the Indian Pines, Pavia University, and Salinas Scene hyperspectral benchmark datasets.
Abstract: The performance of hyperspectral image (HSI) classification is highly dependent on spatial and spectral information, and is heavily affected by factors such as data redundancy and insufficient spatial resolution. To overcome these challenges, many convolutional neural networks (CNN) especially 2D-CNN-based methods have been proposed for HSI classification. However, these methods produced insufficient results compared to 3D-CNN-based methods. On the other hand, the high computational complexity of the 3D-CNN-based methods is still a major concern that needs to be addressed. Therefore, this study introduces a consolidated convolutional neural network (C-CNN) to overcome the aforementioned issues. The proposed C-CNN is comprised of a three-dimension CNN (3D-CNN) joined with a two-dimension CNN (2D-CNN). The 3D-CNN is used to represent spatial–spectral features from the spectral bands, and the 2D-CNN is used to learn abstract spatial features. Principal component analysis (PCA) was firstly applied to the original HSIs before they are fed to the network to reduce the spectral bands redundancy. Moreover, image augmentation techniques including rotation and flipping have been used to increase the number of training samples and reduce the impact of overfitting. The proposed C-CNN that was trained using the augmented images is named C-CNN-Aug. Additionally, both Dropout and L2 regularization techniques have been used to further reduce the model complexity and prevent overfitting. The experimental results proved that the proposed model can provide the optimal trade-off between accuracy and computational time compared to other related methods using the Indian Pines, Pavia University, and Salinas Scene hyperspectral benchmark datasets.

31 citations

Journal ArticleDOI
TL;DR: This research analyzes how to accurately classify new HSI from limited samples with labels using a compressed synergic deep convolution neural network with Aquila optimization model constructed by utilising a novel optimization technique known as the Aquila Optimizer (AO).
Abstract: The classification technology of hyperspectral images (HSI) consists of many contiguous spectral bands that are often utilized for a various Earth observation activities, such as surveillance, detection, and identification. The incorporation of both spectral and spatial characteristics is necessary for improved classification accuracy. In the classification of hyperspectral images, deep learning has gained significant traction. This research analyzes how to accurately classify new HSI from limited samples with labels. A novel deep-learning-based categorization based on feature extraction and classification is designed for this purpose. Initial extraction of spectral and spatial information is followed by spectral and spatial information integration to generate fused features. The classification challenge is completed using a compressed synergic deep convolution neural network with Aquila optimization (CSDCNN-AO) model constructed by utilising a novel optimization technique known as the Aquila Optimizer (AO). The HSI, the Kennedy Space Center (KSC), the Indian Pines (IP) dataset, the Houston U (HU) dataset, and the Salinas Scene (SS) dataset are used for experiment assessment. The sequence testing on these four HSI-classified datasets demonstrate that our innovative framework outperforms the conventional technique on common evaluation measures such as average accuracy (AA), overall accuracy (OA), and Kappa coefficient (k). In addition, it significantly reduces training time and computational cost, resulting in enhanced training stability, maximum performance, and remarkable training accuracy.

5 citations

Journal ArticleDOI
TL;DR: A novel deep learning-based fuzzy-twin proximal support vector machine (DL-FTPSVM) kernel neural network model is proposed to perform an effective hyperspectral image (HSI) classification and proves the superiority of the DL- FTPSVM model incurring better classification accuracy compared to techniques from previous works.

3 citations

Journal ArticleDOI
TL;DR: Wang et al. as mentioned in this paper proposed a two-branch convolutional neural network with a polarized full attention mechanism for HSI classification, which can make the network easier to be trained and fit the network to small sample size conditions.
Abstract: In recent years, convolutional neural networks (CNNs) have been introduced for pixel-wise hyperspectral image (HSI) classification tasks. However, some problems of the CNNs are still insufficiently addressed, such as the receptive field problem, small sample problem, and feature fusion problem. To tackle the above problems, we proposed a two-branch convolutional neural network with a polarized full attention mechanism for HSI classification. In the proposed network, two-branch CNNs are implemented to efficiently extract the spectral and spatial features, respectively. The kernel sizes of the convolutional layers are simplified to reduce the complexity of the network. This approach can make the network easier to be trained and fit the network to small sample size conditions. The one-shot connection technique is applied to improve the efficiency of feature extraction. An improved full attention block, named polarized full attention, is exploited to fuse the feature maps and provide global contextual information. Experimental results on several public HSI datasets confirm the effectiveness of the proposed network.

3 citations

Journal ArticleDOI
TL;DR: Li et al. as discussed by the authors proposed a triple-branch ternary-attention mechanism network with deformable 3D convolution (D3DTBTA), which can better capture the vector features of three dimensions in hyperspectral images.
Abstract: ABSTRACT In recent years, the classification of hyperspectral images (HSI) has received extensive research attention. As compared with traditional HSI classification, which only uses spectral information, it is found that spatial information is also essential in HSI classification. To effectively utilize the spectral and spatial information of HSI, this paper proposes a triple-branch ternary-attention mechanism network with deformable 3D convolution (D3DTBTA). In D3DTBTA, three branches, i.e. the spectral, spatial-X, and spatial-Y branches, are combined with the attention mechanism in three directions, which can better capture the vector features of three dimensions in HSI. Furthermore, considering the adaptation of scale and receptive field size in the convolution operation, our method uses deformable convolution to enable D3DTBTA to enhance feature extraction. Our experimental results show that the framework outperforms existing algorithms on four hyperspectral datasets, especially when the training samples are limited.

2 citations