scispace - formally typeset
Search or ask a question
Author

Yun Cao

Bio: Yun Cao is an academic researcher from China University of Geosciences (Beijing). The author has contributed to research in topics: Autoencoder & Feature extraction. The author has an hindex of 2, co-authored 9 publications receiving 14 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: A novel unsupervised feature learning method called latent relationship guided the stacked sparse autoencoder (LRSSAE) is developed in this article, which can effectively exploit the latent relationship under feature space to improve the ability of feature learning.
Abstract: Classification is an important application of hyperspectral image (HSI). However, it is also a challenging research topic due to the spatial variability of spectral signature and limited training samples. To address these problems, a novel unsupervised feature learning method called latent relationship guided the stacked sparse autoencoder (LRSSAE) is developed in this article, which can effectively exploit the latent relationship under feature space to improve the ability of feature learning. Moreover, the superpixels constraint is employed on the feature representation to avoid the “salt-and-pepper” problem, and it is enforced on the latent relationship to enhance the latent relationship learning additionally. In LRSSAE, combining the stacked sparse autoencoder (SSAE) with the graph regularizations of latent relationship in each hidden layer and the superpixel constraints in the top layer, we extract feature representation in an unsupervised manner. And then, we present a customized iterative algorithm to optimize the LRSSAE. We evaluate the proposed method on three widely used HSI data sets comprehensively. The results demonstrate that our method achieves promising classification performance on these data sets and obtains improvements of 5.06%, 5.77%, and 2.11% in overall accuracy compared to the best SSAE method.

20 citations

Journal ArticleDOI
TL;DR: In this article, the authors provide a thorough review of recent achievements in the field of land-use mapping using deep learning (DL) algorithms, which offer novel opportunities for the development of LUM for HSR-RSIs.
Abstract: Land-use mapping (LUM) using high-spatial resolution remote sensing images (HSR-RSIs) is a challenging and crucial technology. However, due to the characteristics of HSR-RSIs, such as different image acquisition conditions and massive, detailed information, and performing LUM faces unique scientific challenges. With the emergence of new deep learning (DL) algorithms in recent years, methods to LUM with DL have achieved huge breakthroughs, which offer novel opportunities for the development of LUM for HSR-RSIs. This article aims to provide a thorough review of recent achievements in this field. Existing high spatial resolution datasets in the research of semantic segmentation and single-object segmentation are presented first. Next, we introduce several basic DL approaches that are frequently adopted for LUM. After reviewing DL-based LUM methods comprehensively, which highlights the contributions of researchers in the field of LUM for HSR-RSIs, we summarize these DL-based approaches based on two LUM criteria. Individually, the first one has supervised learning, semisupervised learning, or unsupervised learning, while another one is pixel-based or object-based. We then briefly review the fundamentals and the developments of the development of semantic segmentation and single-object segmentation. At last, quantitative results that experiment on the dataset of ISPRS Vaihingen and ISPRS Potsdam are given for several representative models such as fully convolutional network (FCN) and U-Net, following up with a comparison and discussion of the results.

20 citations

Journal ArticleDOI
TL;DR: Zhang et al. as discussed by the authors developed a deep metric learning approach with generative adversarial network regularization (DML-GANR) for HSR-RSI retrieval.
Abstract: With a small number of labeled samples for training, it can save considerable manpower and material resources, especially when the amount of high spatial resolution remote sensing images (HSR-RSIs) increases considerably. However, many deep models face the problem of overfitting when using a small number of labeled samples. This might degrade HSR-RSI retrieval accuracy. Aiming at obtaining more accurate HSR-RSI retrieval performance with small training samples, we develop a deep metric learning approach with generative adversarial network regularization (DML-GANR) for HSR-RSI retrieval. The DML-GANR starts from a high-level feature extraction (HFE) to extract high-level features, which includes convolutional layers and fully connected (FC) layers. Each of the FC layers is constructed by deep metric learning (DML) to maximize the interclass variations and minimize the intraclass variations. The generative adversarial network (GAN) is adopted to mitigate the overfitting problem and validate the qualities of extracted high-level features. DML-GANR is optimized through a customized approach, and the optimal parameters are obtained. The experimental results on the three data sets demonstrate the superior performance of DML-GANR over state-of-the-art techniques in HSR-RSI retrieval.

18 citations

Journal ArticleDOI
TL;DR: A deep metric learning approach with generative adversarial network regularization (DML-GANR) with superior performance over state-of-the-art techniques in HSR-RSI retrieval.
Abstract: With a small number of labeled samples for training, it can save considerable manpower and material resources, especially when the amount of high spatial resolution remote sensing images (HSR-RSIs) increases considerably. However, many deep models face the problem of overfitting when using a small number of labeled samples. This might degrade HSRRSI retrieval accuracy. Aiming at obtaining more accurate HSR-RSI retrieval performance with small training samples, we develop a deep metric learning approach with generative adversarial network regularization (DML-GANR) for HSR-RSI retrieval. The DML-GANR starts from a high-level feature extraction (HFE) to extract high-level features, which includes convolutional layers and fully connected (FC) layers. Each of the FC layers is constructed by deep metric learning (DML) to maximize the interclass variations and minimize the intraclass variations. The generative adversarial network (GAN) is adopted to mitigate the overfitting problem and validate the qualities of extracted high-level features. DML-GANR is optimized through a customized approach, and the optimal parameters are obtained. The experimental results on the three data sets demonstrate the superior performance of DML-GANR over state-of-the-art techniques in HSR-RSI retrieval.

12 citations

Journal ArticleDOI
TL;DR: Wang et al. as mentioned in this paper proposed a deep feature learning with label consistencies (DFL-LC) method to realize hyperspectral images (HSIs) classification, which can effectively extract features from HSI data compared with other traditional hand-crafted methods.
Abstract: Deep learning approaches have recently been widely applied to the classification of hyperspectral images (HSIs) and achieve good capability. Deep learning can effectively extract features from HSI data compared with other traditional hand-crafted methods. Most deep learning methods extract image features through traditional convolution, which has demonstrated impressive ability in HSI classification. However, traditional convolution can only operate convolutions with fixed size and weight on regular square image regions. Moreover, it refers to the spectral features of the adjacent pixels but ignores the spectral features of long-range data with the training sample. Although a graph convolution network (GCN) can process irregular image regions, the pixels’ relationships for graph construction cannot be well ensured with limited iterations. Hence, the extracted features have limited performance with the GCN. Aiming to extract more representative and discriminative image features, in this article, the deep feature learning with label consistencies (DFL-LC) method is developed to realize HSI classification. In the proposed method, a multiscale convolutional neural network is adopted to obtain basic HSI features, and the GCN can further capture relationships between pixels and extract more representative HSI features. For obtaining discriminative features, we add the label consistency of single pixels and label consistency of group pixels regularization in the objective function. It can maintain label consistency for the general and long-range data and alleviate deficiently labeled samples. The experimental results on three representative datasets fully demonstrate that the DFL-LC method is superior to other methods in both quantitative and qualitative aspects.

11 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: This paper mainly works for systematically reviewing the emerging achievements for image retrieval from RS big data, and discusses the RS image retrieval based applications including fusion-oriented RS image processing, geo-localization and disaster rescue.

133 citations

Journal ArticleDOI
TL;DR: Sun et al. as mentioned in this paper proposed a new multi-source data reconstruction-based deep unsupervised hashing method, called MrHash, which explores the characteristics of remote sensing images to construct reliable pseudo-labels.
Abstract: Unsupervised hashing for remote sensing (RS) image retrieval first extracts image features and then use these features to construct supervised information (e.g., pseudo-labels) to train hashing networks. Existing methods usually regard RS images as natural images to extract unisource features. However, these features only contain partial information about ground objects and cannot produce reliable pseudo-labels. In addition, existing methods only generate a pseudo single-label to annotate each RS image, which cannot accurately represent multiple scenes in a RS image. To address these drawbacks, this paper proposes a new Multisource data reconstruction-based deep unsupervised Hashing method, called MrHash, which explores the characteristics of RS images to construct reliable pseudo-labels. In particular, we first use geographic coordinates to obtain different satellite images and develop a novel autoencoder network to extract multisource features from these images. Then pseudo multi-labels are designed to deal with the coexistence of multiple scenes in a single image. These labels are generated by a custom probability function with extracted multisource features. Finally, we propose a novel multi-semantic hash loss by using the Kull-back–Leibler (KL) divergence to preserve the semantic similarity of these pseudo multi-labels in Hamming space. Our newly developed MrHash only uses multisource images to construct supervised information, and hash code generation still relies on a unisource input image. Experiments on benchmark datasets clearly show the superiority of the proposed method over state-of-the-art baselines. https://github.com/sunyuxi/MrHash.

65 citations

Journal ArticleDOI
TL;DR: In this paper, the authors surveyed a random selection of 100 papers from the remote sensing (RS) DL literature and found that RS DL studies have largely abandoned traditional RS accuracy assessment terminology, though some of the accuracy measures typically used in DL papers, most notably precision and recall, have direct equivalents in traditional RS terminology.
Abstract: Convolutional neural network (CNN)-based deep learning (DL) is a powerful, recently developed image classification approach. With origins in the computer vision and image processing communities, the accuracy assessment methods developed for CNN-based DL use a wide range of metrics that may be unfamiliar to the remote sensing (RS) community. To explore the differences between traditional RS and DL RS methods, we surveyed a random selection of 100 papers from the RS DL literature. The results show that RS DL studies have largely abandoned traditional RS accuracy assessment terminology, though some of the accuracy measures typically used in DL papers, most notably precision and recall, have direct equivalents in traditional RS terminology. Some of the DL accuracy terms have multiple names, or are equivalent to another measure. In our sample, DL studies only rarely reported a complete confusion matrix, and when they did so, it was even more rare that the confusion matrix estimated population properties. On the other hand, some DL studies are increasingly paying attention to the role of class prevalence in designing accuracy assessment approaches. DL studies that evaluate the decision boundary threshold over a range of values tend to use the precision-recall (P-R) curve, the associated area under the curve (AUC) measures of average precision (AP) and mean average precision (mAP), rather than the traditional receiver operating characteristic (ROC) curve and its AUC. DL studies are also notable for testing the generalization of their models on entirely new datasets, including data from new areas, new acquisition times, or even new sensors.

47 citations

Journal ArticleDOI
TL;DR: A preliminary benchmark of modern SSL algorithms on popular remote sensing datasets is provided, verifying the potential of SSL in remote sensing and providing an extended study on data augmentations to pave the way for fruitful interaction of both domains.
Abstract: In deep learning research, self-supervised learning (SSL) has received great attention, triggering interest within both the computer vision and remote sensing communities. While there has been big success in computer vision, most of the potential of SSL in the domain of Earth observation remains locked. In this article, we provide an introduction to and a review of the concepts and latest developments in SSL for computer vision in the context of remote sensing. Further, we provide a preliminary benchmark of modern SSL algorithms on popular remote sensing datasets, verifying the potential of SSL in remote sensing and providing an extended study on data augmentations. Finally, we identify a list of promising directions of future research in SSL for Earth observation (SSL4EO) to pave the way for the fruitful interaction of both domains.

38 citations

Journal ArticleDOI
TL;DR: Sun et al. as mentioned in this paper proposed a new multisource data reconstruction-based deep unsupervised hashing method, called MrHash, which explores the characteristics of remote sensing images to construct reliable pseudolabels.
Abstract: Unsupervised hashing for remote sensing (RS) image retrieval first extracts image features and then uses these features to construct supervised information (e.g., pseudolabels) to train hashing networks. Existing methods usually regard RS images as natural images to extract unisource features. However, these features only contain partial information about ground objects and cannot produce reliable pseudolabels. In addition, existing methods only generate a pseudo-single-label to annotate each RS image, which cannot accurately represent multiple scenes in an RS image. To address these drawbacks, this article proposes a new Multisource data reconstruction-based deep unsupervised Hashing method, called MrHash, which explores the characteristics of RS images to construct reliable pseudolabels. In particular, we first use geographic coordinates to obtain different satellite images and develop a novel autoencoder network to extract multisource features from these images. Then, pseudo-multilabels are designed to deal with the coexistence of multiple scenes in a single image. These labels are generated by a custom probability function with extracted multisource features. Finally, we propose a novel multisemantic hash loss by using the Kullback–Leibler (KL) divergence to preserve the semantic similarity of these pseudo-multilabels in Hamming space. Our newly developed MrHash only uses multisource images to construct supervised information, and hash code generation still relies on a unisource input image. Experiments on benchmark datasets clearly show the superiority of the proposed method over state-of-the-art baselines. We have added detailed descriptions about our source code. Please check them by accessing https://github.com/sunyuxi/MrHash.

33 citations