scispace - formally typeset
Search or ask a question

Showing papers on "Feature vector published in 2022"


Journal ArticleDOI
TL;DR: A novel convolution autoencoder architecture that can dissociate the spatio-temporal representation to separately capture the spatial and the temporal information is explored, since abnormal events are usually different from the normality in appearance and/or motion behavior.

61 citations


Journal ArticleDOI
TL;DR: The obtained results show that the proposed method exhibits accurate and robust forecasting performance and outperforms conventional regression models.

54 citations


Journal ArticleDOI
Saibo Xing1, Yaguo Lei1, Shuhui Wang1, Na Lu1, Naipeng Li1 
TL;DR: Results show that the proposed method is effective in diagnosing unseen compound faults of machines, and is able to recognize mechanical compound faults when only the data of single faults are accessible for training.

52 citations


Journal ArticleDOI
TL;DR: New cognitive computing with the big data analysis tool for Sentiment Analysis is presented and improved classification performance of the proposed BBSO-FCM model is highlighted in terms of different measures.
Abstract: Advancements in recent networking and information technology have always been a natural phenomenon. The exponential amount of data generated by the people in their day-to-day lives results in the rise of Big Data Analytics (BDA). Cognitive computing is an Artificial Intelligence (AI) based system that can reduce the issues faced during BDA. On the other hand, Sentiment Analysis (SA) is employed to understand such linguistic based tweets, feature extraction, compute subjectivity and sentimental texts placed in these tweets. The application of SA on big data finds it useful for businesses to take commercial benefits insight from text-oriented content. In this view, this paper presents new cognitive computing with the big data analysis tool for SA. The proposed model involves various process such as pre-processing, feature extraction, feature selection and classification. For handling big data, Hadoop Map Reduce tool is used. The proposed model initially undergoes pre-processing to remove the unwanted words. Then, Term Frequency-Inverse Document Frequency (TF-IDF) is utilized as a feature extraction technique to extract the set of feature vectors. Besides, a Binary Brain Storm Optimization (BBSO) algorithm is being used for the Feature Selection (FS) process and thereby achieving improved classification performance. Moreover, Fuzzy Cognitive Maps (FCMs) are used as a classifier to classify the incidence of positive or negative sentiments. A comprehensive experimental results analysis ensures the better performance of the presented BBSO-FCM model on the benchmark dataset. The obtained experimental values highlights the improved classification performance of the proposed BBSO-FCM model in terms of different measures.

50 citations


Journal ArticleDOI
TL;DR: In this paper , a cross-view similarity graph learning term with matrix-induced regularization is embedded to preserve the local structure of data in the label space, which is referred to as CvLP-DCL.
Abstract: Although demonstrating great success, previous multi-view unsupervised feature selection (MV-UFS) methods often construct a view-specific similarity graph and characterize the local structure of data within each single view. In such a way, the cross-view information could be ignored. In addition, they usually assume that different feature views are projected from a latent feature space while the diversity of different views cannot be fully captured. In this work, we resent a MV-UFS model via cross-view local structure preserved diversity and consensus learning, referred to as CvLP-DCL briefly. In order to exploit both the shared and distinguishing information across different views, we project each view into a label space, which consists of a consensus part and a view-specific part. Therefore, we regularize the fact that different views represent same samples. Meanwhile, a cross-view similarity graph learning term with matrix-induced regularization is embedded to preserve the local structure of data in the label space. By imposing the $l_{2,1}$ -norm on the feature projection matrices for constraining row sparsity, discriminative features can be selected from different views. An efficient algorithm is designed to solve the resultant optimization problem and extensive experiments on six publicly datasets are conducted to validate the effectiveness of the proposed CvLP-DCL.

42 citations


Journal ArticleDOI
TL;DR: In this paper , a label description space embedded model for intelligent fault diagnosis (LDS-IFD) is proposed, which consists of three stages, i.e., feature learning, pre-judgment and fault recognition.

36 citations


Journal ArticleDOI
TL;DR: In this paper , the EEG signals are first decomposed in sub-bands using empirical wavelet transform (EWT) based on the Fourier Bessel series expansion (FBSE-EWT).

31 citations


Journal ArticleDOI
TL;DR: Wang et al. as discussed by the authors proposed an Adaptive Correlation (Ad-Corre) loss to guide the network towards generating embedded feature vectors with high correlation for within-class samples and less correlation for betweenclass samples.
Abstract: Automated Facial Expression Recognition (FER) in the wild using deep neural networks is still challenging due to intra-class variations and inter-class similarities in facial images. Deep Metric Learning (DML) is among the widely used methods to deal with these issues by improving the discriminative power of the learned embedded features. This paper proposes an Adaptive Correlation (Ad-Corre) Loss to guide the network towards generating embedded feature vectors with high correlation for within-class samples and less correlation for between-class samples. Ad-Corre consists of 3 components called Feature Discriminator, Mean Discriminator, and Embedding Discriminator. We design the Feature Discriminator component to guide the network to create the embedded feature vectors to be highly correlated if they belong to a similar class, and less correlated if they belong to different classes. In addition, the Mean Discriminator component leads the network to make the mean embedded feature vectors of different classes to be less similar to each other. We use Xception network as the backbone of our model, and contrary to previous work, we propose an embedding feature space that contains $k$ feature vectors. Then, the Embedding Discriminator component penalizes the network to generate the embedded feature vectors, which are dissimilar. We trained our model using the combination of our proposed loss functions called Ad-Corre Loss jointly with the cross-entropy loss. We achieved a very promising recognition accuracy on AffectNet, RAF-DB, and FER-2013. Our extensive experiments and ablation study indicate the power of our method to cope well with challenging FER tasks in the wild. The code is available on Github.

30 citations


Journal ArticleDOI
TL;DR: In this paper, the EEG signals are first decomposed in sub-bands using empirical wavelet transform (EWT) based on the Fourier Bessel series expansion (FBSE) which is termed as FBSE-EWT.

30 citations


Journal ArticleDOI
TL;DR: In this paper, an action-independent Gaussian mixture model (AIGMM) is trained on the extracted features of all fine-grained actions to analyze spatio-temporal information and preserve the local similarities among fine grained actions.

29 citations


Journal ArticleDOI
01 Mar 2022-Sensors
TL;DR: In this work, different statistical features of vibration signals in time and frequency domains are studied for the detection and localisation of faults in the roller bearings and the proposed method reduces the computational burden but also the classification time.
Abstract: In condition based maintenance, different signal processing techniques are used to sense the faults through the vibration and acoustic emission signals, received from the machinery. These signal processing approaches mostly utilise time, frequency, and time-frequency domain analysis. The features obtained are later integrated with the different machine learning techniques to classify the faults into different categories. In this work, different statistical features of vibration signals in time and frequency domains are studied for the detection and localisation of faults in the roller bearings. These are later classified into healthy, outer race fault, inner race fault, and ball fault classes. The statistical features including skewness, kurtosis, average and root mean square values of time domain vibration signals are considered. These features are extracted from the second derivative of the time domain vibration signals and power spectral density of vibration signals. The vibration signal is also converted to the frequency domain and the same features are extracted. All three feature sets are concatenated, creating the time, frequency and spectral power domain feature vectors. These feature vectors are finally fed into the K- nearest neighbour, support vector machine and kernel linear discriminant analysis for the detection and classification of bearing faults. With the proposed method, the reduction percentage of more than 95% percent is achieved, which not only reduces the computational burden but also the classification time. Simulation results show that the signals are classified to achieve an average accuracy of 99.13% using KLDA and 96.64% using KNN classifiers. The results are also compared with the empirical mode decomposition (EMD) features and Fourier transform features without extracting any statistical information, which are two of the most widely used approaches in the literature. To gain a certain level of confidence in the classification results, a detailed statistical analysis is also provided.

Journal ArticleDOI
TL;DR: In this article , a new deep learning method based on the target gray-level distribution constraint mechanism model was proposed to solve the infrared dim small target detection problem in the complex environment.
Abstract: A new deep learning method based on the target gray-level distribution constraint mechanism model is proposed to solve the infrared dim small target detection problem in the complex environment. First, to solve the uneven distribution of positive and negative samples, the designed smoothness operator is used to suppress the background and enhancement target by measuring the difference in their features in 1D and 2D gradient. Second, an infrared dim small target detection network based on dense feature fusion, namely the DFFIR-net network, is proposed. The DFFIR-net enhances the feature expression of dim small targets by integrating the original features and the smoothness features of gray-level gradient. Also, the DFFIR-net alleviates the problem of sparse feature extraction. Finally, a multiscale 2D Gaussian label generation strategy is proposed. This strategy is critical in supervising the training of DFFIR-net in multi-dimensional Gaussian space, improving the feature exploration ability of the network and detection performance under small training samples. The experimental results show that compared with the existing advanced detection methods, the proposed method has higher accuracy and lower false alarm rates in various complex scenes.

Journal ArticleDOI
TL;DR: In this paper , an instance-wise softmax embedding was proposed for unsupervised embedding learning, which directly performs the optimization over the augmented instance features with the binary discrmination softmax encoding.
Abstract: Deep embedding learning plays a key role in learning discriminative feature representations, where the visually similar samples are pulled closer and dissimilar samples are pushed away in the low-dimensional embedding space. This paper studies the unsupervised embedding learning problem by learning such a representation without using any category labels. This task faces two primary challenges: mining reliable positive supervision from highly similar fine-grained classes, and generalizing to unseen testing categories. To approximate the positive concentration and negative separation properties in category-wise supervised learning, we introduce a data augmentation invariant and instance spreading feature using the instance-wise supervision. We also design two novel domain-agnostic augmentation strategies to further extend the supervision in feature space, which simulates the large batch training using a small batch size and the augmented features. To learn such a representation, we propose a novel instance-wise softmax embedding, which directly perform the optimization over the augmented instance features with the binary discrmination softmax encoding. It significantly accelerates the learning speed with much higher accuracy than existing methods, under both seen and unseen testing categories. The unsupervised embedding performs well even without pre-trained network over samples from fine-grained categories. We also develop a variant using category-wise supervision, namely category-wise softmax embedding, which achieves competitive performance over the state-of-of-the-arts, without using any auxiliary information or restrict sample mining.

Journal ArticleDOI
TL;DR: A hybrid deep transfer learning-based approach to PD classification could lead to hitting rates higher than 99%.

Proceedings ArticleDOI
01 Jan 2022
TL;DR: The density-based novelty detection algorithm is so well-grounded in the essence of the method that it is reasonable to use it as the OOD detection algorithm without making any requirements for the feature distribution, and a simple yet effective method, named KNN-contrastive learning, is proposed.
Abstract: The Out-of-Domain (OOD) intent classification is a basic and challenging task for dialogue systems. Previous methods commonly restrict the region (in feature space) of In-domain (IND) intent features to be compact or simply-connected implicitly, which assumes no OOD intents reside, to learn discriminative semantic features. Then the distribution of the IND intent features is often assumed to obey a hypothetical distribution (Gaussian mostly) and samples outside this distribution are regarded as OOD samples. In this paper, we start from the nature of OOD intent classification and explore its optimization objective. We further propose a simple yet effective method, named KNN-contrastive learning. Our approach utilizes k-nearest neighbors (KNN) of IND intents to learn discriminative semantic features that are more conducive to OOD detection.Notably, the density-based novelty detection algorithm is so well-grounded in the essence of our method that it is reasonable to use it as the OOD detection algorithm without making any requirements for the feature distribution.Extensive experiments on four public datasets show that our approach can not only enhance the OOD detection performance substantially but also improve the IND intent classification while requiring no restrictions on feature distribution.

Journal ArticleDOI
01 Jan 2022
TL;DR: In this paper , the authors proposed a new data augmentation network, which consists of a GAN, an autoencoder, and an auxiliary classifier to synthesize class-dependent feature vectors in both the latent space and the original feature space, which can be augmented to the real training data for training classifiers.
Abstract: When training data are scarce, it is challenging to train a deep neural network without causing the overfitting problem. For overcoming this challenge, this article proposes a new data augmentation network-namely adversarial data augmentation network (ADAN)- based on generative adversarial networks (GANs). The ADAN consists of a GAN, an autoencoder, and an auxiliary classifier. These networks are trained adversarially to synthesize class-dependent feature vectors in both the latent space and the original feature space, which can be augmented to the real training data for training classifiers. Instead of using the conventional cross-entropy loss for adversarial training, the Wasserstein divergence is used in an attempt to produce high-quality synthetic samples. The proposed networks were applied to speech emotion recognition using EmoDB and IEMOCAP as the evaluation data sets. It was found that by forcing the synthetic latent vectors and the real latent vectors to share a common representation, the gradient vanishing problem can be largely alleviated. Also, results show that the augmented data generated by the proposed networks are rich in emotion information. Thus, the resulting emotion classifiers are competitive with state-of-the-art speech emotion recognition systems.

Journal ArticleDOI
TL;DR: Wang et al. as discussed by the authors proposed to perform online clustering by conducting twin contrastive learning (TCL) at the instance and cluster level, where the data is projected into a feature space with a dimensionality of the target cluster number.
Abstract: This paper proposes to perform online clustering by conducting twin contrastive learning (TCL) at the instance and cluster level. Specifically, we find that when the data is projected into a feature space with a dimensionality of the target cluster number, the rows and columns of its feature matrix correspond to the instance and cluster representation, respectively. Based on the observation, for a given dataset, the proposed TCL first constructs positive and negative pairs through data augmentations. Thereafter, in the row and column space of the feature matrix, instance- and cluster-level contrastive learning are respectively conducted by pulling together positive pairs while pushing apart the negatives. To alleviate the influence of intrinsic false-negative pairs and rectify cluster assignments, we adopt a confidence-based criterion to select pseudo-labels for boosting both the instance- and cluster-level contrastive learning. As a result, the clustering performance is further improved. Besides the elegant idea of twin contrastive learning, another advantage of TCL is that it could independently predict the cluster assignment for each instance, thus effortlessly fitting online scenarios. Extensive experiments on six widely-used image and text benchmarks demonstrate the effectiveness of TCL. The code is released on https://pengxi.me .

Journal ArticleDOI
TL;DR: Wang et al. as discussed by the authors proposed a new method based on the Time Series Transformer (TST) to recognize the fault modes of the various rotating machinery, which can handle data in 1D format.

Journal ArticleDOI
TL;DR: In this article , a reverse GNN model is proposed to learn the graph from the intrinsic space of the original data points as well as to investigate a new out-of-sample extension method.
Abstract: Graph neural networks (GNNs) conduct feature learning by taking into account the local structure preservation of the data to produce discriminative features, but need to address the following issues, i.e., 1) the initial graph containing faulty and missing edges often affect feature learning and 2) most GNN methods suffer from the issue of out-of-example since their training processes do not directly generate a prediction model to predict unseen data points. In this work, we propose a reverse GNN model to learn the graph from the intrinsic space of the original data points as well as to investigate a new out-of-sample extension method. As a result, the proposed method can output a high-quality graph to improve the quality of feature learning, while the new method of out-of-sample extension makes our reverse GNN method available for conducting supervised learning and semi-supervised learning. Experimental results on real-world datasets show that our method outputs competitive classification performance, compared to state-of-the-art methods, in terms of semi-supervised node classification, out-of-sample extension, random edge attack, link prediction, and image retrieval.

Journal ArticleDOI
TL;DR: In this article , a Gamma-Poisson matrix factorization on substring counts and a min-hash encoder are proposed to encode high-cardinality string categorical variables.
Abstract: Statistical models usually require vector representations of categorical variables, using for instance one-hot encoding. This strategy breaks down when the number of categories grows, as it creates high-dimensional feature vectors. Additionally, for string entries, one-hot encoding does not capture information in their representation.Here, we seek low-dimensional encoding of high-cardinality string categorical variables. Ideally, these should be: scalable to many categories; interpretable to end users; and facilitate statistical analysis. We introduce two encoding approaches for string categories: a Gamma-Poisson matrix factorization on substring counts, and the min-hash encoder, for fast approximation of string similarities. We show that min-hash turns set inclusions into inequality relations that are easier to learn. Both approaches are scalable and streamable. Experiments on real and simulated data show that these methods improve supervised learning with high-cardinality categorical variables. We recommend the following: if scalability is central, the min-hash encoder is the best option as it does not require any data fit; if interpretability is important, the Gamma-Poisson factorization is the best alternative, as it can be interpreted as one-hot encoding on inferred categories with informative feature names. Both models enable autoML on the original string entries as they remove the need for feature engineering or data cleaning.

Journal ArticleDOI
TL;DR: In this paper, a novel framework utilizing neural network-based concepts along with reduced feature vectors and multiple machine learning techniques was constructed to classify the mitotic and non-mitotic cells.

Journal ArticleDOI
TL;DR: In this article, a semi-supervised learning technique, namely pseudo-label stacked auto-encoder (PLSAE), was used to detect Android malware, which involves training using a set of labeled and unlabeled instances.
Abstract: Android has become the target of attackers because of its popularity. The detection of Android mobile malware has become increasingly important due to its significant threat. Supervised machine learning, which has been used to detect Android malware is far from perfect because it requires a significant amount of labeled data. Since labeled data is expensive and difficult to get while unlabeled data is abundant and cheap in this context, we resort to a semi-supervised learning technique, namely pseudo-label stacked auto-encoder (PLSAE), which involves training using a set of labeled and unlabeled instances. We use a hybrid approach of dynamic analysis and static analysis to craft feature vectors. We evaluate our proposed model on CICMalDroid2020, which includes 17,341 most recent samples of five different Android apps categories. After that, we compare the results with state-of-the-art techniques in terms of accuracy and efficiency. Experimental results show that our proposed framework outperforms other semi-supervised approaches and common machine learning algorithms.

Journal ArticleDOI
TL;DR: A novel UFS approach is proposed by integrating local linear embedding (LLE) and manifold regularization constrained in feature subspace into a unified framework, and a tailored iterative algorithm based on Alternative Direction Method of Multipliers (ADMM) is designed to solve the proposed optimization problem.

Journal ArticleDOI
TL;DR: In this paper , the authors propose a method to decompose over feature spaces the variance explained by a banded ridge regression model, effectively ignoring non-predictive and redundant feature spaces.

Journal ArticleDOI
TL;DR: Wang et al. as discussed by the authors proposed an enhanced deep clustering network (EDCN), which is composed of a Feature Extractor, a Conditional Generator, a Discriminator and a Siamese Network.

Journal ArticleDOI
TL;DR: Wang et al. as discussed by the authors proposed a joint optimization clustering framework through introducing the contractive representation in feature learning and utilizing focal loss in the clustering layer to improve label assignment in deep clustering method.

Journal ArticleDOI
TL;DR: In this article , a 3D-CNN framework was proposed to preserve the multivariate structure and dependencies of the feature space of EEG data, and a layer-wise decomposition model was implemented using 3D CNN framework to secure reliable classification results on a single-trial basis.
Abstract: Convolutional neural networks (CNNs) have recently been applied to electroencephalogram (EEG)-based brain-computer interfaces (BCIs). EEG is a noninvasive neuroimaging technique, which can be used to decode user intentions. Because the feature space of EEG data is highly dimensional and signal patterns are specific to the subject, appropriate methods for feature representation are required to enhance the decoding accuracy of the CNN model. Furthermore, neural changes exhibit high variability between sessions, subjects within a single session, and trials within a single subject, resulting in major issues during the modeling stage. In addition, there are many subject-dependent factors, such as frequency ranges, time intervals, and spatial locations at which the signal occurs, which prevent the derivation of a robust model that can achieve the parameterization of these factors for a wide range of subjects. However, previous studies did not attempt to preserve the multivariate structure and dependencies of the feature space. In this study, we propose a method to generate a spatiospectral feature representation that can preserve the multivariate information of EEG data. Specifically, 3-D feature maps were constructed by combining subject-optimized and subject-independent spectral filters and by stacking the filtered data into tensors. In addition, a layer-wise decomposition model was implemented using our 3-D-CNN framework to secure reliable classification results on a single-trial basis. The average accuracies of the proposed model were 87.15% (±7.31), 75.85% (±12.80), and 70.37% (±17.09) for the BCI competition data sets IV_2a, IV_2b, and OpenBMI data, respectively. These results are better than those obtained by state-of-the-art techniques, and the decomposition model obtained the relevance scores for neurophysiologically plausible electrode channels and frequency domains, confirming the validity of the proposed approach.

Journal ArticleDOI
TL;DR: In this paper , an intelligent learning approach for the accurate prediction of antifungal peptides is presented, where the sequential and evolutionary features are explored by three promising descriptors namely conjoint triad feature (CTF), Pseudo-position specific scoring matrix (PsePSSM), and Position-specific scoring matrix-Discrete wavelet transform (PSSM-DWT), which achieved a higher prediction accuracy of 97.81% and 93.92% using training and independent datasets, respectively.

Journal ArticleDOI
TL;DR: In this article , a novel way of visualizing and understanding the vector space before the NNs' output layer is presented, aiming to enlighten the deep feature vectors' properties under classification tasks.
Abstract: One of the most prominent attributes of Neural Networks (NNs) constitutes their capability of learning to extract robust and descriptive features from high dimensional data, like images. Hence, such an ability renders their exploitation as feature extractors particularly frequent in an abundance of modern reasoning systems. Their application scope mainly includes complex cascade tasks, like multi-modal recognition and deep Reinforcement Learning (RL). However, NNs induce implicit biases that are difficult to avoid or to deal with and are not met in traditional image descriptors. Moreover, the lack of knowledge for describing the intra-layer properties -and thus their general behavior- restricts the further applicability of the extracted features. With the paper at hand, a novel way of visualizing and understanding the vector space before the NNs' output layer is presented, aiming to enlighten the deep feature vectors' properties under classification tasks. Main attention is paid to the nature of overfitting in the feature space and its adverse effect on further exploitation. We present the findings that can be derived from our model's formulation and we evaluate them on realistic recognition scenarios, proving its prominence by improving the obtained results.

Journal ArticleDOI
TL;DR: In this article , a grid-based deep feature generator was proposed for breast cancer detection using breast ultrasonography (BUS) images, which achieved 97.18% classification accuracy on the ultrasonic images for three classes, namely malignant, benign and normal.
Abstract: Breast cancer is a prominent cancer type with high mortality. Early detection of breast cancer could serve to improve clinical outcomes. Ultrasonography is a digital imaging technique used to differentiate benign and malignant tumors. Several artificial intelligence techniques have been suggested in the literature for breast cancer detection using breast ultrasonography (BUS). Nowadays, particularly deep learning methods have been applied to biomedical images to achieve high classification performances.This work presents a new deep feature generation technique for breast cancer detection using BUS images. The widely known 16 pre-trained CNN models have been used in this framework as feature generators. In the feature generation phase, the used input image is divided into rows and columns, and these deep feature generators (pre-trained models) have applied to each row and column. Therefore, this method is called a grid-based deep feature generator. The proposed grid-based deep feature generator can calculate the error value of each deep feature generator, and then it selects the best three feature vectors as a final feature vector. In the feature selection phase, iterative neighborhood component analysis (INCA) chooses 980 features as an optimal number of features. Finally, these features are classified by using a deep neural network (DNN).The developed grid-based deep feature generation-based image classification model reached 97.18% classification accuracy on the ultrasonic images for three classes, namely malignant, benign, and normal.The findings obviously denoted that the proposed grid deep feature generator and INCA-based feature selection model successfully classified breast ultrasonic images.