
Showing papers on "Feature selection published in 2016"


Journal ArticleDOI
TL;DR: This paper presents a comprehensive survey of the state-of-the-art work on EC for feature selection, which identifies the contributions of these different algorithms.
Abstract: Feature selection is an important task in data mining and machine learning to reduce the dimensionality of the data and increase the performance of an algorithm, such as a classification algorithm. However, feature selection is a challenging task due mainly to the large search space. A variety of methods have been applied to solve feature selection problems, where evolutionary computation (EC) techniques have recently gained much attention and shown some success. However, there are no comprehensive guidelines on the strengths and weaknesses of alternative approaches. This leads to a disjointed and fragmented field with ultimately lost opportunities for improving performance and successful applications. This paper presents a comprehensive survey of the state-of-the-art work on EC for feature selection, which identifies the contributions of these different algorithms. In addition, current issues and challenges are also discussed to identify promising areas for future research.

1,237 citations


Journal ArticleDOI
TL;DR: Results prove the capability of the proposed binary version of grey wolf optimization (bGWO) to search the feature space for optimal feature combinations regardless of the initialization and the used stochastic operators.

958 citations


Journal ArticleDOI
TL;DR: A hybrid model in which an unsupervised DBN is trained to extract generic underlying features and a one-class SVM is trained on the features learned by the DBN; the model delivers accuracy comparable to that of a deep autoencoder while being scalable and computationally efficient.

876 citations


Journal Article
TL;DR: The MLR package provides a generic, object-oriented, and extensible framework for classification, regression, survival analysis and clustering for the R language and includes meta-algorithms and model selection techniques to improve and extend the functionality of basic learners with, e.g., hyperparameter tuning, feature selection, and ensemble construction.
Abstract: The MLR package provides a generic, object-oriented, and extensible framework for classification, regression, survival analysis and clustering for the R language. It provides a unified interface to more than 160 basic learners and includes meta-algorithms and model selection techniques to improve and extend the functionality of basic learners with, e.g., hyperparameter tuning, feature selection, and ensemble construction. Parallel high-performance computing is natively supported. The package targets practitioners who want to quickly apply machine learning algorithms, as well as researchers who want to implement, benchmark, and compare their new methods in a structured environment.

502 citations


Journal ArticleDOI
TL;DR: The evaluation results show that the feature selection algorithm contributes more critical features for LSSVM-IDS to achieve better accuracy and lower computational cost compared with the state-of-the-art methods.
Abstract: Redundant and irrelevant features in data have caused a long-term problem in network traffic classification. These features not only slow down the process of classification but also prevent a classifier from making accurate decisions, especially when coping with big data. In this paper, we propose a mutual information based algorithm that analytically selects the optimal feature for classification. This mutual information based feature selection algorithm can handle linearly and nonlinearly dependent data features. Its effectiveness is evaluated in the cases of network intrusion detection. An Intrusion Detection System (IDS), named Least Square Support Vector Machine based IDS (LSSVM-IDS), is built using the features selected by our proposed feature selection algorithm. The performance of LSSVM-IDS is evaluated using three intrusion detection evaluation datasets, namely KDD Cup 99, NSL-KDD and Kyoto 2006+ dataset. The evaluation results show that our feature selection algorithm contributes more critical features for LSSVM-IDS to achieve better accuracy and lower computational cost compared with the state-of-the-art methods.
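The core filter idea here, ranking features by their mutual information with the class label, can be sketched in a few lines. The snippet below is an illustrative, stdlib-only toy for discrete features, not the paper's LSSVM-IDS algorithm, which additionally handles linearly and nonlinearly dependent features analytically; the toy data is invented.

```python
from collections import Counter
from math import log

def mutual_info(xs, ys):
    """I(X;Y) in nats for two equal-length sequences of discrete values."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum((c / n) * log((c / n) / ((px[x] / n) * (py[y] / n)))
               for (x, y), c in pxy.items())

def select_top_k(features, labels, k):
    """features: list of columns (each a list of discrete values).
    Returns the indices of the k features most informative about labels."""
    ranked = sorted(range(len(features)),
                    key=lambda j: mutual_info(features[j], labels),
                    reverse=True)
    return ranked[:k]

# Toy data: feature 0 matches the label exactly, feature 1 is nearly unrelated.
labels = [0, 0, 1, 1, 0, 1, 0, 1]
f0 = list(labels)                  # perfectly informative
f1 = [0, 1, 0, 1, 0, 1, 0, 1]      # weakly informative
print(select_top_k([f0, f1], labels, k=1))  # -> [0]
```

Real intrusion-detection features are continuous, so a practical version would first discretize them (or estimate mutual information directly), and would also penalize redundancy between already-selected features rather than ranking each feature in isolation.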

406 citations


Journal ArticleDOI
TL;DR: The basic taxonomy of feature selection is presented, and the state-of-the-art gene selection methods are reviewed by grouping the literatures into three categories: supervised, unsupervised, and semi-supervised.
Abstract: Recently, feature selection and dimensionality reduction have become fundamental tools for many data mining tasks, especially for processing high-dimensional data such as gene expression microarray data. Gene expression microarray data comprises up to hundreds of thousands of features with a relatively small sample size. Because learning algorithms usually do not work well with this kind of data, a challenge arises: to reduce the data dimensionality. A huge number of gene selection methods have been applied to select a subset of relevant features for model construction and to seek better cancer classification performance. This paper presents the basic taxonomy of feature selection, and also reviews the state-of-the-art gene selection methods by grouping the literature into three categories: supervised, unsupervised, and semi-supervised. The comparison of experimental results on top 5 representative gene expression datasets indicates that the classification accuracy of unsupervised and semi-supervised feature selection is competitive with supervised feature selection.

402 citations


Journal ArticleDOI
TL;DR: Two R functions stepAIC() and bestglm() are well designed for stepwise and best subset regression, respectively.
Abstract: While purposeful selection is performed partly by software and partly by hand, the stepwise and best subset approaches are automatically performed by software. Two R functions, stepAIC() and bestglm(), are well designed for stepwise and best subset regression, respectively. The stepAIC() function begins with a full or null model, and the method for stepwise regression can be specified in the direction argument with character values “forward”, “backward” and “both”. The bestglm() function begins with a data frame containing the explanatory variables and the response variable. The response variable should be in the last column. A variety of goodness-of-fit criteria can be specified in the IC argument. The Bayesian information criterion (BIC) usually results in a more parsimonious model than the Akaike information criterion.
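For readers outside R, the forward-stepwise search that stepAIC(direction = "forward") performs can be sketched in Python. This is a hypothetical illustration using NumPy least squares and the Gaussian-likelihood form of AIC on synthetic data, not the stepAIC() implementation itself; the function names and the toy data are invented for the example.

```python
import numpy as np

def aic_ols(X, y):
    """AIC for OLS with Gaussian errors, constants dropped: n*log(RSS/n) + 2k."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    rss = float(np.sum((y - X @ beta) ** 2))
    n, k = X.shape
    return n * np.log(rss / n) + 2 * k

def forward_stepwise(X, y):
    """Greedily add the predictor that most lowers AIC; stop when none does."""
    n, p = X.shape
    chosen = []
    design = np.ones((n, 1))            # start from the intercept-only model
    best = aic_ols(design, y)
    while len(chosen) < p:
        cands = [j for j in range(p) if j not in chosen]
        trials = {j: aic_ols(np.hstack([design, X[:, [j]]]), y) for j in cands}
        j = min(trials, key=trials.get)
        if trials[j] >= best:           # no candidate improves AIC: stop
            break
        best, chosen = trials[j], chosen + [j]
        design = np.hstack([design, X[:, [j]]])
    return chosen

# Synthetic data: only predictors 0 and 1 carry signal.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = 2.0 * X[:, 0] - 3.0 * X[:, 1] + 0.1 * rng.normal(size=200)
print(forward_stepwise(X, y))           # the informative predictors enter first
```

A backward pass works symmetrically (drop the predictor whose removal most lowers AIC); swapping the `2 * k` penalty for `np.log(n) * k` gives the more parsimonious BIC behaviour the abstract mentions.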

378 citations


Journal ArticleDOI
TL;DR: A novel feature-based emotion recognition model is proposed for EEG-based Brain-Computer Interfaces which combines statistical-based feature selection methods and SVM emotion classifiers and incorporates additional features which are relevant for signal pre-processing and recognition classification tasks.
Abstract: A feature-based emotion recognition model is proposed for EEG-based BCI. The approach combines statistical-based feature selection methods and SVM emotion classifiers. The model is based on Valence/Arousal dimensions for emotion classification. Our combined approach outperformed other recognition methods. Current emotion recognition computational techniques have been successful in associating emotional changes with EEG signals, and so they can be identified and classified from EEG signals if appropriate stimuli are applied. However, automatic recognition is usually restricted to a small number of emotion classes, mainly due to signal features and noise, EEG constraints and subject-dependent issues. In order to address these issues, in this paper a novel feature-based emotion recognition model is proposed for EEG-based Brain-Computer Interfaces. Unlike other approaches, our method explores a wider set of emotion types and incorporates additional features which are relevant for signal pre-processing and recognition classification tasks, based on a dimensional model of emotions: Valence and Arousal. It aims to improve the accuracy of the emotion classification task by combining mutual information based feature selection methods and kernel classifiers. Experiments using our approach for emotion classification, which combines efficient feature selection methods and efficient kernel-based classifiers, on standard EEG datasets show the promise of the approach when compared with state-of-the-art computational methods.

368 citations


Journal ArticleDOI
TL;DR: Binary variants of the ant lion optimizer (ALO) are proposed and used to select the optimal feature subset for classification purposes in wrapper-mode and prove the capability of the proposed binary algorithms to search the feature space for optimal feature combinations regardless of the initialization and the used stochastic operators.

357 citations


Journal ArticleDOI
Kaiye Wang1, Ran He1, Liang Wang1, Wei Wang1, Tieniu Tan1 
TL;DR: An iterative algorithm is presented to solve the proposed joint learning problem, along with its convergence analysis, and experimental results on cross-modal retrieval tasks demonstrate that the proposed method outperforms the state-of-the-art subspace approaches.
Abstract: Cross-modal retrieval has recently drawn much attention due to the widespread existence of multimodal data. It takes one type of data as the query to retrieve relevant data objects of another type, and generally involves two basic problems: the measure of relevance and coupled feature selection. Most previous methods just focus on solving the first problem. In this paper, we aim to deal with both problems in a novel joint learning framework. To address the first problem, we learn projection matrices to map multimodal data into a common subspace, in which the similarity between different modalities of data can be measured. In the learning procedure, the $\ell_{21}$-norm penalties are imposed on the projection matrices separately to solve the second problem, which selects relevant and discriminative features from different feature spaces simultaneously. A multimodal graph regularization term is further imposed on the projected data, which preserves the inter-modality and intra-modality similarity relationships. An iterative algorithm is presented to solve the proposed joint learning problem, along with its convergence analysis. Experimental results on cross-modal retrieval tasks demonstrate that the proposed method outperforms the state-of-the-art subspace approaches.
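The ℓ21-norm penalty used here has a simple closed form: the sum of the Euclidean norms of the rows of the projection matrix, so minimizing it drives whole rows (and hence whole features) toward zero jointly. A minimal NumPy illustration of the penalty itself (the matrix below is made up; the paper's full optimization is not reproduced):

```python
import numpy as np

def l21_norm(W):
    """Sum of the Euclidean norms of the rows of W.
    Penalizing this drives entire rows (i.e., entire features) to zero."""
    return float(np.sum(np.linalg.norm(W, axis=1)))

# Toy projection matrix: row 1 is already "switched off" and costs nothing.
W = np.array([[3.0, 4.0],
              [0.0, 0.0],
              [1.0, 0.0]])
print(l21_norm(W))  # 5 + 0 + 1 = 6.0
```

This row-wise coupling is what distinguishes the ℓ21-norm from an elementwise ℓ1 penalty: a feature is kept or discarded for all projection dimensions at once.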

302 citations


Journal ArticleDOI
01 Jun 2016
TL;DR: The proposed hybrid feature selection algorithm, called HPSO-LS, uses a local search technique which is embedded in particle swarm optimization to select a reduced-size and salient feature subset and to enhance the search process near global optima.
Abstract: The proposed method uses a local search technique which is embedded in particle swarm optimization (PSO) to select a reduced-size and salient feature subset. The goal of the local search technique is to guide the PSO search process to select distinct features by using their correlation information. Therefore, the proposed method selects the subset of features with reduced redundancy. A hybrid feature selection method based on particle swarm optimization is proposed. Our method uses a novel local search to enhance the search process near global optima. The method efficiently finds the discriminative features with reduced correlations. The size of the final feature set is determined using a subset size detection scheme. Our method is compared with well-known and state-of-the-art feature selection methods. Feature selection has been widely used in data mining and machine learning tasks to make a model with a small number of features which improves the classifier's accuracy. In this paper, a novel hybrid feature selection algorithm based on particle swarm optimization is proposed. The proposed method, called HPSO-LS, uses a local search strategy which is embedded in the particle swarm optimization to select the less correlated and salient feature subset. The goal of the local search technique is to guide the search process of the particle swarm optimization to select distinct features by considering their correlation information. Moreover, the proposed method utilizes a subset size determination scheme to select a subset of features with reduced size. The performance of the proposed method has been evaluated on 13 benchmark classification problems and compared with five state-of-the-art feature selection methods.
Moreover, HPSO-LS has been compared with four well-known filter-based methods including information gain, term variance, Fisher score and mRMR, and five well-known wrapper-based methods including genetic algorithm, particle swarm optimization, simulated annealing and ant colony optimization. The results demonstrate that the proposed method improves the classification accuracy compared with those of the filter-based and wrapper-based feature selection methods. Furthermore, several statistical tests show that the proposed method's superiority over the other methods is statistically significant.
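The wrapper search underlying methods like HPSO-LS can be illustrated with a bare-bones binary PSO over feature bitmasks. The sketch below uses only the standard library and a made-up fitness function, and deliberately omits the paper's correlation-guided local search and subset size detection scheme; every name and constant here is an assumption for illustration only.

```python
import random
from math import exp

def binary_pso(n_features, fitness, n_particles=10, iters=50, seed=0):
    """Bare-bones binary PSO: positions are feature bitmasks, and velocities
    are squashed through a sigmoid to give the probability of each bit."""
    rng = random.Random(seed)
    X = [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(n_particles)]
    V = [[0.0] * n_features for _ in range(n_particles)]
    pbest = [x[:] for x in X]
    gbest = max(pbest, key=fitness)[:]
    for _ in range(iters):
        for i in range(n_particles):
            for d in range(n_features):
                r1, r2 = rng.random(), rng.random()
                V[i][d] = (0.7 * V[i][d]                         # inertia
                           + 1.4 * r1 * (pbest[i][d] - X[i][d])  # cognitive pull
                           + 1.4 * r2 * (gbest[d] - X[i][d]))    # social pull
                X[i][d] = 1 if rng.random() < 1 / (1 + exp(-V[i][d])) else 0
            if fitness(X[i]) > fitness(pbest[i]):
                pbest[i] = X[i][:]
                if fitness(pbest[i]) > fitness(gbest):
                    gbest = pbest[i][:]
    return gbest

# Made-up wrapper fitness: features 0-2 are "useful", every selected feature
# costs 1, so the ideal mask selects exactly features 0, 1, and 2.
def toy_fitness(mask):
    return 2 * sum(mask[:3]) - sum(mask)

best = binary_pso(8, toy_fitness)
```

In a real wrapper, `toy_fitness` would be replaced by cross-validated classifier accuracy (optionally penalized by subset size), which is where nearly all of the computational cost lies.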

Journal ArticleDOI
TL;DR: The experimental results of this study show that the developed hybrid method is able to select good features for classification tasks to improve run-time performance and accuracy of the classifier.
Abstract: We developed a hybrid method for feature selection in classification tasks. Our hybrid method combines Artificial Bee Colony with Differential Evolution. We performed experiments over fifteen datasets from the UCI Repository. Our method selects good features without reducing the accuracy of classification. By selecting features with our method, we reduced the time required for classification. "Dimensionality" is one of the major problems which affect the quality of the learning process in most machine learning and data mining tasks. Training a classification model on high-dimensional datasets may lead to "overfitting" of the learned model to the training data. Overfitting reduces the generalization of the model and therefore causes poor classification accuracy on new test instances. Another disadvantage of high dimensionality is the high CPU time required for learning and testing the model. Applying feature selection to the dataset before the learning process is essential to improve the performance of the classification task. In this study, a new hybrid method which combines the artificial bee colony optimization technique with the differential evolution algorithm is proposed for feature selection in classification tasks. The developed hybrid method is evaluated by using fifteen datasets from the UCI Repository which are commonly used in classification problems. To make a complete evaluation, the proposed hybrid feature selection method is compared with the artificial bee colony optimization and differential evolution based feature selection methods, as well as with the three most popular feature selection techniques: information gain, chi-square, and correlation feature selection. In addition, the performance of the proposed method is compared with studies in the literature which use the same datasets.
The experimental results of this study show that our hybrid method is able to select good features for classification tasks and thereby improve the run-time performance and accuracy of the classifier. The proposed hybrid method may also be applied to other search and optimization problems, as its performance for feature selection is better than pure artificial bee colony optimization and differential evolution.

Journal ArticleDOI
TL;DR: This study proposes a fault-relevant variable selection and Bayesian inference-based distributed method for efficient fault detection and isolation, which reduces redundancy and complexity, explores numerous local behaviors, and provides accurate description of faults, thus improving monitoring performance significantly.
Abstract: Multivariate statistical process monitoring involves dimension reduction and latent feature extraction in large-scale processes and typically incorporates all measured variables. However, involving variables without beneficial information may degrade monitoring performance. This study analyzes the effect of variable selection on principal component analysis (PCA) monitoring performance. Then, it proposes a fault-relevant variable selection and Bayesian inference-based distributed method for efficient fault detection and isolation. First, the optimal subset of variables is identified for each fault using an optimization algorithm. Second, a sub-PCA model is established in each subset. Finally, the monitoring results of all of the subsets are combined through Bayesian inference. The proposed method reduces redundancy and complexity, explores numerous local behaviors, and provides accurate description of faults, thus improving monitoring performance significantly. Case studies on a numerical example, the Tennessee Eastman benchmark process, and an industrial-scale plant demonstrate the efficiency.

Journal ArticleDOI
TL;DR: A computationally efficient algorithm is devised to optimize the derived objective function, and the convergence of the proposed optimization method is theoretically proved.
Abstract: In image analysis, images are often represented by multiple visual features (also known as multiview features) that aim to better interpret them for achieving remarkable learning performance. Since the processes of feature extraction on each view are separated, the multiple visual features of images may include overlap, noise, and redundancy. Thus, learning with all the derived views of the data could decrease the effectiveness. To address this, this paper simultaneously conducts a hierarchical feature selection and a multiview multilabel (MVML) learning for multiview image classification, via embedding a proposed new block-row regularizer into the MVML framework. The block-row regularizer, concatenating a Frobenius norm (${F}$-norm) regularizer and an $\boldsymbol {\ell }_{\textbf {2,1}}$-norm regularizer, is designed to conduct a hierarchical feature selection, in which the ${F}$-norm regularizer is used to conduct a high-level feature selection for selecting the informative views (i.e., discarding the uninformative views) and the $\boldsymbol {\ell }_{\textbf {2,1}}$-norm regularizer is then used to conduct a low-level feature selection on the informative views. The rationale of the block-row regularizer is to avoid over-fitting (via the block-row regularizer), to remove redundant views and preserve the natural group structures of data (via the ${F}$-norm regularizer), and to remove noisy features (via the $\boldsymbol {\ell }_{\textbf {2,1}}$-norm regularizer). We further devise a computationally efficient algorithm to optimize the derived objective function and also theoretically prove the convergence of the proposed optimization method. Finally, the results on real image datasets show that the proposed method outperforms two baseline algorithms and three state-of-the-art algorithms in terms of classification performance.

Journal ArticleDOI
TL;DR: A learning-based image registration framework is proposed that uses deep learning to discover compact and highly discriminative features upon observed imaging data that scales well to new image modalities or new image applications with little to no human intervention.
Abstract: Feature selection is a critical step in deformable image registration. In particular, selecting the most discriminative features that accurately and concisely describe complex morphological patterns in image patches improves correspondence detection, which in turn improves image registration accuracy. Furthermore, since more and more imaging modalities are being invented to better identify morphological changes in medical imaging data, the development of deformable image registration method that scales well to new image modalities or new image applications with little to no human intervention would have a significant impact on the medical image analysis community. To address these concerns, a learning-based image registration framework is proposed that uses deep learning to discover compact and highly discriminative features upon observed imaging data. Specifically, the proposed feature selection method uses a convolutional stacked autoencoder to identify intrinsic deep feature representations in image patches. Since deep learning is an unsupervised learning method, no ground truth label knowledge is required. This makes the proposed feature selection method more flexible to new imaging modalities since feature representations can be directly learned from the observed imaging data in a very short amount of time. Using the LONI and ADNI imaging datasets, image registration performance was compared to two existing state-of-the-art deformable image registration methods that use handcrafted features. To demonstrate the scalability of the proposed image registration framework, image registration experiments were conducted on 7.0-T brain MR images. In all experiments, the results showed that the new image registration framework consistently demonstrated more accurate registration results when compared to state of the art.

Journal ArticleDOI
TL;DR: The experimental results show that unsupervised feature selection algorithms benefit machine learning tasks by improving the performance of clustering.

Journal ArticleDOI
TL;DR: It is proved that the newly-defined entropy meets the common requirement of monotonicity and can equivalently characterize the existing attribute reductions in the fuzzy rough set theory.

Journal ArticleDOI
TL;DR: Feature selection, as a data preprocessing strategy, has proven to be effective and efficient in preparing data (especially high-dimensional data) for various data mining and machine learning problems.
Abstract: Feature selection, as a data preprocessing strategy, has been proven to be effective and efficient in preparing data (especially high-dimensional data) for various data mining and machine learning problems. The objectives of feature selection include: building simpler and more comprehensible models, improving data mining performance, and preparing clean, understandable data. The recent proliferation of big data has presented some substantial challenges and opportunities to feature selection. In this survey, we provide a comprehensive and structured overview of recent advances in feature selection research. Motivated by current challenges and opportunities in the era of big data, we revisit feature selection research from a data perspective and review representative feature selection algorithms for conventional data, structured data, heterogeneous data and streaming data. Methodologically, to emphasize the differences and similarities of most existing feature selection algorithms for conventional data, we categorize them into four main groups: similarity based, information theoretical based, sparse learning based and statistical based methods. To facilitate and promote the research in this community, we also present an open-source feature selection repository that consists of most of the popular feature selection algorithms (\url{this http URL}). Also, we use it as an example to show how to evaluate feature selection algorithms. At the end of the survey, we present a discussion about some open problems and challenges that require more attention in future research.

Journal ArticleDOI
TL;DR: An ensemble-based multi-filter feature selection method that combines the output of four filter methods to achieve an optimum selection that can effectively reduce the number of features and has a high detection rate and classification accuracy when compared to other classification techniques.
Abstract: Widespread adoption of cloud computing has increased the attractiveness of such services to cybercriminals. Distributed denial of service (DDoS) attacks that target the cloud's bandwidth, services and resources to render the cloud unavailable to both cloud providers and users are a common form of attack. In recent times, feature selection has been identified as a pre-processing phase in cloud DDoS attack defence which can potentially increase classification accuracy and reduce computational complexity by identifying important features from the original dataset during supervised learning. In this work, we propose an ensemble-based multi-filter feature selection method that combines the output of four filter methods to achieve an optimum selection. We then perform an extensive experimental evaluation of our proposed method using an intrusion detection benchmark dataset, NSL-KDD, and a decision tree classifier. The findings show that our proposed method can effectively reduce the number of features from 41 to 13 and has a high detection rate and classification accuracy when compared to other classification techniques.
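The combination step of an ensemble multi-filter scheme can be sketched very simply: keep a feature when enough of the individual filters rank it highly. The rankings and thresholds below are made up for illustration; they are not the paper's four filters or its NSL-KDD result.

```python
from collections import Counter

def ensemble_select(rankings, k, min_votes):
    """rankings: one ranked feature list per filter method (best first).
    A feature is kept if it appears in the top-k of at least min_votes filters."""
    votes = Counter(f for r in rankings for f in r[:k])
    return sorted(f for f, v in votes.items() if v >= min_votes)

# Hypothetical rankings from four filter methods over features 0..5.
rankings = [
    [0, 2, 1, 3, 4, 5],
    [2, 0, 3, 1, 5, 4],
    [0, 1, 2, 5, 3, 4],
    [3, 2, 0, 4, 1, 5],
]
print(ensemble_select(rankings, k=3, min_votes=3))  # -> [0, 2]
```

Voting over top-k lists is only one aggregation rule; averaging rank positions or filter scores are common alternatives with the same intent of smoothing out the bias of any single filter.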

Journal ArticleDOI
01 Dec 2016-Methods
TL;DR: This paper formulates feature selection as a combinatorial optimization or search problem and categorizes feature selection methods into exhaustive search, heuristic search, and hybrid methods, where heuristic search methods may further be categorized into those with or without data-distilled feature ranking measures.

Journal ArticleDOI
TL;DR: Experimental results show that the fusion of both accelerometer and gyroscope data contributes to obtain better recognition performance than that of using single source data, and that the proposed feature selector outperforms three other comparative approaches in terms of four performance measures.
Abstract: Activity recognition plays an essential role in bridging the gap between low-level sensor data and high-level applications in ambient-assisted living systems. With the aim of obtaining a satisfactory recognition rate and adapting to various application scenarios, a variety of sensors have been exploited, among which smartphone-embedded inertial sensors are widely applied due to their convenience, low cost, and low intrusiveness. In this paper, we explore the power of the triaxial accelerometer and gyroscope built into a smartphone in recognizing human physical activities in situations where they are used simultaneously or separately. A novel feature selection approach is then proposed in order to select a subset of discriminant features, construct an online activity recognizer with better generalization ability, and reduce the smartphone's power consumption. Experimental results on a publicly available data set show that the fusion of both accelerometer and gyroscope data contributes to better recognition performance than using single-source data, and that the proposed feature selector outperforms three other comparative approaches in terms of four performance measures. In addition, great improvement in time performance can be achieved with an effective feature selector, indicating a way of saving power and the applicability of the approach to real-world activity recognition.

Journal ArticleDOI
TL;DR: Results demonstrate that pre-trained neural networks represent microstructure image data well, and when used for feature extraction yield the highest classification accuracies for the majority of classifier and feature selection methods tested, suggesting that deep learning algorithms can successfully be applied to micrograph recognition tasks.

Proceedings Article
12 Feb 2016
TL;DR: This work proposes an unsupervised feature selection approach which performs feature selection and local structure learning simultaneously, and constrain the similarity matrix to make it contain more accurate information of data structure, thus the proposed approach can select more valuable features.
Abstract: Since large amounts of unlabelled, high-dimensional data need to be processed, unsupervised feature selection has become an important and challenging problem in machine learning. Conventional embedded unsupervised methods always need to construct the similarity matrix, which makes the selected features highly dependent on the learned structure. However, real-world data always contain many noisy samples and features, so the similarity matrix obtained from the original data cannot be fully relied upon. We propose an unsupervised feature selection approach which performs feature selection and local structure learning simultaneously; the similarity matrix can thus be determined adaptively. Moreover, we constrain the similarity matrix to make it contain more accurate information about the data structure, so the proposed approach can select more valuable features. An efficient and simple algorithm is derived to optimize the problem. Experiments on various benchmark data sets, including handwritten digit data, face image data and biomedical data, validate the effectiveness of the proposed approach.

Journal ArticleDOI
TL;DR: This paper proposes a novel adversary-aware feature selection model that can improve classifier security against evasion attacks, by incorporating specific assumptions on the adversary's data manipulation strategy.
Abstract: Pattern recognition and machine learning techniques have been increasingly adopted in adversarial settings such as spam, intrusion, and malware detection, although their security against well-crafted attacks that aim to evade detection by manipulating data at test time has not yet been thoroughly assessed. While previous work has been mainly focused on devising adversary-aware classification algorithms to counter evasion attempts, only few authors have considered the impact of using reduced feature sets on classifier security against the same attacks. An interesting, preliminary result is that classifier security to evasion may be even worsened by the application of feature selection. In this paper, we provide a more detailed investigation of this aspect, shedding some light on the security properties of feature selection against evasion attacks. Inspired by previous work on adversary-aware classifiers, we propose a novel adversary-aware feature selection model that can improve classifier security against evasion attacks, by incorporating specific assumptions on the adversary’s data manipulation strategy. We focus on an efficient, wrapper-based implementation of our approach, and experimentally validate its soundness on different application examples, including spam and malware detection.

Posted Content
TL;DR: An efficient, scalable feature extraction algorithm for time series, which filters the available features in an early stage of the machine learning pipeline with respect to their significance for the classification or regression task, while controlling the expected percentage of selected but irrelevant features.
Abstract: The all-relevant problem of feature selection is the identification of all strongly and weakly relevant attributes. This problem is especially hard to solve for time series classification and regression in industrial applications such as predictive maintenance or production line optimization, for which each label or regression target is associated with several time series and meta-information simultaneously. Here, we are proposing an efficient, scalable feature extraction algorithm for time series, which filters the available features in an early stage of the machine learning pipeline with respect to their significance for the classification or regression task, while controlling the expected percentage of selected but irrelevant features. The proposed algorithm combines established feature extraction methods with a feature importance filter. It has a low computational complexity, allows one to start on a problem with only limited domain knowledge available, can be trivially parallelized, is highly scalable, and is based on well-studied non-parametric hypothesis tests. We benchmark our proposed algorithm on all binary classification problems of the UCR time series classification archive, as well as on time series from a production line optimization project and simulated stochastic processes with underlying qualitative change of dynamics.
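The "controlling the expected percentage of selected but irrelevant features" step amounts to multiple-testing correction on per-feature p-values. Below is a stdlib-only sketch of the Benjamini-Hochberg step-up procedure, a close relative of the FDR control the abstract describes (not the paper's exact procedure); the p-values are hypothetical.

```python
def benjamini_hochberg(pvals, alpha=0.05):
    """Return the indices of hypotheses rejected at FDR level alpha.
    Step-up rule: find the largest rank k with p_(k) <= alpha * k / m,
    then reject the k smallest p-values."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    k = 0
    for rank, i in enumerate(order, start=1):
        if pvals[i] <= alpha * rank / m:
            k = rank
    return sorted(order[:k])

# Hypothetical per-feature relevance p-values: features 0 and 2 survive.
pvals = [0.001, 0.8, 0.01, 0.04, 0.6]
print(benjamini_hochberg(pvals, alpha=0.05))  # -> [0, 2]
```

In the pipeline the abstract describes, each extracted feature gets one such p-value from a non-parametric relevance test against the target, and only features passing the FDR threshold continue to model training.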

Journal ArticleDOI
TL;DR: In this paper, the authors present a new package in R implementing Bayesian additive regression trees (BART), which introduces many new features for data analysis using BART such as variable selection, interaction detection, model diagnostic plots, incorporation of missing data and the ability to save trees for future prediction.
Abstract: We present a new package in R implementing Bayesian additive regression trees (BART). The package introduces many new features for data analysis using BART such as variable selection, interaction detection, model diagnostic plots, incorporation of missing data and the ability to save trees for future prediction. It is significantly faster than the current R implementation, parallelized, and capable of handling both large sample sizes and high-dimensional data.

Journal ArticleDOI
TL;DR: The proposed algorithm is a data-driven and robust automatic sleep staging scheme that uses a single-channel EEG signal, and its non-REM 1 stage detection accuracy is better than that of most existing works.

Journal ArticleDOI
TL;DR: A novel lightweight feature selection method is proposed, designed particularly for mining streaming data on the fly; it uses an accelerated particle swarm optimization (APSO) type of swarm search and achieves enhanced analytical accuracy within reasonable processing time.
Abstract: Although Big Data is often dismissed as hype, it raises many technical challenges that confront both academic research communities and commercial IT deployment, and its root sources are data streams and the curse of dimensionality. Data sourced from streams accumulate continuously, making traditional batch-based model induction algorithms infeasible for real-time data mining. Feature selection has been widely used to lighten the processing load of inducing a data mining model. However, when mining high-dimensional data, the search space from which an optimal feature subset is derived grows exponentially in size, leading to an intractable demand for computation. To tackle this problem, which stems mainly from the high dimensionality and streaming format of Big Data feeds, a novel lightweight feature selection method is proposed. It is designed particularly for mining streaming data on the fly, using an accelerated particle swarm optimization (APSO) type of swarm search that achieves enhanced analytical accuracy within reasonable processing time. In this paper, a collection of Big Data sets with an exceptionally large degree of dimensionality is put to the test of our new feature selection algorithm for performance evaluation.
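A swarm-search wrapper of this kind can be sketched with a simplified binary variant. This is illustrative only: the update rule below merely mimics APSO's single-attractor idea (no per-particle memory term), and the KNN fitness function and all parameters are assumptions.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(2)
X, y = make_classification(n_samples=200, n_features=15, n_informative=4,
                           n_redundant=2, random_state=2)

def fitness(mask):
    """Cross-validated accuracy of a KNN classifier on the masked
    features, with a small penalty for larger subsets."""
    if not mask.any():
        return 0.0
    acc = cross_val_score(KNeighborsClassifier(), X[:, mask], y, cv=3).mean()
    return acc - 0.01 * mask.sum()

# Simplified accelerated-PSO-style search: each particle is a binary
# feature mask, pulled toward the global best plus a random kick.
n_particles, n_iter, beta = 12, 25, 0.5
swarm = rng.random((n_particles, 15)) < 0.5
best = max(swarm, key=fitness).copy()
for _ in range(n_iter):
    for i in range(n_particles):
        flip = rng.random(15) < 0.2            # random exploration kick
        pull = rng.random(15) < beta           # attraction to global best
        swarm[i] = np.where(pull, best, swarm[i] ^ flip)
        if fitness(swarm[i]) > fitness(best):
            best = swarm[i].copy()
print("selected feature indices:", np.flatnonzero(best))
```

Dropping the per-particle memory term is what makes accelerated variants cheap enough for streaming settings: each particle needs only its current mask and the shared global best.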

Proceedings ArticleDOI
01 Aug 2016
TL;DR: A new method based on Random Forest is proposed to select variables using Mean Decrease Accuracy (MDA) and Mean Decrease Gini (MDG), together with a dichotomy screening step that proves to be very fast.
Abstract: Variable selection is very important for interpretation and prediction, especially for high-dimensional datasets. In this paper, a new method based on Random Forest (RF) is proposed to select variables using Mean Decrease Accuracy (MDA) and Mean Decrease Gini (MDG). We also use a dichotomy method to screen variables, which proves to be very fast. Experiments on 10 microarray datasets show that the new method is efficient and robust. In addition, we compared the proposed method with other variable selection methods; the results demonstrate that our method is more robust and more powerful in both accuracy and CPU time.
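In scikit-learn terms, the two criteria map naturally onto impurity-based importance (MDG) and permutation importance (MDA). The following is a minimal sketch of combining them, not the authors' implementation; the top-half cutoff is an arbitrary assumption.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

# shuffle=False puts the 3 informative variables in columns 0-2.
X, y = make_classification(n_samples=300, n_features=10, n_informative=3,
                           n_redundant=0, shuffle=False, random_state=3)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=3)

rf = RandomForestClassifier(n_estimators=200, random_state=3).fit(X_tr, y_tr)

mdg = rf.feature_importances_                      # Mean Decrease Gini
mda = permutation_importance(rf, X_te, y_te,       # Mean Decrease Accuracy
                             n_repeats=10, random_state=3).importances_mean

# Keep variables that rank in the top half under both criteria.
k = X.shape[1] // 2
top_mdg = set(np.argsort(mdg)[-k:])
top_mda = set(np.argsort(mda)[-k:])
selected = sorted(top_mdg & top_mda)
print("selected:", selected)
```

Requiring agreement between the two rankings is one simple way to gain robustness: MDG is computed for free during training but can favor high-cardinality variables, while MDA is slower but measured on held-out data.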

Journal ArticleDOI
TL;DR: This paper shows how to jointly exploit the two types of sensors for accurate gesture recognition by introducing a set of novel feature descriptors both for the Leap Motion and for depth data.
Abstract: Novel 3D acquisition devices like depth cameras and the Leap Motion have recently reached the market. Depth cameras allow one to obtain a complete 3D description of the framed scene, while the Leap Motion sensor is a device explicitly targeted at hand gesture recognition that provides only a limited set of relevant points. This paper shows how to jointly exploit the two types of sensors for accurate gesture recognition. An ad-hoc solution for the joint calibration of the two devices is first presented. Then a set of novel feature descriptors is introduced, both for the Leap Motion and for depth data. Various schemes based on the distances of the hand samples from the centroid, on the curvature of the hand contour, and on the convex hull of the hand shape are employed, and the use of Leap Motion data to aid feature extraction is also considered. The proposed feature sets are fed to two different classifiers, one based on multi-class SVMs and one exploiting Random Forests. Different feature selection algorithms have also been tested in order to reduce the complexity of the approach. Experimental results show that a very high accuracy can be obtained with the proposed method. The current implementation is also able to run in real time.
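The centroid-distance family of descriptors can be illustrated with a small sketch: a normalized histogram of contour-point distances from the centroid. This is a simplified, hypothetical variant for exposition, not the paper's exact descriptor.

```python
import numpy as np

rng = np.random.default_rng(4)

def centroid_distance_features(points, n_bins=16):
    """Histogram of sample distances from the shape centroid,
    normalized so the descriptor is scale-invariant."""
    c = points.mean(axis=0)
    d = np.linalg.norm(points - c, axis=1)
    d = d / d.max()                      # scale invariance
    hist, _ = np.histogram(d, bins=n_bins, range=(0.0, 1.0))
    return hist / hist.sum()             # descriptor sums to 1

# Toy "hand contour": noisy samples on a circle.
theta = rng.uniform(0, 2 * np.pi, size=200)
pts = np.c_[np.cos(theta), np.sin(theta)] + rng.normal(scale=0.05, size=(200, 2))
f = centroid_distance_features(pts)
print(f.shape)
```

A fixed-length, scale-normalized vector like this is exactly the kind of descriptor that can be concatenated with curvature or convex-hull features and fed to an SVM or Random Forest classifier.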