Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy

doi:10.1109/TPAMI.2005.159

Journal Article

Hanchuan Peng, +2 more

- 01 Aug 2005 -

IEEE Transactions on Pattern Analysis an...

- Vol. 27, Iss: 8, pp 1226-1238

TLDR

This article studies how to select good features according to the maximal statistical dependency criterion based on mutual information. Because the maximal dependency condition is difficult to implement directly, the authors derive an equivalent form for first-order incremental feature selection, the minimal-redundancy-maximal-relevance criterion (mRMR).

Abstract:

Feature selection is an important problem for pattern classification systems. We study how to select good features according to the maximal statistical dependency criterion based on mutual information. Because of the difficulty in directly implementing the maximal dependency condition, we first derive an equivalent form, called minimal-redundancy-maximal-relevance criterion (mRMR), for first-order incremental feature selection. Then, we present a two-stage feature selection algorithm by combining mRMR and other more sophisticated feature selectors (e.g., wrappers). This allows us to select a compact set of superior features at very low cost. We perform extensive experimental comparison of our algorithm and other methods using three different classifiers (naive Bayes, support vector machine, and linear discriminant analysis) and four different data sets (handwritten digits, arrhythmia, NCI cancer cell lines, and lymphoma tissues). The results confirm that mRMR leads to promising improvement on feature selection and classification accuracy.
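The first-order incremental selection described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the scoring formula, so the sketch assumes the commonly cited difference form: at each step, pick the unselected feature that maximizes its mutual information with the class minus its mean mutual information with the features already selected. The function names (`mutual_information`, `mrmr_select`), the feature names, and the toy data are hypothetical, and mutual information is estimated from raw counts on discrete values only.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Empirical mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    count_x = Counter(xs)
    count_y = Counter(ys)
    count_xy = Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), n_xy in count_xy.items():
        # p(x, y) * log2(p(x, y) / (p(x) * p(y))), written with raw counts
        mi += (n_xy / n) * log2(n_xy * n / (count_x[x] * count_y[y]))
    return mi


def mrmr_select(features, target, k):
    """Greedy incremental selection with an assumed relevance-minus-redundancy score.

    features: dict mapping a (hypothetical) feature name to a list of discrete values
    target:   list of class labels, same length as each feature column
    k:        number of features to select
    """
    # Relevance: mutual information between each feature and the class labels.
    relevance = {name: mutual_information(col, target)
                 for name, col in features.items()}
    selected = []
    remaining = set(features)
    while remaining and len(selected) < k:
        def score(name):
            if not selected:
                return relevance[name]
            # Redundancy: mean mutual information with already selected features.
            redundancy = sum(mutual_information(features[name], features[s])
                             for s in selected) / len(selected)
            return relevance[name] - redundancy

        # Iterating in sorted order makes tie-breaking deterministic.
        best = max(sorted(remaining), key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data: the class is determined by two independent bits.
# f1 and f2 carry the same bit (fully redundant); f3 carries the other bit.
features = {
    "f1": [0, 0, 1, 1],
    "f2": [0, 0, 1, 1],
    "f3": [0, 1, 0, 1],
}
target = [0, 1, 2, 3]

print(mrmr_select(features, target, 2))
```

In this toy setup all three features are equally relevant to the class, so a relevance-only ranking could not distinguish `f2` from `f3`. The redundancy term is what leads the second pick to `f3` rather than the duplicate `f2`. Continuous features would need discretization or a density estimate before this count-based estimator applies.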

Citations

Journal Article

Extreme Learning Machine for Regression and Multiclass Classification

Guang-Bin Huang, +3 more

TL;DR: ELM provides a unified learning platform with a wide range of feature mappings and can be applied directly in regression and multiclass classification applications; in theory, ELM can approximate any target continuous function and classify any disjoint regions.

Journal Article

A survey on feature selection methods

Girish Chandrashekar, +1 more

- 01 Jan 2014 -

Computers & Electrical Engineering

TL;DR: The objective is to provide a generic introduction to variable elimination which can be applied to a wide array of machine learning problems and focus on Filter, Wrapper and Embedded methods.

Journal Article

A Survey on Human Activity Recognition using Wearable Sensors

Oscar D. Lara, +1 more

- 23 Jan 2013 -

IEEE Communications Surveys and Tutorial...

TL;DR: The state of the art in HAR based on wearable sensors is surveyed, and a two-level taxonomy according to the learning approach and the response time is proposed.

Proceedings Article

Efficient and Robust Feature Selection via Joint ℓ2,1-Norms Minimization

Feiping Nie, +3 more

TL;DR: A new robust feature selection method that emphasizes joint ℓ2,1-norm minimization on both the loss function and the regularization is proposed; it has been applied to both genomic and proteomic biomarker discovery.

Journal Article

Feature Selection: A Data Perspective

Jundong Li, +6 more

- 06 Dec 2017 -

ACM Computing Surveys

TL;DR: This survey revisits feature selection research from a data perspective and reviews representative feature selection algorithms for conventional data, structured data, heterogeneous data and streaming data, and categorizes them into four main groups: similarity-based, information-theoretical-based, sparse-learning-based and statistical-based.

References

Book

Elements of information theory

Thomas M. Cover, +1 more

TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.

Book

The Nature of Statistical Learning Theory

Vladimir Vapnik

TL;DR: Setting of the learning problem; consistency of learning processes; bounds on the rate of convergence of learning processes; controlling the generalization ability of learning processes; constructing learning algorithms; what is important in learning theory?

Journal Article

A Tutorial on Support Vector Machines for Pattern Recognition

Christopher John Burges

- 01 Jun 1998 -

Data Mining and Knowledge Discovery

TL;DR: Several arguments that support the observed high accuracy of SVMs are reviewed, and numerous examples and proofs of most of the key theorems are given.

Journal Article

On Estimation of a Probability Density Function and Mode

Emanuel Parzen

- 01 Sep 1962 -

Annals of Mathematical Statistics

TL;DR: In this paper, the problem of the estimation of a probability density function and of determining the mode of the probability function is discussed. Only estimates which are consistent and asymptotically normal are constructed.

Journal Article

Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling

Ash A. Alizadeh, +29 more

- 03 Feb 2000 -

Nature

TL;DR: It is shown that there is diversity in gene expression among the tumours of DLBCL patients, apparently reflecting the variation in tumour proliferation rate, host response and differentiation state of the tumour.

Related Papers (5)

An introduction to variable and feature selection

Wrappers for feature subset selection

LIBSVM: A library for support vector machines

Gene Selection for Cancer Classification using Support Vector Machines

Random Forests