Proceedings ArticleDOI

Evaluating the Importance of each Feature in Classification task

TLDR
This research uses two techniques, data partitioning and K-fold cross-validation, to evaluate the importance of each feature in a randomly generated dataset with 5399 instances and 20 attributes, with the aim of improving classification accuracy by identifying the most important features in a given dataset.
Abstract
In machine learning and statistics, attribute/feature selection is used in predictive model construction. It helps the machine interpret the features easily by discovering good insights, and it improves the efficiency of predictive modeling. The objective of our research is to improve classification accuracy by identifying the most important features in a given dataset. In this research, we used two techniques, data partitioning and K-fold cross-validation, to evaluate the importance of each feature in a randomly generated dataset with 5399 instances and 20 attributes. In data partitioning, the attribute with the lowest accuracy is filtered out, whereas in K-fold cross-validation, the attribute with the biggest error is removed from the original dataset. In our experiments, the evaluation parameters considered are recall, precision, and F-measure. Finally, the accuracy rates of the two techniques are compared. Our findings state that the K-fold approach achieves better accuracy (97.03%) than data partitioning (96.11%) in estimating the importance of features in classification.
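As a rough illustration of the two schemes described above, the following sketch scores each attribute with a classifier trained on that attribute alone, once on a single train/test partition and once with K-fold cross-validation. The synthetic dataset, the decision-tree classifier, and the per-feature scoring rule are assumptions made for illustration; they are not the authors' exact experimental setup.

```python
# Hedged sketch of the two feature-evaluation schemes from the abstract.
# The data, classifier, and scoring rule below are illustrative assumptions.
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score, train_test_split
from sklearn.tree import DecisionTreeClassifier

# Stand-in for the paper's randomly generated dataset
# (5399 instances, 20 attributes).
X, y = make_classification(n_samples=5399, n_features=20,
                           n_informative=10, random_state=0)

def partition_score(feature_idx):
    """Accuracy of a classifier trained on one attribute alone,
    measured on a single held-out split (data partitioning scheme)."""
    X_tr, X_te, y_tr, y_te = train_test_split(
        X[:, [feature_idx]], y, test_size=0.3, random_state=0)
    clf = DecisionTreeClassifier(random_state=0).fit(X_tr, y_tr)
    return clf.score(X_te, y_te)

def kfold_error(feature_idx, k=10):
    """Mean classification error of the same classifier under
    K-fold cross-validation (K-fold scheme)."""
    accuracies = cross_val_score(DecisionTreeClassifier(random_state=0),
                                 X[:, [feature_idx]], y, cv=k)
    return 1.0 - accuracies.mean()

# Data partitioning: filter out the attribute with the lowest accuracy.
worst_by_partition = min(range(X.shape[1]), key=partition_score)
# K-fold: remove the attribute with the biggest cross-validated error.
worst_by_kfold = max(range(X.shape[1]), key=kfold_error)

print("least important (partition):", worst_by_partition)
print("least important (K-fold):   ", worst_by_kfold)
```

Under the procedure the abstract describes, the attribute flagged by each scheme would then be removed before retraining on the remaining features.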


References
Journal ArticleDOI

A survey on feature selection methods

TL;DR: The objective is to provide a generic introduction to variable elimination that can be applied to a wide array of machine learning problems, with a focus on filter, wrapper, and embedded methods.
Journal ArticleDOI

Online selection of discriminative tracking features

TL;DR: This paper presents an online feature selection mechanism that evaluates multiple features while tracking and adjusts the set of features used to improve tracking performance, and it notes the susceptibility of the variance-ratio feature selection method to distraction by spatially correlated background clutter.
Journal ArticleDOI

Support vector machines combined with feature selection for breast cancer diagnosis

TL;DR: The results show that the highest classification accuracy (99.51%) is obtained for the SVM model that contains five features, and this is very promising compared to the previously reported results.
Journal ArticleDOI

Robust Joint Graph Sparse Coding for Unsupervised Spectral Feature Selection

TL;DR: This paper proposes a novel joint graph sparse coding (JGSC) model for unsupervised spectral feature selection, embedding a graph regularizer into the framework of joint sparse regression to preserve the local structures of the data.
Journal ArticleDOI

A feature selection model based on genetic rank aggregation for text sentiment classification

TL;DR: An ensemble approach for feature selection is presented, which aggregates the individual feature lists obtained by different feature selection methods so that a more robust and efficient feature subset can be obtained.
Trending Questions (1)
What are the advantages and disadvantages of different feature importance calculation methods?

The paper does not provide specific advantages and disadvantages of different feature importance calculation methods.