Author

S. Balakrishnan

Bio: S. Balakrishnan is an academic researcher from College of Information Technology. The author has contributed to research in topics: Statistical classification & Support vector machine. The author has an hindex of 1, co-authored 1 publications receiving 42 citations.

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

SVM ranking with backward search for feature selection in type II diabetes databases

[...]

S. Balakrishnan¹, R. Narayanaswamy, Nickolas Savarimuthu², R. Samikannu³•Institutions (3)

College of Information Technology¹, National Institute of Technology, Tiruchirappalli², VIT University³

01 Oct 2008

TL;DR: A feature selection approach for finding an optimum feature subset that enhances the classification accuracy of Naive .Bayes classifier is proposed and results confirm that SVM Ranking with Backward Search approach leads to promising improvement on feature selection and enhances classification accuracy.

...read moreread less

Abstract: Clinical databases have accumulated large quantities of information about patients and their clinical history. Data mining is the search for relationships and patterns within this data that could provide useful knowledge for effective decision-making. Classification analysis is one of the widely adopted data mining techniques for healthcare applications to support medical diagnosis, improving quality of patient care, etc. Usually medical databases are high dimensional in nature. If a training dataset contains irrelevant features (i.e., attributes), classification analysis may produce less accurate results. Data pre-processing is required to prepare the data for data mining and machine learning to increase the predictive accuracy. Feature selection is a preprocessing technique commonly used on high-dimensional data and its purposes include reducing dimensionality, removing irrelevant and redundant features, reducing the amount of data needed for learning, improving algorithms' predictive accuracy, and increasing the constructed models' comprehensibility. Much research work in data mining has gone into improving the predictive accuracy of the classifiers by applying the techniques of feature selection. The importance of feature selection in medical data mining is appreciable as the diagnosis of the disease could be done in this patient-care activity with minimum number of features. Feature selection may provide us with the means to reduce the number of clinical measures made while still maintaining or even enhancing accuracy and reducing false negative rates. In medical diagnosis, reduction in false negative rate can, literally, be the difference between life and death. In this paper we propose a feature selection approach for finding an optimum feature subset that enhances the classification accuracy of Naive .Bayes classifier. Experiments were conducted on the Pima Indian Diabetes Dataset to assess the effectiveness of our approach. The results confirm that SVM Ranking with Backward Search approach leads to promising improvement on feature selection and enhances classification accuracy.

...read moreread less

45 citations

S. Balakrishnan

Papers

Cited by