scispace - formally typeset
Journal ArticleDOI

Combination of feature selection approaches with SVM in credit scoring

F. L. Chen, +1 more
- 01 Jul 2010 - 
- Vol. 37, Iss: 7, pp 4902-4909
TLDR
This study suggests that hybrid credit scoring approach is mostly robust and effective in finding optimal subsets and is a promising method to the fields of data mining.
Abstract
The credit scoring has been regarded as a critical topic and its related departments make efforts to collect huge amount of data to avoid wrong decision. An effective classificatory model will objectively help managers instead of intuitive experience. This study proposes four approaches combining with the SVM (support vector machine) classifier for features selection that retains sufficient information for classification purpose. Different credit scoring models are constructed by selecting attributes with four approaches. Two UCI (University of California, Irvine) data sets are chosen to evaluate the accuracy of various hybrid-SVM models. SVM classifier combines with conventional statistical LDA, Decision tree, Rough sets and F-score approaches as features pre-processing step to optimize feature space by removing both irrelevant and redundant features. In this paper, the procedure of the proposed approaches will be described and then evaluated by their performances. The results are compared in combination with SVM classifier and nonparametric Wilcoxon signed rank test will be held to show if there is any significant difference between these models. The result in this study suggests that hybrid credit scoring approach is mostly robust and effective in finding optimal subsets and is a promising method to the fields of data mining.

read more

Citations
More filters
Journal ArticleDOI

An Improved Particle Swarm Optimization for Feature Selection

TL;DR: This paper designs a modified Multi-Swarm PSO (MSPSO) to solve discrete problems, and proposes an Improved Feature Selection (IFS) method by integrating MSPSO, Support Vector Machines (SVM) with F-score method to achieve higher generalization capability.
Journal ArticleDOI

Deep Learning based Radiomics (DLR) and its usage in noninvasive IDH1 prediction for low grade glioma.

TL;DR: The performance of DLR for predicting the mutation status of isocitrate dehydrogenase 1 (IDH1) was validated in a dataset of 151 patients with low-grade glioma and the AUC of IDH1 estimation was improved to 95% using DLR based on multiple-modality MR images, suggesting DLR could be a powerful way to extract deep information from medical images.
Journal ArticleDOI

Machine Learning in Financial Crisis Prediction: A Survey

TL;DR: This paper presents the current achievements and limitations associated with the development of bankruptcy-prediction and credit-scoring models employing machine learning, and provides suggestions for future research.
Journal ArticleDOI

Information gain directed genetic algorithm wrapper feature selection for credit rating

TL;DR: A novel approach to feature selection in credit scoring applications is proposed, called Information Gain Directed Feature Selection algorithm (IGDFS), which performs the ranking of features based on information gain, propagates the top m features through the GA wrapper (GAW) algorithm using three classical machine learning algorithms of KNN, Naive Bayes and Support Vector Machine for credit scoring.
Journal ArticleDOI

The effect of feature selection on financial distress prediction

TL;DR: A comprehensive study is conducted to examine the effect of performing filter and wrapper based feature selection methods on financial distress prediction and finds that on average performing the genetic algorithm and logistic regression for feature selection can provide prediction improvements over the credit and bankruptcy datasets respectively.
References
More filters
Journal ArticleDOI

LIBSVM: A library for support vector machines

TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.
Book

The Nature of Statistical Learning Theory

TL;DR: Setting of the learning problem consistency of learning processes bounds on the rate of convergence ofLearning processes controlling the generalization ability of learning process constructing learning algorithms what is important in learning theory?
Journal ArticleDOI

Classification and regression trees

TL;DR: This article gives an introduction to the subject of classification and regression trees by reviewing some widely available algorithms and comparing their capabilities, strengths, and weakness in two examples.
Journal ArticleDOI

An introduction to variable and feature selection

TL;DR: The contributions of this special issue cover a wide range of aspects of variable selection: providing a better definition of the objective function, feature construction, feature ranking, multivariate feature selection, efficient search methods, and feature validity assessment methods.
Related Papers (5)