Home
/
Authors
/
Chieh-Jen Wang

Author

Chieh-Jen Wang

Bio: Chieh-Jen Wang is an academic researcher from Huafan University. The author has contributed to research in topics: Support vector machine & Feature selection. The author has an hindex of 2, co-authored 2 publications receiving 1884 citations.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

A GA-based feature selection and parameters optimizationfor support vector machines

[...]

Cheng-Lung Huang¹, Chieh-Jen Wang²•Institutions (2)

National Kaohsiung First University of Science and Technology¹, Huafan University²

01 Aug 2006-Expert Systems With Applications

TL;DR: This research presents a genetic algorithm approach for feature selection and parameters optimization to solve the problem of optimizing parameters and feature subset without degrading the SVM classification accuracy.

...read moreread less

Abstract: Support Vector Machines, one of the new techniques for pattern classification, have been widely used in many application areas. The kernel parameters setting for SVM in a training process impacts on the classification accuracy. Feature selection is another factor that impacts classification accuracy. The objective of this research is to simultaneously optimize the parameters and feature subset without degrading the SVM classification accuracy. We present a genetic algorithm approach for feature selection and parameters optimization to solve this kind of problem. We tried several real-world datasets using the proposed GA-based approach and the Grid algorithm, a traditional method of performing parameters searching. Compared with the Grid algorithm, our proposed GA-based approach significantly improves the classification accuracy and has fewer input features for support vector machines. q 2005 Elsevier Ltd. All rights reserved.

...read moreread less

1,316 citations

Journal Article•DOI•

Credit scoring with a data mining approach based on support vector machines

[...]

Cheng-Lung Huang¹, Mu-Chen Chen², Chieh-Jen Wang³•Institutions (3)

National Kaohsiung First University of Science and Technology¹, National Chiao Tung University², Huafan University³

01 Nov 2007-Expert Systems With Applications

TL;DR: Experimental results show that SVM is a promising addition to the existing data mining methods and three strategies to construct the hybrid SVM-based credit scoring models are used.

...read moreread less

Abstract: The credit card industry has been growing rapidly recently, and thus huge numbers of consumers' credit data are collected by the credit department of the bank. The credit scoring manager often evaluates the consumer's credit with intuitive experience. However, with the support of the credit classification model, the manager can accurately evaluate the applicant's credit score. Support Vector Machine (SVM) classification is currently an active research area and successfully solves classification problems in many domains. This study used three strategies to construct the hybrid SVM-based credit scoring models to evaluate the applicant's credit score from the applicant's input features. Two credit datasets in UCI database are selected as the experimental data to demonstrate the accuracy of the SVM classifier. Compared with neural networks, genetic programming, and decision tree classifiers, the SVM classifier achieved an identical classificatory accuracy with relatively few input features. Additionally, combining genetic algorithms with SVM classifier, the proposed hybrid GA-SVM strategy can simultaneously perform feature selection task and model parameters optimization. Experimental results show that SVM is a promising addition to the existing data mining methods.

...read moreread less

766 citations

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Recent advances and applications of machine learning in solid-state materials science

[...]

Jonathan Schmidt¹, Mário R. G. Marques¹, Silvana Botti², Miguel A. L. Marques¹•Institutions (2)

Martin Luther University of Halle-Wittenberg¹, University of Jena²

08 Aug 2019

TL;DR: A comprehensive overview and analysis of the most recent research in machine learning principles, algorithms, descriptors, and databases in materials science, and proposes solutions and future research paths for various challenges in computational materials science.

...read moreread less

Abstract: One of the most exciting tools that have entered the material science toolbox in recent years is machine learning. This collection of statistical methods has already proved to be capable of considerably speeding up both fundamental and applied research. At present, we are witnessing an explosion of works that develop and apply machine learning to solid-state systems. We provide a comprehensive overview and analysis of the most recent research in this topic. As a starting point, we introduce machine learning principles, algorithms, descriptors, and databases in materials science. We continue with the description of different machine learning approaches for the discovery of stable materials and the prediction of their crystal structure. Then we discuss research in numerous quantitative structure–property relationships and various approaches for the replacement of first-principle methods by machine learning. We review how active learning and surrogate-based optimization can be applied to improve the rational design process and related examples of applications. Two major questions are always the interpretability of and the physical understanding gained from machine learning models. We consider therefore the different facets of interpretability and their importance in materials science. Finally, we propose solutions and future research paths for various challenges in computational materials science.

...read moreread less

1,301 citations

Journal Article•DOI•

Definitions, methods, and applications in interpretable machine learning.

[...]

W. James Murdoch¹, Chandan Singh¹, Karl Kumbier¹, Reza Abbasi-Asl², Reza Abbasi-Asl¹, Bin Yu¹ - Show less +2 more•Institutions (2)

University of California, Berkeley¹, University of California, San Francisco²

29 Oct 2019-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The authors define interpretability in the context of machine learning and introduce the predictive, descriptive, relevant (PDR) framework for discussing interpretations, with three overarching desiderata for evaluation: predictive accuracy, descriptive accuracy, and relevancy, with relevance judged relative to a human audience.

...read moreread less

Abstract: Machine-learning models have demonstrated great success in learning complex patterns that enable them to make predictions about unobserved data. In addition to using models for prediction, the ability to interpret what a model has learned is receiving an increasing amount of attention. However, this increased focus has led to considerable confusion about the notion of interpretability. In particular, it is unclear how the wide array of proposed interpretation methods are related and what common concepts can be used to evaluate them. We aim to address these concerns by defining interpretability in the context of machine learning and introducing the predictive, descriptive, relevant (PDR) framework for discussing interpretations. The PDR framework provides 3 overarching desiderata for evaluation: predictive accuracy, descriptive accuracy, and relevancy, with relevancy judged relative to a human audience. Moreover, to help manage the deluge of interpretation methods, we introduce a categorization of existing techniques into model-based and post hoc categories, with subgroups including sparsity, modularity, and simulatability. To demonstrate how practitioners can use the PDR framework to evaluate and understand interpretations, we provide numerous real-world examples. These examples highlight the often underappreciated role played by human audiences in discussions of interpretability. Finally, based on our framework, we discuss limitations of existing methods and directions for future work. We hope that this work will provide a common vocabulary that will make it easier for both practitioners and researchers to discuss and choose from the full range of interpretation methods.

...read moreread less

851 citations

Journal Article•DOI•

Particle swarm optimization for parameter determination and feature selection of support vector machines

[...]

Shih-Wei Lin¹, Kuo-Ching Ying¹, Shih-Chieh Chen², Zne-Jung Lee¹•Institutions (2)

Huafan University¹, National Taiwan University of Science and Technology²

01 Nov 2008-Expert Systems With Applications

TL;DR: Experimental results demonstrate that the classification accuracy rates of the developed approach surpass those of grid search and many other approaches, and that the developed PSO+SVM approach has a similar result to GA+S VM, Therefore, the PSO + SVM approach is valuable for parameter determination and feature selection in an SVM.

...read moreread less

Abstract: Support vector machine (SVM) is a popular pattern classification method with many diverse applications. Kernel parameter setting in the SVM training procedure, along with the feature selection, significantly influences the classification accuracy. This study simultaneously determines the parameter values while discovering a subset of features, without reducing SVM classification accuracy. A particle swarm optimization (PSO) based approach for parameter determination and feature selection of the SVM, termed PSO+SVM, is developed. Several public datasets are employed to calculate the classification accuracy rate in order to evaluate the developed PSO+SVM approach. The developed approach was compared with grid search, which is a conventional method of searching parameter values, and other approaches. Experimental results demonstrate that the classification accuracy rates of the developed approach surpass those of grid search and many other approaches, and that the developed PSO+SVM approach has a similar result to GA+SVM. Therefore, the PSO+SVM approach is valuable for parameter determination and feature selection in an SVM.

...read moreread less

802 citations

Journal Article•DOI•

Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research

[...]

Stefan Lessmann¹, Bart Baesens², Bart Baesens³, Hsin-Vonn Seow⁴, Lyn C. Thomas² - Show less +1 more•Institutions (4)

Humboldt University of Berlin¹, University of Southampton², Catholic University of Leuven³, University of Nottingham Malaysia Campus⁴

16 Nov 2015-European Journal of Operational Research

TL;DR: The study of Baesens et al. (2003) is updated and several novel classification algorithms to the state-of-the-art in credit scoring are compared, providing an independent assessment of recent scoring methods and offering a new baseline to which future approaches can be compared.

...read moreread less

692 citations

Journal Article•DOI•

A distributed PSO-SVM hybrid system with feature selection and parameter optimization

[...]

Cheng-Lung Huang¹, Jian-Fan Dun²•Institutions (2)

National Kaohsiung First University of Science and Technology¹, Huafan University²

01 Sep 2008

TL;DR: Experimental results showed the proposed PSO-SVM model can correctly select the discriminating input features and also achieve high classification accuracy.

...read moreread less

Abstract: This study proposed a novel PSO-SVM model that hybridized the particle swarm optimization (PSO) and support vector machines (SVM) to improve the classification accuracy with a small and appropriate feature subset. This optimization mechanism combined the discrete PSO with the continuous-valued PSO to simultaneously optimize the input feature subset selection and the SVM kernel parameter setting. The hybrid PSO-SVM data mining system was implemented via a distributed architecture using the web service technology to reduce the computational time. In a heterogeneous computing environment, the PSO optimization was performed on the application server and the SVM model was trained on the client (agent) computer. The experimental results showed the proposed approach can correctly select the discriminating input features and also achieve high classification accuracy.

...read moreread less

499 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse