Journal ArticleDOI

Rough set methods in feature selection and recognition

06 Mar 2003 - Pattern Recognition Letters (Elsevier Science Inc.) - Vol. 24, Iss. 6, pp. 833-849
TL;DR: The algorithm for feature selection is based on an application of a rough set method to the result of principal components analysis (PCA) used for feature projection and reduction.
About: This article was published in Pattern Recognition Letters on 2003-03-06 and has received 801 citations to date. It focuses on the topics: Feature (computer vision) & Dimensionality reduction.
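
The method the TL;DR describes lends itself to a compact illustration: project the patterns with PCA, discretize the principal components, then search for a small attribute subset (a "reduct") that preserves the rough-set positive region. The sketch below is a generic greedy heuristic under assumed data layouts, not the authors' exact algorithm; the names are mine.

```python
# A minimal sketch of the recipe in the TL;DR, under assumed data layouts:
# PCA for projection, crude binning for discretization, then a greedy search
# for a small attribute subset that preserves the rough-set positive region.
import numpy as np
from sklearn.decomposition import PCA

def positive_region_size(X_disc, y):
    """Count objects whose indiscernibility class (identical rows) maps to a
    single decision value -- the size of the rough-set positive region."""
    labels_of, counts = {}, {}
    for row, label in zip(map(tuple, X_disc), y):
        labels_of.setdefault(row, set()).add(label)
        counts[row] = counts.get(row, 0) + 1
    return sum(n for row, n in counts.items() if len(labels_of[row]) == 1)

def greedy_reduct(X_disc, y):
    """Forward-select attributes until the positive region stops growing."""
    full = positive_region_size(X_disc, y)
    chosen, remaining, best = [], list(range(X_disc.shape[1])), 0
    while remaining and best < full:
        score, j = max((positive_region_size(X_disc[:, chosen + [k]], y), k)
                       for k in remaining)
        if score <= best:
            break
        best = score
        chosen.append(j)
        remaining.remove(j)
    return chosen

# usage on synthetic data: project, discretize, select components
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))
y = (X[:, 0] + X[:, 1] > 0).astype(int)
Z = PCA(n_components=6).fit_transform(X)
Z_disc = np.digitize(Z, np.quantile(Z, [0.25, 0.5, 0.75]))  # crude global bins
print("selected principal components:", greedy_reduct(Z_disc, y))
```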
Citations
Journal ArticleDOI
TL;DR: This study discusses several frequently used evaluation measures for feature selection and surveys supervised, unsupervised, and semi-supervised feature selection methods, which are widely applied in machine learning problems such as classification and clustering.

1,057 citations


Cites background from "Rough set methods in feature selection and recognition"

  • ...According to the theoretical principle, feature selection methods can be based on statistics [35-39], information theory [40-45], manifold [46-48], and rough set [49-53], and can be...


Proceedings ArticleDOI
01 Jun 2008
TL;DR: A novel filter bank common spatial pattern (FBCSP) is proposed to perform autonomous selection of key temporal-spatial discriminative EEG characteristics, and the study shows that FBCSP, using a particular combination of feature selection and classification algorithms, yields higher cross-validation accuracies than prevailing approaches.
Abstract: In motor imagery-based brain computer interfaces (BCI), discriminative patterns can be extracted from the electroencephalogram (EEG) using the common spatial pattern (CSP) algorithm. However, the performance of this spatial filter depends on the operational frequency band of the EEG. Thus, setting a broad frequency range or manually selecting a subject-specific frequency range is common practice with the CSP algorithm. To address this problem, this paper proposes a novel filter bank common spatial pattern (FBCSP) to perform autonomous selection of key temporal-spatial discriminative EEG characteristics. After the EEG measurements have been bandpass-filtered into multiple frequency bands, CSP features are extracted from each of these bands. A feature selection algorithm is then used to automatically select discriminative pairs of frequency bands and corresponding CSP features. A classification algorithm is subsequently used to classify the CSP features. A study is conducted to assess the performance of a selection of feature selection and classification algorithms for use with the FBCSP. Extensive experimental results are presented on a publicly available dataset as well as data collected from healthy subjects and unilaterally paralyzed stroke patients. The results show that FBCSP, using a particular combination of feature selection and classification algorithms, yields higher cross-validation accuracies than prevailing approaches.
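
The pipeline in this abstract can be sketched compactly: band-pass filter the EEG into sub-bands, extract CSP log-variance features per band, rank the pooled features by mutual information, and classify. Everything below (band edges, filter order, the number of CSP pairs, LDA as the classifier) is an illustrative assumption rather than the paper's exact configuration.

```python
# A hedged sketch of the FBCSP pipeline described above.
import numpy as np
from scipy.signal import butter, filtfilt
from sklearn.feature_selection import mutual_info_classif
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

def csp_filters(trials_a, trials_b, n_pairs=2):
    """CSP spatial filters from two classes of (n_trials, n_ch, n_samp) data,
    via whitening plus eigendecomposition of the whitened class covariance."""
    cov = lambda T: np.mean([t @ t.T / np.trace(t @ t.T) for t in T], axis=0)
    Ca, Cb = cov(trials_a), cov(trials_b)
    vals, vecs = np.linalg.eigh(Ca + Cb)
    P = vecs @ np.diag(vals ** -0.5) @ vecs.T          # whitening transform
    w, V = np.linalg.eigh(P @ Ca @ P.T)
    W = (V.T @ P)[np.argsort(w)]                       # filters sorted by eigenvalue
    return W[list(range(n_pairs)) + list(range(-n_pairs, 0))]

def log_var_features(W, trials):
    """Normalized log-variance of each CSP-projected trial."""
    feats = []
    for t in trials:
        v = np.var(W @ t, axis=1)
        feats.append(np.log(v / v.sum()))
    return np.array(feats)

def fbcsp_features(trials, labels, bands, fs=250):
    feats = []
    for lo, hi in bands:
        b, a = butter(4, [lo / (fs / 2), hi / (fs / 2)], btype="band")
        filt = filtfilt(b, a, trials, axis=-1)
        W = csp_filters(filt[labels == 0], filt[labels == 1])
        feats.append(log_var_features(W, filt))
    return np.hstack(feats)

# toy usage: 40 trials, 8 channels, 2 s at 250 Hz, four assumed sub-bands
rng = np.random.default_rng(1)
trials = rng.normal(size=(40, 8, 500))
labels = rng.integers(0, 2, 40)
F = fbcsp_features(trials, labels, [(4, 8), (8, 12), (12, 16), (16, 20)])
mi = mutual_info_classif(F, labels)
top = np.argsort(mi)[-4:]                  # keep the four best CSP features
clf = LinearDiscriminantAnalysis().fit(F[:, top], labels)
```

In the study itself the selection and classification stages are evaluated with cross-validation; fitting them on the full data, as in this toy, would overstate accuracy.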

991 citations


Cites background from "Rough set methods in feature selection and recognition"

  • ...In addition to the prevailing application of MI in feature selection, Rough set theory (RST) [21] is also potentially feasible in feature selection and can significantly reduce the pattern dimensionality [22]....


Journal ArticleDOI
TL;DR: Methods based on the combination of rough sets and Boolean reasoning with applications in pattern recognition, machine learning, data mining and conflict analysis are discussed.
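
The canonical construction combining rough sets with Boolean reasoning is the discernibility matrix, whose prime implicants are exactly the reducts. Below is a brute-force sketch on a toy decision table; the data layout and exhaustive search are assumptions for illustration only.

```python
# Reducts as minimal hitting sets of the discernibility matrix (brute force).
from itertools import combinations

def discernibility_matrix(X, y):
    """Entries: attribute sets distinguishing objects with different labels."""
    n, d = len(X), len(X[0])
    return [frozenset(a for a in range(d) if X[i][a] != X[j][a])
            for i, j in combinations(range(n), 2) if y[i] != y[j]]

def reducts(X, y):
    """All minimal attribute subsets hitting every discernibility entry."""
    entries = [e for e in discernibility_matrix(X, y) if e]
    d, found = len(X[0]), []
    for k in range(1, d + 1):
        for subset in combinations(range(d), k):
            s = set(subset)
            if all(s & e for e in entries) and not any(set(r) <= s for r in found):
                found.append(subset)
    return found

X = [(0, 1, 0), (0, 1, 1), (1, 0, 0)]
y = [0, 1, 1]
print(reducts(X, y))   # minimal subsets that preserve discernibility
```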

940 citations

Journal ArticleDOI
TL;DR: A new feature selection strategy based on rough sets and particle swarm optimization (PSO) is proposed; it does not need complex operators such as crossover and mutation, requires only primitive and simple mathematical operators, and is computationally inexpensive in terms of both memory and runtime.
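
A hedged sketch in the spirit of this TL;DR: particles are bitmasks over attributes, the fitness rewards the rough-set dependency degree while penalizing subset size, and the update uses only additions, multiplications, and a sigmoid, with no crossover or mutation. The constants and the fitness weighting (alpha) below are illustrative assumptions, not the paper's settings.

```python
# Binary PSO over attribute subsets with a rough-set dependency fitness.
import numpy as np

def dependency(X_disc, y, mask):
    """Fraction of objects in the positive region of the masked attributes."""
    cols = np.flatnonzero(mask)
    if cols.size == 0:
        return 0.0
    groups = {}
    for row, label in zip(map(tuple, X_disc[:, cols]), y):
        groups.setdefault(row, []).append(label)
    consistent = sum(len(g) for g in groups.values() if len(set(g)) == 1)
    return consistent / len(y)

def pso_select(X_disc, y, n_particles=20, iters=50, alpha=0.9, seed=0):
    rng = np.random.default_rng(seed)
    d = X_disc.shape[1]
    fit = lambda m: alpha * dependency(X_disc, y, m) + (1 - alpha) * (1 - m.mean())
    pos = (rng.random((n_particles, d)) < 0.5).astype(float)   # bitmasks
    vel = rng.normal(scale=0.1, size=(n_particles, d))
    pbest = pos.copy()
    pbest_f = np.array([fit(p) for p in pos])
    gbest = pbest[pbest_f.argmax()].copy()
    for _ in range(iters):
        r1, r2 = rng.random((2, n_particles, d))
        vel = 0.7 * vel + 1.5 * r1 * (pbest - pos) + 1.5 * r2 * (gbest - pos)
        # sigmoid of velocity gives the probability of selecting each bit
        pos = (rng.random((n_particles, d)) < 1 / (1 + np.exp(-vel))).astype(float)
        f = np.array([fit(p) for p in pos])
        improved = f > pbest_f
        pbest[improved], pbest_f[improved] = pos[improved], f[improved]
        gbest = pbest[pbest_f.argmax()].copy()
    return np.flatnonzero(gbest)

# usage: attributes 0 and 1 fully determine y, so they should be selected
rng = np.random.default_rng(3)
X_disc = rng.integers(0, 3, (100, 8))
y = (X_disc[:, 0] + X_disc[:, 1]) % 2
print("selected attributes:", pso_select(X_disc, y))
```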

794 citations


Cites background from "Rough set methods in feature selection and recognition"


  • ...The optimal criterion for rough set feature selection is to find shortest or minimal reducts while obtaining high quality classifiers based on the selected features (Swiniarski and Skowron, 2003)....


Journal ArticleDOI
TL;DR: A neighborhood rough set model is introduced to deal with the problem of heterogeneous feature subset selection; experimental results show that the neighborhood-model-based method is more flexible in dealing with heterogeneous data.
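
The neighborhood idea can be illustrated directly: a sample's neighborhood contains the points that match it on every categorical attribute and lie within delta on the numeric ones, and a sample belongs to the positive region when its whole neighborhood shares its decision. The metric (Chebyshev), delta, and data layout below are assumptions of this sketch, not the paper's definitions.

```python
# Neighborhood rough set positive region for mixed numeric/categorical data.
import numpy as np

def neighborhood_positive_region(X_num, X_cat, y, delta=0.2):
    pos = []
    for i in range(len(y)):
        same_cat = np.all(X_cat == X_cat[i], axis=1)
        near = np.max(np.abs(X_num - X_num[i]), axis=1) <= delta  # Chebyshev
        nbhd = same_cat & near
        if np.all(y[nbhd] == y[i]):   # neighborhood consistent with decision
            pos.append(i)
    return pos

# dependency degree of the mixed attribute set = |positive region| / |U|
rng = np.random.default_rng(2)
X_num = rng.random((50, 2))
X_cat = rng.integers(0, 2, (50, 1))
y = (X_num[:, 0] > 0.5).astype(int)
pos = neighborhood_positive_region(X_num, X_cat, y)
print("dependency degree:", len(pos) / len(y))
```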

780 citations


Additional excerpts

  • ...Rough set theory, proposed by Pawlak [24], has been proven to be an effective tool for feature selection, rule extraction and knowledge discovery from categorical data in recent years [2,8,25,28,29,32,33,44]....


References
Book
15 Oct 1992
TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and overfitting.
Abstract: From the Publisher: Classifier systems play a major role in machine learning and knowledge-based systems, and Ross Quinlan's work on ID3 and C4.5 is widely acknowledged to have made some of the most significant contributions to their development. This book is a complete guide to the C4.5 system as implemented in C for the UNIX environment. It contains a comprehensive guide to the system's use, the source code (about 8,800 lines), and implementation notes. The source code and sample datasets are also available on a 3.5-inch floppy diskette for a Sun workstation. C4.5 starts with large sets of cases belonging to known classes. The cases, described by any mixture of nominal and numeric properties, are scrutinized for patterns that allow the classes to be reliably discriminated. These patterns are then expressed as models, in the form of decision trees or sets of if-then rules, that can be used to classify new cases, with emphasis on making the models understandable as well as accurate. The system has been applied successfully to tasks involving tens of thousands of cases described by hundreds of properties. The book starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and overfitting. Advantages and disadvantages of the C4.5 approach are discussed and illustrated with several case studies. This book and software should be of interest to developers of classification-based intelligent systems and to students in machine learning and expert systems courses.
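
At the heart of the C4.5-style tree induction described above is the gain-ratio split criterion: information gain normalized by the entropy of the split itself, which curbs the bias toward many-valued attributes. A textbook sketch follows; this is an illustration, not code from the book.

```python
# Gain-ratio criterion for choosing a nominal attribute to split on.
import math
from collections import Counter

def entropy(labels):
    n = len(labels)
    return -sum(c / n * math.log2(c / n) for c in Counter(labels).values())

def gain_ratio(rows, labels, attr):
    """rows: list of dicts of nominal attributes; labels: class per row."""
    n = len(rows)
    parts = {}
    for row, label in zip(rows, labels):
        parts.setdefault(row[attr], []).append(label)
    remainder = sum(len(p) / n * entropy(p) for p in parts.values())
    split_info = -sum(len(p) / n * math.log2(len(p) / n) for p in parts.values())
    gain = entropy(labels) - remainder
    return gain / split_info if split_info > 0 else 0.0

rows = [{"outlook": "sunny"}, {"outlook": "sunny"}, {"outlook": "rain"}]
print(gain_ratio(rows, ["no", "no", "yes"], "outlook"))  # -> 1.0
```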

21,674 citations


"Rough set methods in feature select..." refers background or methods in this paper

  • ...$C(X)$ called a positive region of the partition U


  • ...Let us only mention that several methods of feature selection are inherently built in a predictor design procedure (Quinlan, 1993) and some methods of feature selection merge feature extraction with feature selection....


Book
01 Jan 1995
TL;DR: This is the first comprehensive treatment of feed-forward neural networks from the perspective of statistical pattern recognition, and is designed as a text, with over 100 exercises, to benefit anyone involved in the fields of neural computation and pattern recognition.
Abstract: From the Publisher: This is the first comprehensive treatment of feed-forward neural networks from the perspective of statistical pattern recognition. After introducing the basic concepts, the book examines techniques for modelling probability density functions and the properties and merits of the multi-layer perceptron and radial basis function network models. Also covered are various forms of error functions, principal algorithms for error function minimization, learning and generalization in neural networks, and Bayesian techniques and their applications. Designed as a text, with over 100 exercises, this fully up-to-date work will benefit anyone involved in the fields of neural computation and pattern recognition.

19,056 citations

Book ChapterDOI
TL;DR: The chapter discusses two important directions of research to improve learning algorithms: the dynamic node generation, which is used by the cascade correlation algorithm; and designing learning algorithms where the choice of parameters is not an issue.
Abstract: Publisher Summary This chapter provides an account of different neural network architectures for pattern recognition. A neural network consists of several simple processing elements called neurons. Each neuron is connected to some other neurons and possibly to the input nodes. Neural networks provide a simple computing paradigm to perform complex recognition tasks in real time. The chapter categorizes neural networks into three types: single-layer networks, multilayer feedforward networks, and feedback networks. It discusses the gradient descent and the relaxation method as the two underlying mathematical themes for deriving learning algorithms. A lot of research activity is centered on learning algorithms because of their fundamental importance in neural networks. The chapter discusses two important directions of research to improve learning algorithms: the dynamic node generation, which is used by the cascade correlation algorithm; and designing learning algorithms where the choice of parameters is not an issue. It closes with the discussion of performance and implementation issues.

13,033 citations


"Rough set methods in feature select..." refers methods in this paper

  • ...Let us define now the following two operations on sets: $\underline{B}(X) = \{x \in U : B(x) \subseteq X\}$ and $\overline{B}(X) = \{x \in U : B(x) \cap X \neq \emptyset\}$, assigning to every subset $X$ of the universe $U$ two sets $\underline{B}(X)$ and $\overline{B}(X)$, called the B-lower and the B-upper approximation of $X$, respectively....


  • ...Feature selection methods consist of two main streams (Duda and Hart, 1973; Fukunaga, 1990; Bishop, 1995; John et al., 1994): open-loop methods and closed-loop methods....


  • ...We have applied PCA, with the resulting KLT (Duda and Hart, 1973; Bishop, 1995), for the orthonormal projection (and reduction) of reduced SVD patterns xsvd;r representing recognized face images....

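The B-lower and B-upper approximations quoted in the first excerpt above compute directly from the B-indiscernibility classes: collect the classes contained in X (lower approximation) and those intersecting X (upper approximation). A minimal sketch with an assumed object/attribute layout:

```python
# Lower and upper approximations of a set X with respect to attributes B.
from collections import defaultdict

def approximations(objects, B, X):
    """objects: name -> attribute dict; B: attribute names; X: set of names."""
    classes = defaultdict(set)
    for name, attrs in objects.items():
        classes[tuple(attrs[a] for a in B)].add(name)
    lower, upper = set(), set()
    for cls in classes.values():
        if cls <= X:        # whole class inside X -> B-lower approximation
            lower |= cls
        if cls & X:         # class meets X        -> B-upper approximation
            upper |= cls
    return lower, upper

objs = {"u1": {"color": "red", "size": 1},
        "u2": {"color": "red", "size": 1},
        "u3": {"color": "blue", "size": 2}}
print(approximations(objs, ["color", "size"], {"u1", "u3"}))
# -> ({'u3'}, {'u1', 'u2', 'u3'})
```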

Book
01 Jan 1972
TL;DR: This completely revised second edition presents an introduction to statistical pattern recognition, which is appropriate as a text for introductory courses in pattern recognition and as a reference book for workers in the field.
Abstract: This completely revised second edition presents an introduction to statistical pattern recognition. Pattern recognition in general covers a wide range of problems: it is applied to engineering problems, such as character readers and waveform analysis, as well as to brain modeling in biology and psychology. Statistical decision and estimation, which are the main subjects of this book, are regarded as fundamental to the study of pattern recognition. This book is appropriate as a text for introductory courses in pattern recognition and as a reference book for workers in the field. Each chapter contains computer projects as well as exercises.

10,526 citations

Book
31 Oct 1991
TL;DR: Theoretical Foundations.
Abstract: Table of contents:
I. Theoretical Foundations
1. Knowledge: 1.1. Introduction. 1.2. Knowledge and Classification. 1.3. Knowledge Base. 1.4. Equivalence, Generalization and Specialization of Knowledge.
2. Imprecise Categories, Approximations and Rough Sets: 2.1. Introduction. 2.2. Rough Sets. 2.3. Approximations of Set. 2.4. Properties of Approximations. 2.5. Approximations and Membership Relation. 2.6. Numerical Characterization of Imprecision. 2.7. Topological Characterization of Imprecision. 2.8. Approximation of Classifications. 2.9. Rough Equality of Sets. 2.10. Rough Inclusion of Sets.
3. Reduction of Knowledge: 3.1. Introduction. 3.2. Reduct and Core of Knowledge. 3.3. Relative Reduct and Relative Core of Knowledge. 3.4. Reduction of Categories. 3.5. Relative Reduct and Core of Categories.
4. Dependencies in Knowledge Base: 4.1. Introduction. 4.2. Dependency of Knowledge. 4.3. Partial Dependency of Knowledge.
5. Knowledge Representation: 5.1. Introduction. 5.2. Examples. 5.3. Formal Definition. 5.4. Significance of Attributes. 5.5. Discernibility Matrix.
6. Decision Tables: 6.1. Introduction. 6.2. Formal Definition and Some Properties. 6.3. Simplification of Decision Tables.
7. Reasoning about Knowledge: 7.1. Introduction. 7.2. Language of Decision Logic. 7.3. Semantics of Decision Logic Language. 7.4. Deduction in Decision Logic. 7.5. Normal Forms. 7.6. Decision Rules and Decision Algorithms. 7.7. Truth and Indiscernibility. 7.8. Dependency of Attributes. 7.9. Reduction of Consistent Algorithms. 7.10. Reduction of Inconsistent Algorithms. 7.11. Reduction of Decision Rules. 7.12. Minimization of Decision Algorithms.
II. Applications
8. Decision Making: 8.1. Introduction. 8.2. Optician's Decisions Table. 8.3. Simplification of Decision Table. 8.4. Decision Algorithm. 8.5. The Case of Incomplete Information.
9. Data Analysis: 9.1. Introduction. 9.2. Decision Table as Protocol of Observations. 9.3. Derivation of Control Algorithms from Observation. 9.4. Another Approach. 9.5. The Case of Inconsistent Data.
10. Dissimilarity Analysis: 10.1. Introduction. 10.2. The Middle East Situation. 10.3. Beauty Contest. 10.4. Pattern Recognition. 10.5. Buying a Car.
11. Switching Circuits: 11.1. Introduction. 11.2. Minimization of Partially Defined Switching Functions. 11.3. Multiple-Output Switching Functions.
12. Machine Learning: 12.1. Introduction. 12.2. Learning From Examples. 12.3. The Case of an Imperfect Teacher. 12.4. Inductive Learning.
Each chapter closes with a Summary, Exercises, and References.

7,826 citations