Q2. How many measures were selected for each of the fuzzy-rough measures?
For each of the fuzzy-rough measures introduced, the authors ran QuickReduct once with α = 1, and a second time with a fixed α < 1; in particular, a value of α = 0.95 was deemed a suitable overall choice for most measures, except for g, which requires a much higher threshold, and for which α = 0.9999 was selected.
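The thresholded search described above can be sketched as a greedy loop that keeps adding the best-scoring attribute until the subset's measure reaches α times the full-set score. This is a minimal sketch, not the authors' implementation; the `evaluate` function stands in for any of the fuzzy-rough measures (γ, δ, f, g, …), and the toy additive measure below is purely illustrative.

```python
def quickreduct(attributes, evaluate, alpha=1.0):
    """Greedily grow a subset until its measure reaches alpha * full-set score."""
    target = alpha * evaluate(set(attributes))
    subset = set()
    while evaluate(subset) < target:
        # Pick the attribute whose addition maximizes the measure.
        best = max(
            (a for a in attributes if a not in subset),
            key=lambda a: evaluate(subset | {a}),
        )
        subset.add(best)
    return subset

# Hypothetical additive measure: each attribute contributes a fixed weight.
weights = {"a": 0.5, "b": 0.3, "c": 0.15, "d": 0.05}
measure = lambda s: sum(weights[x] for x in s)

print(sorted(quickreduct(weights, measure, alpha=0.95)))  # ['a', 'b', 'c']
print(sorted(quickreduct(weights, measure, alpha=1.0)))   # ['a', 'b', 'c', 'd']
```

With α = 0.95 the loop stops one attribute early, which is exactly how a threshold below 1 trades a small loss in measure value for a smaller subset.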
Q3. What is the way to interpret the results?
When interpreting these results, one should always keep in mind the trade-off between accuracy (RMSE) and attribute subset size: a higher accuracy (lower RMSE) is of course desirable, but so is a smaller subset size, i.e., the fewer conditional attributes the reduced data set contains, the stronger its generalization capacity.
Q4. What is the general definition of fuzzy decision reducts?
In the general definition that the authors propose, an increasing [0, 1]-valued measure is required, so as to guarantee that the larger an attribute subset, the higher its degree of fuzzy decision reducthood (monotonicity); this is in analogy with other approaches to defining a degree of approximating decision classes [43, 44].
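The monotonicity requirement can be checked exhaustively on small attribute sets: a measure M is increasing when A ⊆ B implies M(A) ≤ M(B). The sketch below is a hypothetical brute-force check with a toy [0, 1]-valued measure, not part of the authors' framework.

```python
from itertools import combinations

def is_monotone(attributes, measure):
    """Check that A <= B (subset) implies measure(A) <= measure(B) over all subsets."""
    subsets = [set(c) for r in range(len(attributes) + 1)
               for c in combinations(attributes, r)]
    return all(measure(a) <= measure(b)
               for a in subsets for b in subsets if a <= b)

# Toy [0, 1]-valued measure: total weight covered by the subset.
weights = {"a": 0.6, "b": 0.3, "c": 0.1}
increasing = lambda s: sum(weights[x] for x in s)
decreasing = lambda s: 1.0 - sum(weights[x] for x in s)

print(is_monotone(list(weights), increasing))  # True
print(is_monotone(list(weights), decreasing))  # False
```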
Q5. What are the evaluation measures in the previous subsections?
As the authors have shown, the evaluation measures γ, γ′, δ, δ′, f and g introduced in the previous subsections all give rise to corresponding fuzzy decision reducts.
Q6. How much reduction can be achieved with fuzzy decision reducts?
As seen in Figure 2b), if a 1% accuracy drop is permissible, fuzzy γ-decision reducts manage to reduce the subset size by over 40%, while with g a reduction of the data set by more than 63% is possible.
Q7. What is the effect of the heuristic on the and f subset?
This adversely affects QuickReduct’s operation: when all of the subsets considered in a given iteration evaluate to 0, the heuristic is forced to select one without any information about its true merit.
Q8. What is the default distance weighting for the K-nearest neighbour classifier?
In their experiments, the authors have used the very simple K-nearest neighbour classifier [1], implemented in Weka [53] as IBk, with default parameters (K = 1, no distance weighting).
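The classifier configuration above (K = 1, uniform weighting) amounts to labelling each query with the class of its single nearest training instance. The following is a minimal pure-Python sketch of that behaviour, not the Weka IBk implementation; the data are toy values.

```python
import math

def one_nn_predict(train, query):
    """Classify `query` by the label of its single nearest training point
    (K = 1, no distance weighting), mirroring IBk's default settings."""
    nearest = min(train, key=lambda pair: math.dist(pair[0], query))
    return nearest[1]

# Toy labelled data: (feature vector, class label).
train = [((0.0, 0.0), "neg"), ((1.0, 1.0), "pos"), ((0.9, 0.8), "pos")]
print(one_nn_predict(train, (0.1, 0.2)))  # neg
print(one_nn_predict(train, (0.8, 0.9)))  # pos
```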
Q9. What is the difference between the two experiments?
Their experiments clearly endorse the benefit of using fuzzy decision reducts, showing greater flexibility and a better potential to produce good-sized, high-quality attribute subsets than the crisp decision reducts that have been used so far in fuzzy-rough data analysis.