Very Simple Classification Rules Perform Well on Most Commonly Used Datasets

doi:10.1023/A:1022631118932

Open AccessJournal ArticleDOI

Very Simple Classification Rules Perform Well on Most Commonly Used Datasets

Robert C. Holte

- 01 Apr 1993 -

Machine Learning

- Vol. 11, Iss: 1, pp 63-90

TLDR

On most datasets studied, the best of very simple rules that classify examples on the basis of a single attribute is as accurate as the rules induced by the majority of machine learning systems.

Abstract:

This article reports an empirical investigation of the accuracy of rules that classify examples on the basis of a single attribute. On most datasets studied, the best of these very simple rules is as accurate as the rules induced by the majority of machine learning systems. The article explores the implications of this finding for machine learning research and applications.

Citations

PDF

Open Access

More filters

Book

Data Mining: Practical Machine Learning Tools and Techniques

Ian H. Witten, +2 more

TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.

...read moreread less

Proceedings Article

Experiments with a new boosting algorithm

Yoav Freund, +1 more

TL;DR: This paper describes experiments carried out to assess how well AdaBoost with and without pseudo-loss, performs on real learning problems and compared boosting to Breiman's "bagging" method when used to aggregate various classifiers.

...read moreread less

Book

Pattern recognition and neural networks

Brian D. Ripley, +1 more

TL;DR: Professor Ripley brings together two crucial ideas in pattern recognition; statistical methods and machine learning via neural networks in this self-contained account.

...read moreread less

Book

Simple Heuristics That Make Us Smart

Gerd Gigerenzer, +1 more

TL;DR: Fast and frugal heuristics as discussed by the authors are simple rules for making decisions with realistic mental resources and can enable both living organisms and artificial systems to make smart choices, classifications, and predictions by employing bounded rationality.

...read moreread less

Posted Content

Principles of data mining

David J. Hand, +2 more

TL;DR: This paper gives a lightning overview of data mining and its relation to statistics, with particular emphasis on tools for the detection of adverse drug reactions.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Induction of Decision Trees

J. R. Quinlan

- 25 Mar 1986 -

Machine Learning

TL;DR: In this paper, an approach to synthesizing decision trees that has been used in a variety of systems, and it describes one such system, ID3, in detail, is described, and a reported shortcoming of the basic algorithm is discussed.

...read moreread less

Journal ArticleDOI

The CN2 Induction Algorithm

Peter Clark, +1 more

- 01 Mar 1989 -

Machine Learning

TL;DR: A description and empirical evaluation of a new induction system, CN2, designed for the efficient induction of simple, comprehensible production rules in domains where problems of poor description language and/or noise may be present.

...read moreread less

Journal ArticleDOI

Knowledge acquisition via incremental conceptual clustering

Douglas H. Fisher

- 01 Sep 1987 -

Machine Learning

TL;DR: COBWEB is a conceptual clustering system that organizes data so as to maximize inference ability, and is incremental and computationally economical, and thus can be flexibly applied in a variety of domains.

...read moreread less

Journal ArticleDOI

Computer-Intensive Methods in Statistics

Persi Diaconis, +1 more

- 01 Jun 1983 -

Scientific American

TL;DR: The bootstrap method is examined and evaluated as an example of this new generation of statistical tools that take advantage of the high speed digital computer and free the statistician to attack more complicated problems.

...read moreread less

Book ChapterDOI

Rule Induction with CN2: Some Recent Improvements

Peter Clark, +1 more

TL;DR: Improvements to the CN2 algorithm are described, including the use of the Laplacian error estimate as an alternative evaluation function and it is shown how unordered as well as ordered rules can be generated.

...read moreread less

Collapse

Very Simple Classification Rules Perform Well on Most Commonly Used Datasets

Citations

Data Mining: Practical Machine Learning Tools and Techniques

Experiments with a new boosting algorithm

Pattern recognition and neural networks

Simple Heuristics That Make Us Smart

Principles of data mining

References

Induction of Decision Trees

The CN2 Induction Algorithm

Knowledge acquisition via incremental conceptual clustering

Computer-Intensive Methods in Statistics

Rule Induction with CN2: Some Recent Improvements

Related Papers (5)

C4.5: Programs for Machine Learning

Induction of Decision Trees

Data Mining: Practical Machine Learning Tools and Techniques

Classification and Regression Trees.

Random Forests