Instance-Based Learning Algorithms

doi:10.1023/A:1022689900470

Open AccessJournal ArticleDOI

Instance-Based Learning Algorithms

David W. Aha, +2 more

- 03 Jan 1991 -

Machine Learning

- Vol. 6, Iss: 1, pp 37-66

TLDR

This paper describes how storage requirements can be significantly reduced with, at most, minor sacrifices in learning rate and classification accuracy and extends the nearest neighbor algorithm, which has large storage requirements.

Abstract:

Storing and using specific instances improves the performance of several supervised learning algorithms. These include algorithms that learn decision trees, classification rules, and distributed networks. However, no investigation has analyzed algorithms that use only specific instances to solve incremental learning tasks. In this paper, we describe a framework and methodology, called instance-based learning, that generates classification predictions using only specific instances. Instance-based learning algorithms do not maintain a set of abstractions derived from specific instances. This approach extends the nearest neighbor algorithm, which has large storage requirements. We describe how storage requirements can be significantly reduced with, at most, minor sacrifices in learning rate and classification accuracy. While the storage-reducing algorithm performs well on several real-world databases, its performance degrades rapidly with the level of attribute noise in training instances. Therefore, we extended it with a significance test to distinguish noisy instances. This extended algorithm's performance degrades gracefully with increasing noise levels and compares favorably with a noise-tolerant decision tree algorithm.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

A Grey-Based Nearest Neighbor Approach for Missing Attribute Value Prediction

Chi-Chun Huang, +1 more

- 01 May 2004 -

Applied Intelligence

TL;DR: Experimental results indicate that the accuracy of classification is maintained or even increased when the proposed method is applied for missing attribute value prediction.

...read moreread less

Journal ArticleDOI

Trait-based risk assessment for invasive species: high performance across diverse taxonomic groups, geographic ranges and machine learning/statistical tools.

Reuben P. Keller, +2 more

- 01 May 2011 -

Diversity and Distributions

TL;DR: In this article, a range of statistical and machine learning algorithms were compared to determine the effects of data set size and scale, the algorithm used, and to determine overall performance of the trait-based risk assessment approach.

...read moreread less

Journal ArticleDOI

Application of machine learning to an early warning system for very short-term heavy rainfall.

Seung-Hyun Moon, +3 more

- 01 Jan 2019 -

Journal of Hydrology

TL;DR: A selective discretization method is devised that converts a subset of continuous input variables to nominal ones and works well on heavy rainfall nowcasting in terms of F-measure and equitable threat score.

...read moreread less

Journal ArticleDOI

Feature selection with redundancy-complementariness dispersion

Chen Zhijun, +6 more

- 01 Nov 2015 -

Knowledge Based Systems

TL;DR: A modification item concerning feature complementariness is introduced in the evaluation criterion of features and the redundancy-complementariness dispersion is taken into account to adjust the measurement of pairwise inter-correlation of features.

...read moreread less

Journal ArticleDOI

Automated classification based on video data at intersections with heavy pedestrian and bicycle traffic: Methodology and application

Sohail Zangenehpour, +2 more

- 01 Jul 2015 -

Transportation Research Part C-emerging ...

TL;DR: In this article, a method based on Histogram of Oriented Gradients (HOG) and Support Vector Machine (SVM) was proposed to classify moving objects in crowded traffic scenes.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Classification and Regression Trees.

John Van Ryzin, +4 more

- 01 Mar 1986 -

Journal of the American Statistical Asso...

Journal ArticleDOI

Induction of Decision Trees

J. R. Quinlan

- 25 Mar 1986 -

Machine Learning

TL;DR: In this paper, an approach to synthesizing decision trees that has been used in a variety of systems, and it describes one such system, ID3, in detail, is described, and a reported shortcoming of the basic algorithm is discussed.

...read moreread less

MonographDOI

Parallel Distributed Processing: Explorations in the Microstructure of Cognition: Foundations

David E. Rumelhart, +2 more

Book

Classification and regression trees

Leo Breiman

TL;DR: The methodology used to construct tree structured rules is the focus of a monograph as mentioned in this paper, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.

...read moreread less

Journal ArticleDOI

Nearest neighbor pattern classification

Thomas M. Cover, +1 more

- 01 Jan 1967 -

IEEE Transactions on Information Theory

TL;DR: The nearest neighbor decision rule assigns to an unclassified sample point the classification of the nearest of a set of previously classified points, so it may be said that half the classification information in an infinite sample set is contained in the nearest neighbor.

...read moreread less

Collapse

Instance-Based Learning Algorithms

Citations

A Grey-Based Nearest Neighbor Approach for Missing Attribute Value Prediction

Trait-based risk assessment for invasive species: high performance across diverse taxonomic groups, geographic ranges and machine learning/statistical tools.

Application of machine learning to an early warning system for very short-term heavy rainfall.

Feature selection with redundancy-complementariness dispersion

Automated classification based on video data at intersections with heavy pedestrian and bicycle traffic: Methodology and application

References

Classification and Regression Trees.

Induction of Decision Trees

Parallel Distributed Processing: Explorations in the Microstructure of Cognition: Foundations

Classification and regression trees

Nearest neighbor pattern classification

Related Papers (5)

C4.5: Programs for Machine Learning

Data Mining: Practical Machine Learning Tools and Techniques

Random Forests

Induction of Decision Trees

The WEKA data mining software: an update