Open Access Journal Article

Nearest neighbor pattern classification

TL;DR
The nearest neighbor decision rule assigns to an unclassified sample point the classification of the nearest of a set of previously classified points; since its large-sample probability of error is at most twice the Bayes probability of error, it may be said that half the classification information in an infinite sample set is contained in the nearest neighbor.
Abstract
The nearest neighbor decision rule assigns to an unclassified sample point the classification of the nearest of a set of previously classified points. This rule is independent of the underlying joint distribution on the sample points and their classifications, and hence the probability of error $R$ of such a rule must be at least as great as the Bayes probability of error $R^{\ast}$, the minimum probability of error over all decision rules taking the underlying probability structure into account. However, in a large-sample analysis we show, for the $M$-category case, that $R^{\ast} \leq R \leq R^{\ast}\left(2 - \frac{MR^{\ast}}{M-1}\right)$, where these bounds are the tightest possible, for all suitably smooth underlying distributions. Thus for any number of categories, the probability of error of the nearest neighbor rule is bounded above by twice the Bayes probability of error. In this sense, it may be said that half the classification information in an infinite sample set is contained in the nearest neighbor.
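To make the rule and the bound concrete, here is a minimal sketch in Python, assuming Euclidean distance and NumPy; the function names (nn_classify, nn_error_upper_bound) and the example data are illustrative, not from the paper.

import numpy as np

def nn_classify(x, samples, labels):
    # Assign to the unclassified point x the label of the nearest
    # previously classified sample (the 1-NN rule from the abstract).
    dists = np.linalg.norm(samples - x, axis=1)  # distance to each stored sample
    return labels[np.argmin(dists)]

def nn_error_upper_bound(bayes_risk, num_categories):
    # Large-sample upper bound from the abstract:
    # R <= R*(2 - M R* / (M - 1)) for M categories.
    m = num_categories
    return bayes_risk * (2.0 - m * bayes_risk / (m - 1))

# Example: three classified points in the plane.
samples = np.array([[0.0, 0.0], [1.0, 1.0], [0.0, 1.0]])
labels = np.array([0, 1, 0])
print(nn_classify(np.array([0.9, 0.8]), samples, labels))  # -> 1 (nearest is [1, 1])

# With M = 2 categories and Bayes risk R* = 0.1, the bound gives
# 0.1 <= R <= 0.1 * (2 - 2 * 0.1 / 1) = 0.18, which is below 2 * R* = 0.2.
print(nn_error_upper_bound(0.1, 2))  # -> 0.18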


Citations
Book

Data Mining: Concepts and Techniques

TL;DR: This book presents dozens of algorithms and implementation examples, all in pseudo-code and suitable for use in real-world, large-scale data mining projects, and provides a comprehensive, practical look at the concepts and techniques you need to get the most out of real business data.
Book

Data Mining: Practical Machine Learning Tools and Techniques

TL;DR: This highly anticipated third edition of the most acclaimed work on data mining and machine learning will teach you everything you need to know about preparing inputs, interpreting outputs, evaluating results, and the algorithmic methods at the heart of successful data mining.
Journal Article

Classification and regression trees

TL;DR: This article gives an introduction to the subject of classification and regression trees by reviewing some widely available algorithms and comparing their capabilities, strengths, and weaknesses in two examples.
Book

Pattern Recognition with Fuzzy Objective Function Algorithms

TL;DR: This book develops pattern recognition methods based on fuzzy objective functions, most notably the fuzzy c-means clustering algorithm, together with questions of cluster validity.
Book

Pattern Recognition and Machine Learning

TL;DR: This book covers probability distributions, linear models for regression and classification, and methods for combining models in the context of machine learning.
References
Journal Article

The magical number seven, plus or minus two: some limits on our capacity for processing information

TL;DR: The theory of information provides a yardstick for calibrating stimulus materials and for measuring the performance of subjects, giving a quantitative way of getting at some of these questions.
Journal Article

A nonparametric estimate of a multivariate density function

TL;DR: This article proposes a nonparametric estimator of a multivariate probability density function and establishes its consistency, noting that density estimation has only recently begun to receive attention in the literature, particularly in the context of nonparametric discrimination.