Journal Article

Minimum Cross-Entropy Pattern Classification and Cluster Analysis

TLDR
The approach, based on the minimization of cross-entropy, is a generalization of a recently developed speech coding technique called speech coding by vector quantization, and can be viewed as a refinement of a general classification method due to Kullback.
Abstract
This paper considers the problem of classifying an input vector of measurements by a nearest neighbor rule applied to a fixed set of vectors. The fixed vectors are sometimes called characteristic feature vectors, codewords, cluster centers, models, reproductions, etc. The nearest neighbor rule considered uses a non-Euclidean information-theoretic distortion measure that is not a metric, but that nevertheless leads to a classification method that is optimal in a well-defined sense and is also computationally attractive. Furthermore, the distortion measure results in a simple method of computing cluster centroids. Our approach is based on the minimization of cross-entropy (also called discrimination information, directed divergence, K-L number), and can be viewed as a refinement of a general classification method due to Kullback. The refinement exploits special properties of cross-entropy that hold when the probability densities involved happen to be minimum cross-entropy densities. The approach is a generalization of a recently developed speech coding technique called speech coding by vector quantization.
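
To make the rule concrete, here is a minimal sketch (an illustration, not the authors' exact formulation): inputs and codewords are modeled as discrete probability vectors, the distortion is the cross-entropy D(q || p), and the cluster centroid under this distortion reduces to a normalized arithmetic mean. The function names, the argument-order convention D(input || codeword), and the smoothing constant eps are assumptions of the sketch.

```python
import numpy as np

def cross_entropy_distortion(q, p, eps=1e-12):
    """D(q || p) = sum_i q_i * log(q_i / p_i) between two discrete
    probability vectors. Asymmetric and not a metric."""
    q, p = np.asarray(q, float), np.asarray(p, float)
    return float(np.sum(q * np.log((q + eps) / (p + eps))))

def classify(q, codewords):
    """Nearest-neighbor rule: pick the codeword with the smallest
    cross-entropy distortion from the input vector q."""
    return min(range(len(codewords)),
               key=lambda j: cross_entropy_distortion(q, codewords[j]))

def centroid(cluster):
    """Minimizing sum_k D(q_k || p) over p (with p summing to one)
    gives the normalized arithmetic mean of the cluster members,
    which is why centroid computation is simple here."""
    c = np.asarray(cluster, float).mean(axis=0)
    return c / c.sum()
```

Because the distortion is asymmetric, the order of its arguments matters; this sketch fixes one convention for illustration only.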

Citations
Journal Article

Vector quantization

TL;DR: During the past few years, several design algorithms have been developed for a variety of vector quantizers, and the performance of these codes has been studied for speech waveforms, speech linear predictive parameter vectors, images, and several simulated random processes.
Journal Article

Distance measures for signal processing and pattern recognition

TL;DR: Some classical results about error bounds in classification and feature selection for pattern recognition, obtained with the aid of properties of distance measures, are recalled.
Journal Article

On measuring the distance between histograms

TL;DR: The proposed distance measure has an advantage over traditional distance measures with respect to the overlap between two distributions: it takes into account the similarity of the non-overlapping parts as well as that of the overlapping parts.
Journal Article

Properties of cross-entropy minimization

TL;DR: The principle of minimum cross-entropy (minimum directed divergence, minimum discrimination information) is a general method of inference about an unknown probability density when there exists a prior estimate of the density and new information in the form of constraints on expected values.
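
In symbols (a standard statement of the principle, not a quotation from the paper): given a prior estimate $p(x)$ and new information in the form of expected-value constraints, the posterior estimate is

$$q^* = \arg\min_{q} \int q(x) \log \frac{q(x)}{p(x)}\, dx \quad \text{subject to} \quad \int q(x) f_k(x)\, dx = a_k, \; k = 1, \dots, m,$$

and, when it exists, the minimizer has the exponential form $q^*(x) = p(x)\exp\!\big(-\lambda_0 - \sum_k \lambda_k f_k(x)\big)$, with the multipliers $\lambda_k$ chosen to satisfy the constraints.
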
Journal Article

Optimal partitioning for classification and regression trees

TL;DR: An iterative algorithm is presented that finds a locally optimal partition for an arbitrary loss function, in time linear in N for each iteration, and it is proven that the globally optimal partition must satisfy a nearest-neighbor condition using divergence as the distance measure.
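
Stated roughly, with notation assumed here rather than taken from the paper: if $p(\cdot\,|\,x)$ denotes the vector of class probabilities given input $x$ and $c_j$ is the centroid of cell $S_j$, a globally optimal partition must satisfy

$$S_j \subseteq \{\, x : D(p(\cdot\,|\,x) \,\|\, c_j) \le D(p(\cdot\,|\,x) \,\|\, c_k) \ \text{for all } k \,\},$$

the same nearest-neighbor structure, with divergence in place of Euclidean distance, that appears in the classification rule above.
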
References

Some methods for classification and analysis of multivariate observations

TL;DR: The k-means algorithm presented in this paper partitions an N-dimensional population into k sets on the basis of a sample; the procedure, which generalizes the ordinary sample mean, is shown to give partitions that are reasonably efficient in the sense of within-class variance.
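
A minimal sketch of the iteration (plain NumPy; the random initialization and the convergence test are simplifying assumptions, not details from the paper):

```python
import numpy as np

def k_means(X, k, iters=100, seed=0):
    """Alternate between assigning each sample to its nearest center
    (Euclidean distance) and moving each center to the mean of the
    samples assigned to it."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        # Assignment step: nearest center for every sample.
        dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Update step: recompute centers; keep a center unchanged if
        # its cluster happens to be empty.
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers, labels
```
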
Journal Article

Information Theory and Statistical Mechanics. II

TL;DR: In this article, the author considers statistical mechanics as a form of statistical inference rather than as a physical theory, and shows that the usual computational rules, starting with the determination of the partition function, are an immediate consequence of the maximum-entropy principle.
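
In the standard formulation (a textbook statement, not a quotation from the article): maximizing entropy subject to constraints $\langle f_k \rangle = a_k$ yields

$$p_i = \frac{1}{Z(\lambda)} \exp\!\Big(-\sum_k \lambda_k f_k(x_i)\Big), \qquad Z(\lambda) = \sum_i \exp\!\Big(-\sum_k \lambda_k f_k(x_i)\Big),$$

and the expectations follow from the partition function via $\langle f_k \rangle = -\partial \log Z / \partial \lambda_k$, which is the sense in which the usual computational rules of statistical mechanics follow from the maximum-entropy principle.
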
Journal Article

An Algorithm for Vector Quantizer Design

TL;DR: An efficient and intuitive algorithm is presented for the design of vector quantizers based either on a known probabilistic model or on a long training sequence of data.
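
The algorithm alternates between two necessary conditions for optimality, stated here in their standard form (not quoted from the paper): given a codebook $\{y_j\}$ and a distortion measure $d$,

$$P_j = \{\, x : d(x, y_j) \le d(x, y_k) \ \text{for all } k \,\} \qquad \text{(best partition for a fixed codebook)},$$
$$y_j = \arg\min_y \, E\big[\, d(X, y) \mid X \in P_j \,\big] \qquad \text{(best codeword, the centroid, for a fixed partition)},$$

so each iteration cannot increase the average distortion, whether the design starts from a probabilistic model or from a long training sequence.
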
Journal Article

Axiomatic derivation of the principle of maximum entropy and the principle of minimum cross-entropy

TL;DR: Jaynes's principle of maximum entropy and Kullback's principle of minimum cross-entropy (minimum directed divergence) are shown to be uniquely correct methods for inductive inference when new information is given in the form of expected values.