scispace - formally typeset
Journal ArticleDOI

Classification performance of mathematical programming techniques in discriminant analysis: Results for small and medium sample sizes

Reads0
Chats0
TLDR
In this paper, the performance of two widely used parametric statistical techniques (Fisher's linear discriminant function and Smith's quadratic function) and a class of recently proposed nonparametric estimation techniques based on mathematical programming (linear and mixed-integer programming) was investigated.
Abstract
The performance on small and medium-size samples of several techniques to solve the classification problem in discriminant analysis is investigated. The techniques considered are two widely used parametric statistical techniques (Fisher's linear discriminant function and Smith's quadratic function), and a class of recently proposed nonparametric estimation techniques based on mathematical programming (linear and mixed-integer programming). A simulation study is performed, analyzing the relative performance of the above techniques in the two-group case, for various small sample sizes, moderate group overlap and across six different data conditions. Training samples as well as validation samples are used to assess the classificatory performance of the techniques. The degree of group overlap and sample sizes selected for analysis in this paper are of interest in practice because they closely reflect conditions of many real data sets. The results of the experiment show that Smith's nonlinear quadratic function tends to be superior on the training samples and validation samples when the variances–covariances across groups are heterogeneous, while the mixed-integer technique performs best on the training samples when the variances–covariances are equal, and on validation samples with equal variances and discrete uniform independent variables. The mixed-integer technique and the quadratic discriminant function are also found to be more sensitive than the other techniques to the sample size, giving disproportionally inaccurate results on small samples.

read more

Citations
More filters
Journal ArticleDOI

Mathematical Programming Approaches for the Classification Problem in Two-Group Discriminant Analysis.

TL;DR: The authors introduce mathematical programming formulations as new approaches to solve the classification problem in discriminant analysis as powerful alternatives to the existing methods of maximizing correct classification of entities into groups.
Journal ArticleDOI

Nontraditional approaches to statistical classification: Some perspectives on L_p-norm methods

TL;DR: Proposals to integrate a number of the most important Lp-norm methods proposed to date within a unified framework are made, emphasizing their conceptual differences and similarities, rather than focusing on mathematical detail.
Journal ArticleDOI

Second order mathematical programming formulations for discriminant analysis

TL;DR: In this paper, a nonparametric formulation based on mathematical programming (MP) for solving the classification problem in discriminant analysis, which differs from previously proposed MP-based models in that, even though the final discriminant function is linear in terms of the parameters to be estimated, the formulation is quadratic in the predictor (attribute) variables.
Journal ArticleDOI

Discrimination and Classification.

TL;DR: The authors presents different approaches to discrimination and classification problems from a statistical perspective and provides computer projects concentrating on the most widely used and important algorithms, numerical examples, and theoretical questions reinforce to further develop the ideas introduced in the text.
Journal ArticleDOI

Neural networks analysis in business failure prediction of Chinese importers: A between-countries approach

TL;DR: The results show that the neural network models provide good classification capability in both cross-industry and industry-specific contexts and factor analysis is superior to stepwise discriminant analysis and ALL in terms of classification accuracy, generalization ability and error costs, while SDA provides the worst performance in all situations.
References
More filters
Book

Applied Multivariate Statistical Analysis

TL;DR: In this article, the authors present an overview of the basic concepts of multivariate analysis, including matrix algebra and random vectors, as well as a strategy for analyzing multivariate models.
Book

Applied Linear Statistical Models

TL;DR: Applied Linear Statistical Models 5e as discussed by the authors is the leading authoritative text and reference on statistical modeling, which includes brief introductory and review material, and then proceeds through regression and modeling for the first half, and through ANOVA and Experimental Design in the second half.
Journal ArticleDOI

Applied Multivariate Statistical Analysis.

TL;DR: In this article, the authors present an overview of the basic concepts of multivariate analysis, including matrix algebra and random vectors, as well as a strategy for analyzing multivariate models.
Related Papers (5)