Structural Minimax Probability Machine

doi:10.1109/TNNLS.2016.2544779

Open AccessJournal ArticleDOI

Structural Minimax Probability Machine

Bin Gu, +2 more

- 01 Jul 2017 -

IEEE Transactions on Neural Networks

- Vol. 28, Iss: 7, pp 1646-1656

Chats0

TLDR

This paper uses two finite mixture models to capture the structural information of the data from binary classification and proposes a structural MPM, which can be interpreted as a large margin classifier and can be transformed to support vector machine and maxi–min margin machine under certain special conditions.

Abstract:

Minimax probability machine (MPM) is an interesting discriminative classifier based on generative prior knowledge. It can directly estimate the probabilistic accuracy bound by minimizing the maximum probability of misclassification. The structural information of data is an effective way to represent prior knowledge, and has been found to be vital for designing classifiers in real-world problems. However, MPM only considers the prior probability distribution of each class with a given mean and covariance matrix, which does not efficiently exploit the structural information of data. In this paper, we use two finite mixture models to capture the structural information of the data from binary classification. For each subdistribution in a finite mixture model, only its mean and covariance matrix are assumed to be known. Based on the finite mixture models, we propose a structural MPM (SMPM). SMPM can be solved effectively by a sequence of the second-order cone programming problems. Moreover, we extend a linear model of SMPM to a nonlinear model by exploiting kernelization techniques. We also show that the SMPM can be interpreted as a large margin classifier and can be transformed to support vector machine and maxi–min margin machine under certain special conditions. Experimental results on both synthetic and real-world data sets demonstrate the effectiveness of SMPM.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Design of optimal CMOS ring oscillator using an intelligent optimization tool

Ali Mohammadi, +2 more

TL;DR: MOMIPO outperforms the best performance among other multi-objective algorithms in presented RO designing tool and creates a perfect trade-off between the contradictory objective functions in CMOS RO optimal design.

...read moreread less

Journal ArticleDOI

Strategies for data stream mining method applied in anomaly detection

Ruxia Sun, +4 more

- 01 Jun 2019 -

Cluster Computing

TL;DR: The properties of data stream make analysis method different from the method based on data set and the analysis model is required to be updated immediately when concept drift occurs, and the difference between data stream and data set is compared.

...read moreread less

Journal ArticleDOI

Confidence-weighted bias model for online collaborative filtering

Xiuze Zhou, +3 more

- 08 Jul 2017 -

Applied Soft Computing

TL;DR: A confidence-weighted bias model (CWBM) is proposed for online collaborative filtering (OCF) that adds bias into CF and further introduces confidence weights; thus, it can improve the stability and accuracy of OCF.

...read moreread less

Journal ArticleDOI

A real-time image forensics scheme based on multi-domain learning

Bin Yang, +2 more

- 01 Feb 2020 -

Journal of Real-time Image Processing

TL;DR: Experimental evaluation results show that MDL-CNN method can significantly improve the forensic performance and a multi-domain loss function is developed to enhance the recognition ability of in-depth learning features.

...read moreread less

Journal ArticleDOI

Wavelet Denoising of Vehicle Platform Vibration Signal Based on Threshold Neural Network

Mingzhu Li, +4 more

- 26 Jan 2017 -

Shock and Vibration

TL;DR: A method to denoise the VPVS based on the wavelet coefficients thresholding and threshold neural network (TNN) achieves better results, compared to the previous denoising methods using the indexes of SNR and RMSE.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

A tutorial on hidden Markov models and selected applications in speech recognition

Lawrence R. Rabiner

TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.

...read moreread less

Journal ArticleDOI

Hierarchical Grouping to Optimize an Objective Function

Joe H. Ward

- 01 Mar 1963 -

Journal of the American Statistical Asso...

TL;DR: In this paper, a procedure for forming hierarchical groups of mutually exclusive subsets, each of which has members that are maximally similar with respect to specified characteristics, is suggested for use in large-scale (n > 100) studies when a precise optimal solution for a specified number of groups is not practical.

...read moreread less

Book ChapterDOI

Neural Networks for Pattern Recognition

Suresh Kothari, +1 more

- 01 Jan 1993 -

Advances in Computers

TL;DR: The chapter discusses two important directions of research to improve learning algorithms: the dynamic node generation, which is used by the cascade correlation algorithm; and designing learning algorithms where the choice of parameters is not an issue.

...read moreread less

Journal ArticleDOI

Using SeDuMi 1.02, a MATLAB toolbox for optimization over symmetric cones

Jos F. Sturm

- 01 Jan 1999 -

Optimization Methods & Software

TL;DR: This paper describes how to work with SeDuMi, an add-on for MATLAB, which lets you solve optimization problems with linear, quadratic and semidefiniteness constraints by exploiting sparsity.

...read moreread less

Journal ArticleDOI

A comparison of methods for multiclass support vector machines

Hsu Chih-Wei, +1 more

- 01 Mar 2002 -

IEEE Transactions on Neural Networks

TL;DR: Decomposition implementations for two "all-together" multiclass SVM methods are given and it is shown that for large problems methods by considering all data at once in general need fewer support vectors.

...read moreread less