scispace - formally typeset
Open AccessJournal ArticleDOI

Structural Minimax Probability Machine

Reads0
Chats0
TLDR
This paper uses two finite mixture models to capture the structural information of the data from binary classification and proposes a structural MPM, which can be interpreted as a large margin classifier and can be transformed to support vector machine and maxi–min margin machine under certain special conditions.
Abstract
Minimax probability machine (MPM) is an interesting discriminative classifier based on generative prior knowledge. It can directly estimate the probabilistic accuracy bound by minimizing the maximum probability of misclassification. The structural information of data is an effective way to represent prior knowledge, and has been found to be vital for designing classifiers in real-world problems. However, MPM only considers the prior probability distribution of each class with a given mean and covariance matrix, which does not efficiently exploit the structural information of data. In this paper, we use two finite mixture models to capture the structural information of the data from binary classification. For each subdistribution in a finite mixture model, only its mean and covariance matrix are assumed to be known. Based on the finite mixture models, we propose a structural MPM (SMPM). SMPM can be solved effectively by a sequence of the second-order cone programming problems. Moreover, we extend a linear model of SMPM to a nonlinear model by exploiting kernelization techniques. We also show that the SMPM can be interpreted as a large margin classifier and can be transformed to support vector machine and maxi–min margin machine under certain special conditions. Experimental results on both synthetic and real-world data sets demonstrate the effectiveness of SMPM.

read more

Citations
More filters
Journal ArticleDOI

Design of optimal CMOS ring oscillator using an intelligent optimization tool

TL;DR: MOMIPO outperforms the best performance among other multi-objective algorithms in presented RO designing tool and creates a perfect trade-off between the contradictory objective functions in CMOS RO optimal design.
Journal ArticleDOI

Strategies for data stream mining method applied in anomaly detection

TL;DR: The properties of data stream make analysis method different from the method based on data set and the analysis model is required to be updated immediately when concept drift occurs, and the difference between data stream and data set is compared.
Journal ArticleDOI

Confidence-weighted bias model for online collaborative filtering

TL;DR: A confidence-weighted bias model (CWBM) is proposed for online collaborative filtering (OCF) that adds bias into CF and further introduces confidence weights; thus, it can improve the stability and accuracy of OCF.
Journal ArticleDOI

A real-time image forensics scheme based on multi-domain learning

TL;DR: Experimental evaluation results show that MDL-CNN method can significantly improve the forensic performance and a multi-domain loss function is developed to enhance the recognition ability of in-depth learning features.
Journal ArticleDOI

Wavelet Denoising of Vehicle Platform Vibration Signal Based on Threshold Neural Network

TL;DR: A method to denoise the VPVS based on the wavelet coefficients thresholding and threshold neural network (TNN) achieves better results, compared to the previous denoising methods using the indexes of SNR and RMSE.
References
More filters
Journal ArticleDOI

A tutorial on hidden Markov models and selected applications in speech recognition

TL;DR: In this paper, the authors provide an overview of the basic theory of hidden Markov models (HMMs) as originated by L.E. Baum and T. Petrie (1966) and give practical details on methods of implementation of the theory along with a description of selected applications of HMMs to distinct problems in speech recognition.
Journal ArticleDOI

Hierarchical Grouping to Optimize an Objective Function

TL;DR: In this paper, a procedure for forming hierarchical groups of mutually exclusive subsets, each of which has members that are maximally similar with respect to specified characteristics, is suggested for use in large-scale (n > 100) studies when a precise optimal solution for a specified number of groups is not practical.
Book ChapterDOI

Neural Networks for Pattern Recognition

TL;DR: The chapter discusses two important directions of research to improve learning algorithms: the dynamic node generation, which is used by the cascade correlation algorithm; and designing learning algorithms where the choice of parameters is not an issue.
Journal ArticleDOI

Using SeDuMi 1.02, a MATLAB toolbox for optimization over symmetric cones

TL;DR: This paper describes how to work with SeDuMi, an add-on for MATLAB, which lets you solve optimization problems with linear, quadratic and semidefiniteness constraints by exploiting sparsity.
Journal ArticleDOI

A comparison of methods for multiclass support vector machines

TL;DR: Decomposition implementations for two "all-together" multiclass SVM methods are given and it is shown that for large problems methods by considering all data at once in general need fewer support vectors.
Related Papers (5)