
Showing papers on "Sequential minimal optimization" published in 2006


Journal ArticleDOI
TL;DR: This paper presents and compares the four R implementations of support vector machines, which are among the most popular and efficient classification and regression methods currently available.
Abstract: Being among the most popular and efficient classification and regression methods currently available, implementations of support vector machines exist in almost every popular programming language. Currently four R packages contain SVM-related software. The purpose of this paper is to present and compare these implementations.

576 citations


Journal ArticleDOI
TL;DR: The main results include a simple asymptotic convergence proof, a general explanation of the shrinking and caching techniques, and the linear convergence of the methods.
Abstract: Decomposition methods are currently one of the major methods for training support vector machines. They vary mainly according to different working set selections. Existing implementations and analysis usually consider some specific selection rules. This paper studies sequential minimal optimization type decomposition methods under a general and flexible way of choosing the two-element working set. The main results include: 1) a simple asymptotic convergence proof, 2) a general explanation of the shrinking and caching techniques, and 3) the linear convergence of the methods. Extensions to some support vector machine variants are also discussed.

302 citations
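To make the two-element working set concrete: each outer iteration of an SMO-type method solves the dual analytically over one pair of variables. Below is a minimal NumPy sketch of the classic Platt-style pair update (the function name, error-cache convention, and surrounding bookkeeping are illustrative assumptions, not taken from this paper):

```python
import numpy as np

def smo_pair_update(alpha, i, j, K, y, E, C):
    """One analytic SMO step on the working pair {i, j}.

    alpha : current dual variables
    K     : kernel matrix
    y     : labels in {-1, +1}
    E     : cached errors E[k] = f(x_k) - y[k] under the current model
    C     : box constraint
    Sketch only; production solvers add caching, shrinking and tolerances.
    """
    # Feasible segment for alpha[j] that keeps sum(alpha * y) constant.
    if y[i] != y[j]:
        L, H = max(0.0, alpha[j] - alpha[i]), min(C, C + alpha[j] - alpha[i])
    else:
        L, H = max(0.0, alpha[i] + alpha[j] - C), min(C, alpha[i] + alpha[j])
    eta = K[i, i] + K[j, j] - 2.0 * K[i, j]   # curvature along the pair
    if L == H or eta <= 0:
        return alpha                           # degenerate pair: skip it
    a_j = np.clip(alpha[j] + y[j] * (E[i] - E[j]) / eta, L, H)
    a_i = alpha[i] + y[i] * y[j] * (alpha[j] - a_j)  # preserve the equality constraint
    alpha = alpha.copy()
    alpha[i], alpha[j] = a_i, a_j
    return alpha
```

The convergence results above concern precisely how such pairs (i, j) are selected across iterations.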


Journal ArticleDOI
TL;DR: The parallel SMO is developed using the message passing interface (MPI) and shows great speedup on the Adult data set and the MNIST (Modified National Institute of Standards and Technology) data set when many processors are used.
Abstract: Sequential minimal optimization (SMO) is one popular algorithm for training support vector machines (SVM), but it still requires a large amount of computation time for solving large-size problems. This paper proposes a parallel implementation of SMO for training SVM. The parallel SMO is developed using the message passing interface (MPI). Specifically, the parallel SMO first partitions the entire training data set into smaller subsets and then simultaneously runs multiple CPU processors, each dealing with one of the partitioned data sets. Experiments show that there is great speedup on the Adult data set and the MNIST (Modified National Institute of Standards and Technology) data set when many processors are used. There are also satisfactory results on the Web data set.

170 citations
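The paper's implementation is built on MPI; purely as an illustration of the partition-then-train idea (not the authors' cross-processor SMO synchronization scheme), here is a toy version using Python's multiprocessing and scikit-learn, with all names and the final combination heuristic being assumptions:

```python
import numpy as np
from multiprocessing import Pool
from sklearn.svm import SVC

def train_partition(args):
    """Train an SVM on one data partition; return its support-vector indices."""
    X_part, y_part = args
    return SVC(kernel="rbf", C=1.0).fit(X_part, y_part).support_

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(4000, 20))
    y = np.sign(X[:, 0] + 0.1 * rng.normal(size=4000))

    n_workers = 4
    parts = [(X[k::n_workers], y[k::n_workers]) for k in range(n_workers)]
    with Pool(n_workers) as pool:
        sv_lists = pool.map(train_partition, parts)   # partitions run in parallel

    # One simple combination heuristic: pool the partitions' support vectors
    # and train a final SVM on them (illustrative, not the paper's scheme).
    X_sv = np.vstack([p[0][idx] for p, idx in zip(parts, sv_lists)])
    y_sv = np.concatenate([p[1][idx] for p, idx in zip(parts, sv_lists)])
    final = SVC(kernel="rbf", C=1.0).fit(X_sv, y_sv)
```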


Journal ArticleDOI
TL;DR: A novel method for the prediction of catalytic sites, using a carefully selected, supervised machine learning algorithm coupled with an optimal discriminative set of protein sequence conservation and structural properties is presented.
Abstract: The number of protein sequences deriving from genome sequencing projects is outpacing our knowledge about the function of these proteins. With the gap between experimentally characterized and uncharacterized proteins continuing to widen, it is necessary to develop new computational methods and tools for functional prediction. Knowledge of catalytic sites provides a valuable insight into protein function. Although many computational methods have been developed to predict catalytic residues and active sites, their accuracy remains low, with a significant number of false positives. In this paper, we present a novel method for the prediction of catalytic sites, using a carefully selected, supervised machine learning algorithm coupled with an optimal discriminative set of protein sequence conservation and structural properties. To determine the best machine learning algorithm, 26 classifiers in the WEKA software package were compared using a benchmarking dataset of 79 enzymes with 254 catalytic residues in a 10-fold cross-validation analysis. Each residue of the dataset was represented by a set of 24 residue properties previously shown to be of functional relevance, as well as a label {+1/-1} to indicate catalytic/non-catalytic residue. The best-performing algorithm was the Sequential Minimal Optimization (SMO) algorithm, which is a Support Vector Machine (SVM). The Wrapper Subset Selection algorithm further selected seven of the 24 attributes as an optimal subset of residue properties, with sequence conservation, catalytic propensities of amino acids, and relative position on protein surface being the most important features. The SMO algorithm with 7 selected attributes correctly predicted 228 of the 254 catalytic residues, with an overall predictive accuracy of more than 86%. Missing only 10.2% of the catalytic residues, the method captures the fundamental features of catalytic residues and can be used as a "catalytic residue filter" to facilitate experimental identification of catalytic residues for proteins with known structure but unknown function.

120 citations
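The pipeline described (an SMO classifier, 10-fold cross-validation, wrapper subset selection over 24 residue properties) can be approximated outside WEKA. The sketch below uses scikit-learn stand-ins on synthetic data, so shapes, scores, and parameter choices are purely illustrative:

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X = rng.normal(size=(254, 24))       # 24 residue properties per residue
y = rng.choice([-1, 1], size=254)    # +1 catalytic / -1 non-catalytic (synthetic)

svm = make_pipeline(StandardScaler(), SVC(kernel="rbf"))

# Greedy wrapper-style selection of 7 of the 24 attributes by CV score,
# analogous in spirit to WEKA's wrapper subset selection.
selector = SequentialFeatureSelector(svm, n_features_to_select=7, cv=10)
X_sel = selector.fit_transform(X, y)

scores = cross_val_score(svm, X_sel, y, cv=10)
print("10-fold CV accuracy on the 7 selected features:", scores.mean())
```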


Journal ArticleDOI
TL;DR: A mechanism for selecting adequate training parameters makes the classification procedure fast and effective in the detection and classification of rolling-element bearing faults.

90 citations


Proceedings ArticleDOI
25 Jun 2006
TL;DR: This article proposes a method for combining multiple kernels in a nonstationary fashion using a large-margin latent-variable generative model within the maximum entropy discrimination (MED) framework, and shows that the support vector machine is a special case of this model.
Abstract: The power and popularity of kernel methods stem in part from their ability to handle diverse forms of structured inputs, including vectors, graphs and strings. Recently, several methods have been proposed for combining kernels from heterogeneous data sources. However, all of these methods produce stationary combinations; i.e., the relative weights of the various kernels do not vary among input examples. This article proposes a method for combining multiple kernels in a nonstationary fashion. The approach uses a large-margin latent-variable generative model within the maximum entropy discrimination (MED) framework. Latent parameter estimation is rendered tractable by variational bounds and an iterative optimization procedure. The classifier we use is a log-ratio of Gaussian mixtures, in which each component is implicitly mapped via a Mercer kernel function. We show that the support vector machine is a special case of this model. In this approach, discriminative parameter estimation is feasible via a fast sequential minimal optimization algorithm. Empirical results are presented on synthetic data, several benchmarks, and on a protein function annotation task.

89 citations


Journal ArticleDOI
TL;DR: A different reduced training set is selected to re-train the LS-SVM, and a new procedure is proposed to obtain sparseness; the results indicate that it is more effective.

51 citations


Proceedings Article
04 Dec 2006
TL;DR: A modified version of SVM is presented that allows the user to set a budget parameter B and focuses on minimizing the loss attained by the B worst-classified examples while ignoring the remaining examples.
Abstract: The standard Support Vector Machine formulation does not provide its user with the ability to explicitly control the number of support vectors used to define the generated classifier. We present a modified version of SVM that allows the user to set a budget parameter B and focuses on minimizing the loss attained by the B worst-classified examples while ignoring the remaining examples. This idea can be used to derive sparse versions of both L1-SVM and L2-SVM. Technically, we obtain these new SVM variants by replacing the 1-norm in the standard SVM formulation with various interpolation-norms. We also adapt the SMO optimization algorithm to our setting and report on some preliminary experimental results.

49 citations
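To make the "loss attained by the B worst-classified examples" idea concrete, here is a toy subgradient sketch for a linear classifier in NumPy. This is not the paper's interpolation-norm formulation or its SMO adaptation; the objective, step size, and names are assumptions for illustration:

```python
import numpy as np

def top_b_hinge_train(X, y, B, lam=0.1, lr=0.01, epochs=200):
    """Minimize lam/2 ||w||^2 plus the mean hinge loss of the B worst examples."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(epochs):
        losses = np.maximum(0.0, 1.0 - y * (X @ w))   # hinge loss per example
        worst = np.argsort(losses)[-B:]               # B worst-classified examples
        active = worst[losses[worst] > 0]             # where the hinge is active
        grad = lam * w
        if len(active):
            grad -= (y[active, None] * X[active]).sum(axis=0) / B
        w -= lr * grad                                # all other examples are ignored
    return w

rng = np.random.default_rng(1)
X = rng.normal(size=(500, 10))
y = np.sign(X[:, 0] + 0.2 * rng.normal(size=500))
w = top_b_hinge_train(X, y, B=50)
```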


Journal ArticleDOI
TL;DR: By computer simulations for two-class and multiclass benchmark data sets, it is shown that the proposed incremental training method can delete data considerably without deteriorating the generalization ability.

48 citations


Journal ArticleDOI
TL;DR: A parallel version of sequential minimal optimization (SMO) is developed in this paper for fast training of support vector machines (SVM); it shows great speedup on the Adult, MNIST, and IDEVAL data sets when many processors are used.

29 citations


01 Jan 2006
TL;DR: This article describes how to efficiently parallelize SVM training in order to cut down execution times, and shows that on most problems linear or even superlinear speedups can be attained.
Abstract: The Support Vector Machine (SVM) is a supervised algorithm for the solution of classification and regression problems. SVMs have gained widespread use in recent years because of successful applications like character recognition and the profound theoretical underpinnings concerning generalization performance. Yet, one of the remaining drawbacks of the SVM algorithm is its high computational demands during the training and testing phases. This article describes how to efficiently parallelize SVM training in order to cut down execution times. The parallelization technique employed is based on a decomposition approach, where the inner quadratic program (QP) is solved using Sequential Minimal Optimization (SMO). Thus all types of SVM formulations can be solved in parallel, including C-SVC and ν-SVC for classification as well as ε-SVR and ν-SVR for regression. Practical results show that on most problems linear or even superlinear speedups can be attained.

01 Jan 2006
TL;DR: Computational results show that the iterative learning method can simplify SVM effectively and can be implemented easily, and this method will help improve the classification speed of SVM.
Abstract: Support vector machines (SVM) are well known to give good results on a wide variety of pattern recognition problems, but for large-scale problems the number of support vectors is usually large, which results in substantially slower classification. Existing studies have proposed to speed up SVM classification by decreasing the number of support vectors. In this paper it is found that SVMs trained with the most important training points have fewer support vectors and accuracy equivalent to SVMs trained on the full training set. An iterative procedure is proposed to train the simplified SVM with the most important training points, and careful preprocessing of outliers is also used to speed up the iterative learning. Computational results indicate that, compared with SVMs trained on the full training set, the proposed method obtains simplified SVMs with far fewer support vectors and equivalent classification accuracy, which supports the proposed method as an effective way to obtain a simplified SVM for large problems. Keywords: Simplified SVM, Support Vector Machine, Iterative Learning
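A rough sketch of the retain-the-important-points loop, using support vectors as the proxy for "most important" points (the paper's actual selection rule and outlier preprocessing are not reproduced here):

```python
import numpy as np
from sklearn.svm import SVC

def iterative_simplify(X, y, rounds=3, **svc_kwargs):
    """Repeatedly retrain on the current model's support vectors.

    Each round shrinks the training set while aiming to keep the
    decision boundary roughly unchanged; a heuristic illustration only.
    """
    Xc, yc = X, y
    clf = SVC(**svc_kwargs).fit(Xc, yc)
    for _ in range(rounds):
        Xc, yc = Xc[clf.support_], yc[clf.support_]
        clf = SVC(**svc_kwargs).fit(Xc, yc)
    return clf

rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 5))
y = np.sign(X[:, 0] + 0.3 * rng.normal(size=2000))
clf = iterative_simplify(X, y, kernel="rbf", C=1.0)
print("support vectors in simplified model:", len(clf.support_))
```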

Proceedings ArticleDOI
01 Aug 2006
TL;DR: This paper introduces a support vector machine in which the training examples are fuzzy inputs, and gives a solution procedure for the support vector machine with fuzzy training data.
Abstract: Support vector machines (SVMs) have been very successful in pattern recognition and function estimation problems, but in the support vector machine for classification the training examples are non-fuzzy inputs and the output is y = ±1. In this paper, we introduce a support vector machine in which the training examples are fuzzy inputs, and give a solution procedure for the support vector machine with fuzzy training data.

Proceedings ArticleDOI
26 Sep 2006
TL;DR: Evolutionary support vector machines (ESVMs) as discussed by the authors are a novel technique that assimilates the learning engine of the state-of-the-art SVM, but evolves the coefficients of the decision function by means of evolutionary algorithms.
Abstract: Evolutionary support vector machines (ESVMs) are a novel technique that assimilates the learning engine of the state-of-the-art support vector machines (SVMs) but evolves the coefficients of the decision function by means of evolutionary algorithms (EAs). The new method has accomplished the purpose for which it was initially developed: to be a simpler alternative to the canonical SVM approach for solving the optimization component of training. ESVMs, like SVMs, are natural tools for primary application to classification. However, since the latter have been further extended to also handle regression, it is the scope of this paper to present the corresponding evolutionary paradigm. In particular, we consider the hybridization with the classical ε-support vector regression (ε-SVR) introduced by Vapnik and the subsequent evolution of the coefficients of the regression hyperplane. ε-evolutionary support vector regression (ε-ESVR) is validated on the Boston housing benchmark problem, and the obtained results demonstrate the promise of ESVMs for regression as well.

01 Jan 2006
TL;DR: An extensive empirical evaluation of two popular semi-supervised classification algorithms: Transductive Support Vector Machines (TSVM) and Tri-Training.
Abstract: In this paper we present and analyze the methodological approach we have used for addressing the ECML-PKDD Discovery Challenge 2006. The Challenge was concerned with the identification of individual users' spam emails based on a centrally collected training set. The task descriptions of the discovery challenge indicated that we should deviate from the classical supervised classification paradigm and attempt to utilize semi-supervised and transductive approaches. The format of the training data (bag-of-words providing only word IDs) did not allow either for the use of Natural Language Processing (NLP) approaches, or for the use of standard spam-recognition strategies. The submitted model, which achieved 5th place on Task A of the challenge, was derived by Tri-Training, a recent development in semi-supervised algorithms research. Given a standard classifier, Tri-Training initially uses bagging to produce three diverse training sets and corresponding classifiers, which are used for classifying the unlabeled data and incorporating them into the training set in a theoretically sound way. The classifier we have used within Tri-Training was Support Vector Machines (SVM), and more precisely the Sequential Minimal Optimization (SMO) implementation of WEKA. Moreover, we have used feature normalization and logistic regression models to produce continuous outputs. Apart from a detailed description and a discussion of the submitted model, this paper contains an extensive empirical evaluation of two popular semi-supervised classification algorithms: Transductive Support Vector Machines (TSVM) and Tri-Training.
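A bare-bones rendering of the Tri-Training loop sketched above: bag three classifiers, then let each absorb the unlabeled points on which the other two agree. The error-rate conditions that make real Tri-Training theoretically sound are omitted, and all names are illustrative:

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.utils import resample

def tri_train(X_lab, y_lab, X_unlab, rounds=3, seed=0):
    # Three diverse classifiers from bootstrap samples of the labeled data.
    clfs, sets = [], []
    for k in range(3):
        Xb, yb = resample(X_lab, y_lab, random_state=seed + k)
        clfs.append(SVC(kernel="linear").fit(Xb, yb))
        sets.append((Xb, yb))
    for _ in range(rounds):
        for k in range(3):
            a, b = [m for m in range(3) if m != k]
            pa = clfs[a].predict(X_unlab)
            pb = clfs[b].predict(X_unlab)
            agree = pa == pb                       # the other two classifiers agree
            if agree.any():
                Xk = np.vstack([sets[k][0], X_unlab[agree]])
                yk = np.concatenate([sets[k][1], pa[agree]])
                sets[k] = (Xk, yk)
                clfs[k] = SVC(kernel="linear").fit(Xk, yk)
    return clfs

def tri_predict(clfs, X):
    votes = np.stack([c.predict(X) for c in clfs])
    return np.sign(votes.sum(axis=0))              # majority vote for labels in {-1, +1}
```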

Proceedings ArticleDOI
Patrick Haffner
25 Jun 2006
TL;DR: A new method based on transposition is proposed to speed up this computation on sparse data: instead of dot-products over sparse feature vectors, the computation incrementally merges lists of training examples and minimizes access to the data.
Abstract: Kernel-based learning algorithms, such as Support Vector Machines (SVMs) or Perceptron, often rely on sequential optimization where a few examples are added at each iteration. Updating the kernel matrix usually requires matrix-vector multiplications. We propose a new method based on transposition to speedup this computation on sparse data. Instead of dot-products over sparse feature vectors, our computation incrementally merges lists of training examples and minimizes access to the data. Caching and shrinking are also optimized for sparsity. On very large natural language tasks (tagging, translation, text classification) with sparse feature representations, a 20 to 80-fold speedup over LIBSVM is observed using the same SMO algorithm. Theory and experiments explain what type of sparsity structure is needed for this approach to work, and why its adaptation to Maxent sequential optimization is inefficient.
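The transposition idea reduces to a few lines: store the training set column-wise (one inverted list per feature) so that computing a kernel row for a new sparse example only touches the training examples that share one of its nonzero features. A hypothetical linear-kernel sketch:

```python
from collections import defaultdict

def build_inverted_index(examples):
    """examples: list of sparse rows as dicts {feature_id: value}."""
    index = defaultdict(list)               # feature_id -> [(example_id, value)]
    for i, x in enumerate(examples):
        for f, v in x.items():
            index[f].append((i, v))
    return index

def kernel_row(index, x, n_examples):
    """Linear-kernel row K[i] = <x_i, x> computed by merging inverted lists.

    Only examples sharing a nonzero feature with x are visited,
    which is where the speedup on sparse data comes from.
    """
    row = [0.0] * n_examples
    for f, v in x.items():
        for i, vi in index.get(f, ()):
            row[i] += vi * v
    return row

train = [{0: 1.0, 5: 2.0}, {5: 1.0}, {3: 4.0}]
idx = build_inverted_index(train)
print(kernel_row(idx, {5: 3.0, 3: 1.0}, len(train)))   # -> [6.0, 3.0, 4.0]
```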

Book ChapterDOI
03 Oct 2006
TL;DR: A novel sequential minimal optimization algorithm for support vector regression in which convex optimization problems with l variables are solved instead of standard quadratic programming problems with 2l variables where l is the number of training samples.
Abstract: A novel sequential minimal optimization (SMO) algorithm for support vector regression is proposed. This algorithm is based on Flake and Lawrence's SMO, in which convex optimization problems with l variables are solved instead of standard quadratic programming problems with 2l variables, where l is the number of training samples; however, the strategy for working set selection is quite different. Experimental results show that the proposed algorithm is much faster than Flake and Lawrence's SMO and comparable to the fastest conventional SMO.
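For context, the l-variable problem referred to here arises from rewriting the standard SVR dual. Substituting β_i = α_i − α_i* (with α_i α_i* = 0 at optimality, so α_i + α_i* = |β_i|) collapses the usual 2l-variable QP to l variables; the notation below is assumed, not taken from the paper:

```latex
% Standard SVR dual: 2l variables (alpha, alpha^*)
\max_{\alpha,\alpha^*}\;
  -\tfrac{1}{2}\sum_{i,j}(\alpha_i-\alpha_i^*)(\alpha_j-\alpha_j^*)K_{ij}
  -\varepsilon\sum_i(\alpha_i+\alpha_i^*)
  +\sum_i y_i(\alpha_i-\alpha_i^*)
\quad\text{s.t.}\;\; \sum_i(\alpha_i-\alpha_i^*)=0,\;\; 0\le\alpha_i,\alpha_i^*\le C.

% With beta_i = alpha_i - alpha_i^*, this becomes an l-variable problem:
\max_{\beta}\;
  -\tfrac{1}{2}\,\beta^{\top}K\beta - \varepsilon\lVert\beta\rVert_1 + y^{\top}\beta
\quad\text{s.t.}\;\; \sum_i\beta_i = 0,\;\; -C\le\beta_i\le C.
```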

Proceedings ArticleDOI
18 Dec 2006
TL;DR: Two new support vector approaches for ordinal regression find the concentric spheres with minimum volume that contain most of the training samples and guarantee that the radii of the spheres are properly ordered at the optimal solution.
Abstract: We present two new support vector approaches for ordinal regression. These approaches find the concentric spheres with minimum volume that contain most of the training samples. Both approaches guarantee that the radii of the spheres are properly ordered at the optimal solution. The size of the optimization problem is linear in the number of training samples. The popular SMO algorithm is adapted to solve the resulting optimization problem. Numerical experiments on some real-world data sets verify the usefulness of our approaches for data mining.

Proceedings ArticleDOI
01 Jan 2006
TL;DR: This work proposes an approximate SVM, where a small number of representatives are extracted from the original training data set and used for training, and proposes two efficient implementations of the proposed algorithm, where approximations to kernel k-means are used.
Abstract: We propose to speed up the training process of support vector machines (SVM) by resorting to an approximate SVM, where a small number of representatives are extracted from the original training data set and used for training. Theoretical studies show that, in order for the approximate SVM to be similar to the exact SVM given by the original training data set, kernel k-means should be used to extract the representatives. As practical variations, we also propose two efficient implementations of the proposed algorithm, where approximations to kernel k-means are used. The proposed algorithms are compared against the standard training algorithm over real data sets.
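A sketch of the representative-extraction pipeline, with plain k-means in input space as a crude stand-in for the kernel k-means the theory calls for (data, cluster counts, and kernel choice are all illustrative):

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.svm import SVC

rng = np.random.default_rng(0)
X = rng.normal(size=(5000, 10))
y = np.sign(X[:, 0] + 0.2 * rng.normal(size=5000))

# Extract m representatives per class via clustering.
m = 100
reps, labels = [], []
for cls in (-1, 1):
    km = KMeans(n_clusters=m, n_init=5, random_state=0).fit(X[y == cls])
    reps.append(km.cluster_centers_)
    labels.append(np.full(m, cls))

X_rep = np.vstack(reps)
y_rep = np.concatenate(labels)
approx_svm = SVC(kernel="rbf").fit(X_rep, y_rep)   # train on 200 points, not 5000
```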

Proceedings ArticleDOI
20 Jun 2006
TL;DR: A parallel multi-class SVM based on sequential minimal optimization (SMO) is proposed in this paper, which combines SMO, parallel technology, DTSVM, and clustering; experiments show that the speeds of training and classifying are improved remarkably.
Abstract: The Support Vector Machine (SVM) was originally developed for binary classification problems. In order to solve practical multi-class problems, various approaches such as one-against-rest (1-a-r), one-against-one (1-a-1), and decision-tree-based SVMs have been presented. The disadvantages of the existing methods of SVM multi-class classification are analyzed and compared in this paper; for example, 1-a-r is difficult to train and the classifying speed of 1-a-1 is slow. To solve these problems, a parallel multi-class SVM based on Sequential Minimal Optimization (SMO) is proposed in this paper. This method combines SMO, parallel technology, decision-tree SVM (DTSVM), and clustering. Experiments have been made on the University of California, Irvine (UCI) database, in which five benchmark datasets were selected for testing. The experiments compare 1-a-r, 1-a-1, and this method on training and testing time. The results show that the speeds of training and classifying are improved remarkably.

Proceedings ArticleDOI
01 Jul 2006
TL;DR: Simulation results demonstrate that the LS-SVM method is better than SVM in accuracy, static-state performance, and computational cost.
Abstract: This paper first provides a short introduction to the least squares support vector machine (LS-SVM), then provides sequential minimal optimization (SMO) based pruning algorithms for LS-SVM. After a brief discussion of inverse-model identification, an LS-SVM based direct-model identification method is developed using the LS-SVM's excellent function approximation ability. The most important and difficult step in inverse control methods is the modeling of the inverse nonlinear dynamic system. Both SVM and LS-SVM can solve this problem. Simulation results demonstrate that the LS-SVM method is better than SVM in accuracy, static-state performance, and computational cost.
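For readers unfamiliar with LS-SVM: training reduces to a single linear system rather than a QP, which is what makes pruning and SMO-style solvers for it attractive. A minimal NumPy sketch of the standard classifier system (Suykens' formulation; the RBF kernel and parameter names are assumptions):

```python
import numpy as np

def rbf_kernel(X1, X2, gamma_k=0.5):
    d2 = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma_k * d2)

def lssvm_train(X, y, gam=10.0):
    """Solve the LS-SVM classifier KKT system:
        [ 0      y^T         ] [ b     ]   [ 0 ]
        [ y   Omega + I/gam  ] [ alpha ] = [ 1 ]
    with Omega_ij = y_i y_j K(x_i, x_j)."""
    n = len(y)
    Omega = (y[:, None] * y[None, :]) * rbf_kernel(X, X)
    A = np.zeros((n + 1, n + 1))
    A[0, 1:] = y
    A[1:, 0] = y
    A[1:, 1:] = Omega + np.eye(n) / gam
    rhs = np.concatenate([[0.0], np.ones(n)])
    sol = np.linalg.solve(A, rhs)
    return sol[1:], sol[0]                       # alpha, b

def lssvm_predict(X_train, y_train, alpha, b, X_new):
    return np.sign(rbf_kernel(X_new, X_train) @ (alpha * y_train) + b)
```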

Proceedings ArticleDOI
01 Dec 2006
TL;DR: It is shown that the Evolutionary Support Vector Machine has good generalization properties when compared with Support Vector Machines using standard (polynomial and radial basis) kernel functions.
Abstract: A machine learning algorithm using evolutionary algorithms and Support Vector Machines is presented. The kernel function of the support vector machine is evolved using the recently introduced Gene Expression Programming algorithm. This technique trains a support vector machine with the kernel function most suitable for the training data set rather than pre-specifying the kernel function. The fitness of the kernel is measured by calculating cross-validation accuracy. The SVM trained with the fittest kernel is then used to classify previously unseen data. The algorithm is elucidated using preliminary case studies for classification of cancer data and a bank transaction data set. It is shown that the Evolutionary Support Vector Machine has good generalization properties when compared with Support Vector Machines using standard (polynomial and radial basis) kernel functions.

Journal ArticleDOI
TL;DR: Another learning algorithm, particle swarm optimization, is introduced for training SVMs, and it is found that this method works well on UCI datasets.

Proceedings ArticleDOI
01 Jan 2006
TL;DR: Based on the classification equivalence between the previous training set and the newly added training set, a new algorithm for SVM incremental learning is proposed in which useless samples are discarded and the useful information in training samples is accumulated.
Abstract: Based on an analysis of the relationship between the Karush-Kuhn-Tucker (KKT) conditions of support vector machines and the distribution of the training samples, the possible changes of the support vector set after new samples are added to the training set are analyzed, and generalized Karush-Kuhn-Tucker conditions are defined. Based on the classification equivalence between the previous training set and the newly added training set, a new algorithm for SVM incremental learning is proposed. With the presented algorithm, useless samples are discarded and the useful information in training samples is accumulated. Experimental results on standard datasets indicate the effectiveness of the proposed algorithm.
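The KKT test that drives this style of incremental learning is simple to state: a new sample with implicit α = 0 already satisfies the current model's KKT conditions iff y·f(x) ≥ 1, and such a sample cannot change the solution. A hedged scikit-learn sketch (retraining on old support vectors plus violators is a common heuristic here, not the paper's exact generalized-KKT procedure):

```python
import numpy as np
from sklearn.svm import SVC

def incremental_update(clf, X_sv, y_sv, X_new, y_new, **svc_kwargs):
    """Retrain only when some new samples violate the current KKT conditions."""
    margins = y_new * clf.decision_function(X_new)
    violators = margins < 1.0            # inside the margin or misclassified
    if not violators.any():
        return clf                       # all new samples satisfy KKT: keep model
    X_train = np.vstack([X_sv, X_new[violators]])
    y_train = np.concatenate([y_sv, y_new[violators]])
    return SVC(**svc_kwargs).fit(X_train, y_train)
```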

Proceedings ArticleDOI
30 Oct 2006
TL;DR: This paper considers an SMO algorithm, which deals with the same optimization problem as Flake and Lawrence's SMO, and gives a rigorous proof that it always stops within a finite number of iterations.
Abstract: A sequential minimal optimization (SMO) algorithm for support vector regression (SVR) has recently been proposed by Flake and Lawrence. However, the convergence of their algorithm has not been proved so far. In this paper, we consider an SMO algorithm, which deals with the same optimization problem as Flake and Lawrence's SMO, and give a rigorous proof that it always stops within a finite number of iterations.

Proceedings ArticleDOI
Hai-Tao He, Nan Li
01 Aug 2006
TL;DR: A new approach based on the structural equivalence of the radial basis function (RBF) network and support vector machines (SVM) is efficient and intelligent; the SMO algorithm is employed to obtain a more optimal structure and initial parameters for the RBF network.
Abstract: In the traditional method of flatness pattern recognition, a neural network with a changing topological configuration, slow convergence and local minima were observed. Moreover, the process of choosing the initial parameters and structure of the neural network from prior experience has proved time-consuming and complex. In this paper, a new approach is proposed based on the structural equivalence of the radial basis function (RBF) network and Support Vector Machines (SVM). The SMO algorithm is employed to obtain a more optimal structure and initial parameters for the RBF network, and then the BP algorithm is used to fine-tune the RBF network. The new approach, which inherits the advantages of SVM such as fast learning and global optimization, is efficient and intelligent.

Proceedings ArticleDOI
30 Oct 2006
TL;DR: A new formulation for SVM is proposed that makes possible to include the hyperparameter C in the definition of the kernel parameters, equivalent to choosing the best values of kernel parameters.
Abstract: Model selection for support vector machines concerns the tuning of SVM hyperparameters, such as C, which controls the amount of overlap, and the kernel parameters. Several criteria developed for tuning the SVM hyperparameters may not be differentiable w.r.t. C; consequently, gradient-based optimization methods are not applicable. In this paper, we propose a new formulation for SVM that makes it possible to include the hyperparameter C in the definition of the kernel parameters. Tuning the hyperparameters for SVM is then equivalent to choosing the best values of the kernel parameters. We tested this new formulation for model selection using the criterion of empirical error, a technique based on generalization-error minimization through a validation set. Experiments on different benchmarks show promising results confirming our approach.
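One well-known special case conveys the flavor of folding C into the kernel (whether it matches the paper's exact construction is not claimed here): for the 2-norm soft margin, the dual with parameter C is exactly a hard-margin dual with a modified kernel, so C behaves like any other kernel parameter:

```latex
% 2-norm soft-margin dual ...
\max_{\alpha}\;\sum_i\alpha_i
 -\tfrac{1}{2}\sum_{i,j}\alpha_i\alpha_j y_i y_j
  \Big(K(x_i,x_j)+\tfrac{1}{C}\,\delta_{ij}\Big)
\quad\text{s.t.}\;\;\alpha_i\ge 0,\;\;\sum_i\alpha_i y_i=0.
% ... i.e. the hard-margin dual with the modified kernel
% \tilde{K}(x_i,x_j) = K(x_i,x_j) + \delta_{ij}/C .
```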

Journal ArticleDOI
TL;DR: DAGSVMlight is proposed to select the working set, which is identical to the working set selected by the SVMlight approach; it may be an especially useful tool for large-scale multiclass classification problems and may lead to more widespread use of SVMs in the engineering community due to its good performance.

01 Jan 2006
TL;DR: A novel QP solver based on sequential minimal optimization (SMO) with a new strategy for selecting the variables to be optimized; it converges in a finite number of iterations to a solution that differs from the optimal one by at most a prescribed constant.
Abstract: This report proposes a novel optimization algorithm for learning support vector machine (SVM) classifiers with structured output spaces, introduced recently by Tsochantaridis et al. Learning a structural SVM classifier leads to a special instance of quadratic programming (QP) optimization with a huge number of constraints. The number of constraints is proportional to the cardinality of the output space, which makes the QP task intractable by classical optimization methods. We propose a novel QP solver based on sequential minimal optimization (SMO). Unlike the original SMO, we propose a novel strategy for selecting the variables to be optimized, aiming at the variables that yield the maximal improvement of the objective. We prove that the algorithm converges in a finite number of iterations to a solution that differs from the optimal one by at most a prescribed constant. Experiments show that the proposed algorithm is very competitive with the cutting plane algorithm of Tsochantaridis et al. The proposed algorithm can be easily implemented and does not require any external QP solver, in contrast to the cutting plane algorithm. We demonstrate the capability of the algorithm on the problem of learning a Hidden Markov Network for color image segmentation and learning a structural classifier for car license plate recognition.

Book ChapterDOI
10 Sep 2006
TL;DR: The nonlinear regression ability of Support Vector Machines has been demonstrated by forming an SVM model of a microwave transistor and comparing it with its neural model.
Abstract: Support Vector Machines (SVM) are a system for efficiently training linear learning machines in kernel-induced feature spaces, while respecting the insights provided by generalization theory and exploiting optimization theory. In this work, Support Vector Machines are employed for nonlinear regression. The nonlinear regression ability of Support Vector Machines has been demonstrated by forming an SVM model of a microwave transistor, which is compared with its neural model.