Large Margin DAGs for Multiclass Classification

Open AccessProceedings Article

Large Margin DAGs for Multiclass Classification

John Platt, +2 more

- Vol. 12, pp 547-553

Chats0

TLDR

An algorithm, DAGSVM, is presented, which operates in a kernel-induced feature space and uses two-class maximal margin hyperplanes at each decision-node of the DDAG, which is substantially faster to train and evaluate than either the standard algorithm or Max Wins, while maintaining comparable accuracy to both of these algorithms.

Abstract:

We present a new learning architecture: the Decision Directed Acyclic Graph (DDAG), which is used to combine many two-class classifiers into a multiclass classifier. For an N-class problem, the DDAG contains N(N - 1)/2 classifiers, one for each pair of classes. We present a VC analysis of the case when the node classifiers are hyperplanes; the resulting bound on the test error depends on N and on the margin achieved at the nodes, but not on the dimension of the space. This motivates an algorithm, DAGSVM, which operates in a kernel-induced feature space and uses two-class maximal margin hyperplanes at each decision-node of the DDAG. The DAGSVM is substantially faster to train and evaluate than either the standard algorithm or Max Wins, while maintaining comparable accuracy to both of these algorithms.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

LIBSVM: A library for support vector machines

Chih-Chung Chang, +1 more

- 06 May 2011 -

ACM Transactions on Intelligent Systems ...

TL;DR: Issues such as solving SVM optimization problems theoretical convergence multiclass classification probability estimates and parameter selection are discussed in detail.

...read moreread less

Pattern Recognition and Machine Learning

Christopher M. Bishop

TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.

...read moreread less

Book

Machine Learning : A Probabilistic Perspective

Kevin P. Murphy

TL;DR: This textbook offers a comprehensive and self-contained introduction to the field of machine learning, based on a unified, probabilistic approach, and is suitable for upper-level undergraduates with an introductory-level college math background and beginning graduate students.

...read moreread less

Proceedings ArticleDOI

Learning to detect unseen object classes by between-class attribute transfer

Christoph H. Lampert, +2 more

TL;DR: The experiments show that by using an attribute layer it is indeed possible to build a learning object detection system that does not require any training images of the target classes, and assembled a new large-scale dataset, “Animals with Attributes”, of over 30,000 animal images that match the 50 classes in Osherson's classic table of how strongly humans associate 85 semantic attributes with animal classes.

...read moreread less

Journal Article

On the algorithmic implementation of multiclass kernel-based vector machines

Koby Crammer, +1 more

- 01 Mar 2002 -

Journal of Machine Learning Research

TL;DR: This paper describes the algorithmic implementation of multiclass kernel-based vector machines using a generalized notion of the margin to multiclass problems, and describes an efficient fixed-point algorithm for solving the reduced optimization problems and proves its convergence.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Statistical learning theory

Vladimir Vapnik

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.

...read moreread less

UCI Repository of machine learning databases

Catherine Blake

Fast training of support vector machines using sequential minimal optimization, advances in kernel methods

J. C. Platt

TL;DR: SMO breaks this large quadratic programming problem into a series of smallest possible QP problems, which avoids using a time-consuming numerical QP optimization as an inner loop and hence SMO is fastest for linear SVMs and sparse data sets.

...read moreread less

Book

Fast training of support vector machines using sequential minimal optimization

John Platt

TL;DR: In this article, the authors proposed a new algorithm for training Support Vector Machines (SVM) called SMO (Sequential Minimal Optimization), which breaks this large QP problem into a series of smallest possible QP problems.

...read moreread less

Journal ArticleDOI

Approximate statistical tests for comparing supervised classification learning algorithms

Thomas G. Dietterich

- 01 Oct 1998 -

Neural Computation

TL;DR: This article reviews five approximate statistical tests for determining whether one learning algorithm outperforms another on a particular learning task and measures the power (ability to detect algorithm differences when they do exist) of these tests.

...read moreread less

Large Margin DAGs for Multiclass Classification

Citations

LIBSVM: A library for support vector machines

Pattern Recognition and Machine Learning

Machine Learning : A Probabilistic Perspective

Learning to detect unseen object classes by between-class attribute transfer

On the algorithmic implementation of multiclass kernel-based vector machines

References

Statistical learning theory

UCI Repository of machine learning databases

Fast training of support vector machines using sequential minimal optimization, advances in kernel methods

Fast training of support vector machines using sequential minimal optimization

Approximate statistical tests for comparing supervised classification learning algorithms

Related Papers (5)

A comparison of methods for multiclass support vector machines

Statistical learning theory

The Nature of Statistical Learning Theory

Support-Vector Networks

Solving multiclass learning problems via error-correcting output codes