Open Access Proceedings Article

Multiclass support vector machines for articulatory feature classification

TL;DR
This ongoing research project investigates articulatory feature (AF) classification using multiclass support vector machines (SVMs), assessing the AF classification performance of different multiclass generalizations of the SVM: one-versus-rest, one-versus-one, Decision Directed Acyclic Graph (DDAG), and direct methods for multiclass learning.
Abstract
This ongoing research project investigates articulatory feature (AF) classification using multiclass support vector machines (SVMs). SVMs are being constructed for each AF in the multi-valued feature set (Table 1), using speech data and annotation from the IFA Dutch “Open-Source” (van Son et al. 2001) and TIMIT English (Garofolo et al. 1993) corpora. The primary objective of this research is to assess the AF classification performance of different multiclass generalizations of the SVM, including one-versus-rest, one-versus-one, Decision Directed Acyclic Graph (DDAG), and direct methods for multiclass learning. Given the successful application of SVMs to numerous classification problems (Bennett and Campbell 2000), it is hoped that multiclass SVMs will outperform existing state-of-the-art AF classifiers.

One of the most basic challenges for speech recognition and other spoken language systems is to accurately map data from the acoustic domain into the linguistic domain. Much speech processing research has approached this task by exploiting the correlation between phones, the basic units of speech sound, and their acoustic manifestation (intuitively, there is a range of sounds that humans would consider to be an “e”). The mapping of acoustic data to phones has been largely successful and is used in many speech systems today. Despite this success, there are drawbacks to using phones as the point of entry from the acoustic to the linguistic domain. Notably, the granularity of the “phonetic-segmental” model, in which speech is represented as a series of phones, makes it difficult to account for various subphone phenomena that affect performance on spontaneous speech.

Researchers have pursued an alternative approach to the acoustic-linguistic mapping through the use of articulatory modeling. This approach more directly exploits the intimate relation between articulation and acoustics: the state of one’s speech articulators (e.g. vocal folds, tongue) uniquely determines the parameters of the acoustic speech signal. Unfortunately, while the mapping from articulators to acoustics is straightforward, the problem of recovering the state of the articulators from an acoustic speech representation, known as acoustic-to-articulatory inversion, poses a formidable challenge (Toutios and Margaritis 2003). Nevertheless, re-
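As a rough illustration of the comparison the abstract describes, the sketch below trains one-versus-rest, one-versus-one, and direct (Crammer and Singer) multiclass SVMs on synthetic stand-ins for acoustic frame vectors and AF labels. This is a minimal sketch of the general technique, not the paper's setup: scikit-learn (whose SVC wraps LIBSVM), the 39-dimensional frames, and the 5-valued feature are all assumptions.

import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.multiclass import OneVsOneClassifier, OneVsRestClassifier
from sklearn.svm import SVC, LinearSVC

rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 39))    # stand-in for 39-dim acoustic frame features
y = rng.integers(0, 5, size=1000)  # stand-in for a 5-valued articulatory feature

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

strategies = {
    "one-versus-rest": OneVsRestClassifier(SVC(kernel="rbf", gamma="scale")),
    "one-versus-one": OneVsOneClassifier(SVC(kernel="rbf", gamma="scale")),
    # the "direct" single-machine formulation (linear kernel only here)
    "direct (Crammer-Singer)": LinearSVC(multi_class="crammer_singer"),
}
for name, clf in strategies.items():
    clf.fit(X_tr, y_tr)
    print(f"{name}: held-out accuracy = {clf.score(X_te, y_te):.3f}")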



References
Journal Article

LIBSVM: A library for support vector machines

TL;DR: Issues such as solving SVM optimization problems, theoretical convergence, multiclass classification, probability estimates, and parameter selection are discussed in detail.
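As a hedged illustration of the parameter selection and probability estimates this TL;DR mentions, the sketch below runs a small grid search with the Python interface bundled with LIBSVM (svmutil); the toy features, labels, and grid values are assumptions, not values from any paper.

from libsvm.svmutil import svm_train, svm_predict

# Toy data: LIBSVM's Python interface accepts lists of {index: value} dicts.
X = [{1: 0.2, 2: -1.3}, {1: 0.9, 2: 0.4}, {1: -0.5, 2: 1.1}, {1: 1.4, 2: -0.2}] * 25
y = [0, 1, 2, 1] * 25

best = None
for log2c in (-1, 0, 1, 2):
    for log2g in (-3, -2, -1):
        # '-v 5' runs 5-fold cross-validation and returns its accuracy
        acc = svm_train(y, X, f'-q -c {2 ** log2c} -g {2 ** log2g} -v 5')
        if best is None or acc > best[0]:
            best = (acc, 2 ** log2c, 2 ** log2g)

_, C, gamma = best
# Retrain at the selected (C, gamma) with '-b 1' for probability estimates.
model = svm_train(y, X, f'-q -c {C} -g {gamma} -b 1')
labels, _, probabilities = svm_predict(y, X, model, '-b 1')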
Book

Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond

TL;DR: Learning with Kernels provides an introduction to SVMs and related kernel methods, covering the concepts needed for a reader with basic mathematical knowledge to enter the world of machine learning using theoretically well-founded yet easy-to-use kernel algorithms.
Journal Article

On the algorithmic implementation of multiclass kernel-based vector machines

TL;DR: This paper describes the algorithmic implementation of multiclass kernel-based vector machines, extending a generalized notion of the margin to multiclass problems, and presents an efficient fixed-point algorithm for solving the reduced optimization problems, with a proof of its convergence.
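For reference, a minimal statement of the direct ("single machine") formulation this line of work optimizes, in standard notation (k classes, prototype vectors w_1, ..., w_k, slack variables \xi_i, Kronecker delta \delta):

\min_{\{w_m\},\, \xi \ge 0} \; \frac{1}{2} \sum_{m=1}^{k} \lVert w_m \rVert^2 + C \sum_{i=1}^{n} \xi_i
\quad \text{s.t.} \quad w_{y_i}^{\top} x_i - w_m^{\top} x_i \;\ge\; 1 - \delta_{y_i, m} - \xi_i \quad \forall\, i, m,

with prediction \hat{y}(x) = \arg\max_m w_m^{\top} x. The fixed-point algorithm the TL;DR refers to solves the per-example reduced dual subproblems of this program.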
Proceedings Article

Large Margin DAGs for Multiclass Classification

TL;DR: An algorithm, DAGSVM, is presented that operates in a kernel-induced feature space and uses two-class maximal-margin hyperplanes at each decision node of the DDAG; it is substantially faster to train and evaluate than either the standard algorithm or Max Wins, while maintaining comparable accuracy to both.
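To make the DDAG's evaluation procedure concrete, here is a minimal Python sketch (the pair_clf mapping and its calling convention are hypothetical): a k-class DDAG keeps a list of candidate classes and lets each two-class decision node eliminate one candidate, so predicting costs only k - 1 binary evaluations even though k(k - 1)/2 pairwise classifiers are trained.

def ddag_predict(pair_clf, k, x):
    # pair_clf[(i, j)] is a trained two-class maximal-margin classifier
    # that returns either i or j for input x (hypothetical interface).
    classes = list(range(k))
    while len(classes) > 1:
        i, j = classes[0], classes[-1]           # test the two "outer" candidates
        winner = pair_clf[(i, j)](x)
        classes.remove(j if winner == i else i)  # discard the losing class
    return classes[0]

For k = 4, for example, each prediction consults 3 of the 6 trained pairwise classifiers, which is the source of DAGSVM's evaluation speedup over Max Wins.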