A support vector machine-based dynamic network for visual speech recognition applications

doi:10.1155/S1110865702207039

Open AccessJournal ArticleDOI

A support vector machine-based dynamic network for visual speech recognition applications

Mihaela Gordan, +2 more

- 01 Jan 2002 -

EURASIP Journal on Advances in Signal Pr...

- Vol. 2002, Iss: 1, pp 1248-1259

TLDR

This paper examines the suitability of support vector machines for visual speech recognition by modeling the temporal character of speech as a temporal sequence of visemes corresponding to the different phones realized in a Viterbi lattice.

Abstract:

Visual speech recognition is an emerging research field. In this paper, we examine the suitability of support vector machines for visual speech recognition. Each word is modeled as a temporal sequence of visemes corresponding to the different phones realized. One support vector machine is trained to recognize each viseme and its output is converted to a posterior probability through a sigmoidal mapping. To model the temporal character of speech, the support vector machines are integrated as nodes into a Viterbi lattice. We test the performance of the proposed approach on a small visual speech recognition task, namely the recognition of the first four digits in English. The word recognition rate obtained is at the level of the previous best reported rates.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Recent advances in the automatic recognition of audiovisual speech

Gerasimos Potamianos, +4 more

TL;DR: The main components of audiovisual automatic speech recognition (ASR) are reviewed and novel contributions in two main areas are presented: first, the visual front-end design, based on a cascade of linear image transforms of an appropriate video region of interest, and subsequently, audiovISual speech integration.

...read moreread less

Journal ArticleDOI

Audio-Visual Biometrics

Petar Aleksic, +1 more

TL;DR: The main components of audio-visual biometric systems are described, existing systems and their performance are reviewed, and future research and development directions in this area are discussed.

...read moreread less

Proceedings ArticleDOI

Visual speech recognition with loosely synchronized feature streams

Kate Saenko, +5 more

TL;DR: A novel dynamic Bayesian network with a multi-stream structure and observations consisting of articulate feature classifier scores, which can model varying degrees of co-articulation in a principled way is presented.

...read moreread less

Joint audio-visual speech processing for recognition and enhancement.

Gerasimos Potamianos, +2 more

TL;DR: Two general approaches that utilize visual speech to improve ASR in acoustically challenging environments are reviewed: One directly combines features extracted from the acoustic and visual channels, aiming at superior recognition performance of the resulting audio-visual ASR system and the other seeks to eliminate the noise present in the acoustic features, resulting in improved speech recognition.

...read moreread less

Proceedings ArticleDOI

Articulatory features for robust visual speech recognition

Kate Saenko, +2 more

TL;DR: A novel approach to visual speech modeling, based on articulatory features, which has potential benefits under visually challenging conditions, and is evaluated in a preliminary experiment on a small audio-visual database.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Statistical learning theory

Vladimir Vapnik

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.

...read moreread less

Journal ArticleDOI

A Tutorial on Support Vector Machines for Pattern Recognition

Christopher John Burges

- 01 Jun 1998 -

Data Mining and Knowledge Discovery

TL;DR: There are several arguments which support the observed high accuracy of SVMs, which are reviewed and numerous examples and proofs of most of the key theorems are given.

...read moreread less

Book

Probability, random variables and stochastic processes

Athanasios Papoulis

TL;DR: This chapter discusses the concept of a Random Variable, the meaning of Probability, and the axioms of probability in terms of Markov Chains and Queueing Theory.

...read moreread less

Book

An Introduction to Support Vector Machines and Other Kernel-based Learning Methods

Nello Cristianini, +1 more

TL;DR: This is the first comprehensive introduction to Support Vector Machines (SVMs), a new generation learning system based on recent advances in statistical learning theory, and will guide practitioners to updated literature, new applications, and on-line software.

...read moreread less

Journal ArticleDOI

Practical Methods of Optimization.

Christoph Witzgall, +1 more

- 01 Oct 1989 -

Mathematics of Computation

Collapse

A support vector machine-based dynamic network for visual speech recognition applications

Citations

Recent advances in the automatic recognition of audiovisual speech

Audio-Visual Biometrics

Visual speech recognition with loosely synchronized feature streams

Joint audio-visual speech processing for recognition and enhancement.

Articulatory features for robust visual speech recognition

References

Statistical learning theory

A Tutorial on Support Vector Machines for Pattern Recognition

Probability, random variables and stochastic processes

An Introduction to Support Vector Machines and Other Kernel-based Learning Methods

Practical Methods of Optimization.

Related Papers (5)

Recent advances in the automatic recognition of audiovisual speech

Hearing lips and seeing voices

Audio-visual speech modeling for continuous speech recognition

Automatic lipreading to enhance speech recognition (speech reading)

Snakes : Active Contour Models