scispace - formally typeset
Open AccessBook ChapterDOI

A Temporal Network of Support Vector Machine Classifiers for the Recognition of Visual Speech

TLDR
A new system for the recognition of visual speech based on support vector machines which proved to be powerful classifiers in other visual tasks is proposed, which offers the advantage of an easy generalization to large vocabulary recognition tasks due to the use of viseme models, as opposed to entire word models.
Abstract
Speech recognition based on visual information is an emerging research field We propose here a new system for the recognition of visual speech based on support vector machines which proved to be powerful classifiers in other visual tasks We use support vector machines to recognize the mouth shape corresponding to different phones produced To model the temporal character of the speech we employ the Viterbi decoding in a network of support vector machines The recognition rate obtained is higher than those reported earlier when the same features were used The proposed solution offers the advantage of an easy generalization to large vocabulary recognition tasks due to the use of viseme models, as opposed to entire word models

read more

Content maybe subject to copyright    Report

Citations
More filters
Patent

Dynamic gesture recognition from stereo sequences

TL;DR: In this article, a stereo sequence of images of a subject is obtained and a depth disparity map is generated from the stereo sequence, based on a statistical model of the upper body of the subject.
Journal ArticleDOI

Protein function classification via support vector machine approach.

TL;DR: Support vector machine studies conducted on a number of protein classes including RNA-binding proteins; protein homodimers, proteins responsible for drug absorption, proteins involved in drug distribution and excretion, and drug metabolizing enzymes suggest the usefulness of SVM in the classification of protein functional classes and its potential application in protein function prediction.
Patent

Coupled hidden markov model for audiovisual speech recognition

TL;DR: In this article, a two-stream coupled hidden Markov model is trained and used to identify speech, at least one stream is derived from audio data and a second stream from mouth pattern data.
Journal ArticleDOI

Exploiting the heightened phase synchrony in patients with neuromuscular disease for the establishment of efficient motor imagery BCIs

TL;DR: Estimating the time-resolved phase connectivity patterns induced during a motor imagery (MI) task and adopting a supervised learning scheme to recover the subject's intention from the streaming data reveal increased phase synchrony and richer network organization in patients.
Patent

Audio-visual feature fusion and support vector machine useful for continuous speech recognition

TL;DR: In this article, a speech recognition method includes several embodiments describing application of support vector machine analysis to a mouth region, which can be accurately determined and used in conjunction with synchronous or asynchronous audio data to enhance speech recognition probabilities.
References
More filters

Statistical learning theory

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.
Book

Probability, random variables and stochastic processes

TL;DR: This chapter discusses the concept of a Random Variable, the meaning of Probability, and the axioms of probability in terms of Markov Chains and Queueing Theory.
Book

An Introduction to Support Vector Machines and Other Kernel-based Learning Methods

TL;DR: This is the first comprehensive introduction to Support Vector Machines (SVMs), a new generation learning system based on recent advances in statistical learning theory, and will guide practitioners to updated literature, new applications, and on-line software.
Book

Computer vision

Related Papers (5)