Book Chapter

Perception of Synthetic Visual Speech

TLDR
Recognition of the synthetic talker is reasonably close to that of the human talker, but a significant distance remains to be covered and improvements to the synthetic phoneme specifications are discussed.
Abstract
We report here on an experiment comparing visual recognition of monosyllabic words produced either by our computer-animated talker or a human talker. Recognition of the synthetic talker is reasonably close to that of the human talker, but a significant distance remains to be covered and we discuss improvements to the synthetic phoneme specifications. In an additional experiment using the same paradigm, we compare perception of our animated talker with a similarly generated point-light display, finding significantly worse performance for the latter for a number of viseme classes. We conclude with some ideas for future progress and briefly describe our new animated tongue.


Citations
Journal Article

Development and evaluation of a computer-animated tutor for vocabulary and language learning in children with autism.

TL;DR: The research indicates that children with autism are capable of learning new language within an automated program centered around a computer-animated agent, multimedia, and active participation and can transfer and use the language in a natural, untrained environment.
Journal Article

Three-dimensional linear articulatory modeling of tongue, lips and face, based on MRI and video images

TL;DR: The geometry of these vocal organs is measured on one subject uttering a corpus of sustained articulations in French; the results imply that most 3D features, such as tongue groove or lateral channels, can be controlled by articulatory parameters defined for the midsagittal model.
Book Chapter

Developing and evaluating conversational agents

TL;DR: The use of the agent is expanded in educational and therapeutic environments, as in the learning of non-native languages and in learning to read, and to create a human–computer interface centered on a virtual, conversational agent.
Journal Article

Attention to Facial Regions in Segmental and Prosodic Visual Speech Perception Tasks

TL;DR: The results indicate that information in the upper part of the talker's face is more critical for intonation pattern decisions than for decisions about word segments or primary sentence stress, thus supporting the Gaze Direction Assumption.

Picture My Voice : Audio to Visual Speech Synthesis using Artificial Neural Networks

TL;DR: Through a series of audiovisual perceptual experiments with noise-degraded audio, it is demonstrated that the animated talking head provides significantly increased intelligibility over the audio-only case, in some cases not significantly below that provided by a natural face.