Book Chapter
Perception of Synthetic Visual Speech
Michael M. Cohen, Rachel Walker, Dominic W. Massaro +2 more
- pp 153-168
TLDR
Recognition of the synthetic talker is reasonably close to that of the human talker, but a significant distance remains to be covered, and improvements to the synthetic phoneme specifications are discussed.
Abstract:
We report here on an experiment comparing visual recognition of monosyllabic words produced either by our computer-animated talker or a human talker. Recognition of the synthetic talker is reasonably close to that of the human talker, but a significant distance remains to be covered and we discuss improvements to the synthetic phoneme specifications. In an additional experiment using the same paradigm, we compare perception of our animated talker with a similarly generated point-light display, finding significantly worse performance for the latter for a number of viseme classes. We conclude with some ideas for future progress and briefly describe our new animated tongue.
Citations
Journal Article
Development and evaluation of a computer-animated tutor for vocabulary and language learning in children with autism.
TL;DR: The research indicates that children with autism are capable of learning new language within an automated program centered around a computer-animated agent, multimedia, and active participation and can transfer and use the language in a natural, untrained environment.
Journal Article
Three-dimensional linear articulatory modeling of tongue, lips and face, based on MRI and video images
Pierre Badin, Gérard Bailly, Lionel Reveret, Monica Baciu, Christoph Segebarth, Christophe Savariaux +5 more
TL;DR: The geometry of these vocal organs is measured on one subject uttering a corpus of sustained articulations in French to imply that most 3D features such as tongue groove or lateral channels can be controlled by articulatory parameters defined for the midsagittal model.
Book Chapter
Developing and evaluating conversational agents
TL;DR: The use of the agent is expanded in educational and therapeutic environments, as in the learning of non-native languages and in learning to read, and to create a human–computer interface centered on a virtual, conversational agent.
Journal Article
Attention to Facial Regions in Segmental and Prosodic Visual Speech Perception Tasks
TL;DR: The results indicate that information in the upper part of the talker's face is more critical for intonation pattern decisions than for decisions about word segments or primary sentence stress, thus supporting the Gaze Direction Assumption.
Picture My Voice: Audio to Visual Speech Synthesis using Artificial Neural Networks
TL;DR: Through a series of audiovisual perceptual experiments with noise-degraded audio, it is demonstrated that the animated talking head provides significantly increased intelligibility over the audio-only case, in some cases not significantly below that provided by a natural face.