scispace - formally typeset
Journal ArticleDOI

Audiovisual Speech Synthesis

Reads0
Chats0
TLDR
This paper presents the main approaches used to synthesize talking faces, and provides greater detail on a handful of these approaches, and an attempt is made to distinguish between facial synthesis itself and the manner in which facial movements are rendered on a computer screen.
Abstract
This paper presents the main approaches used to synthesize talking faces, and provides greater detail on a handful of these approaches. An attempt is made to distinguish between facial synthesis itself (i.e. the manner in which facial movements are rendered on a computer screen), and the way these movements may be controlled and predicted using phonetic input. The two main synthesis techniques (model-based vs. image-based) are contrasted and presented by a brief description of the most illustrative existing systems. The challenging issues—evaluation, data acquisition and modeling—that may drive future models are also discussed and illustrated by our current work at ICP.

read more

Content maybe subject to copyright    Report

Citations
More filters
MonographDOI

Text-to-Speech Synthesis

TL;DR: Text-to-Speech Synthesis provides an in-depth explanation of all aspects of current speech synthesis technology, and is designed for graduate students in electrical engineering, computer science, and linguistics.
Journal ArticleDOI

Audio-driven facial animation by joint end-to-end learning of pose and emotion

TL;DR: This work presents a machine learning technique for driving 3D facial animation by audio input in real time and with low latency, and simultaneously discovers a compact, latent code that disambiguates the variations in facial expression that cannot be explained by the audio alone.
Posted Content

The mesh-matching algorithm: an automatic 3D mesh generator for Finite element structures

TL;DR: A new patient-specific method allowing automatically 3D mesh generation for structures as complex as bone for example is investigated, called the mesh-matching (M-M) algorithm, which generated automatically customized 3D meshes of anatomical structures from an already existing model.

The equilibrium point hypothesis and its application to speech motor control

TL;DR: It is suggested that even when no account is taken of upcoming context, that apparent anticipatory changes in movement amplitude and duration may arise due to dynamics, and that simple linear control signals may underlie smooth articulatory trajectories.
Proceedings ArticleDOI

Task planning for human-robot interaction

TL;DR: This paper intends to develop and experiment various task planners and interaction schemes, that will allow the robot to select and perform its tasks while taking into account explicitly the constraints imposed by the presence of humans, their needs and preferences.
References
More filters
Journal ArticleDOI

Eigenfaces for recognition

TL;DR: A near-real-time computer system that can locate and track a subject's head, and then recognize the person by comparing characteristics of the face to those of known individuals, and that is easy to implement using a neural network architecture.
Journal ArticleDOI

Active appearance models

Abstract: We describe a new method of matching statistical models of appearance to images. A set of model parameters control modes of shape and gray-level variation learned from a training set. We construct an efficient iterative matching algorithm by learning the relationship between perturbations in the model parameters and the induced image errors.
Journal ArticleDOI

Hearing lips and seeing voices

TL;DR: The study reported here demonstrates a previously unrecognised influence of vision upon speech perception, on being shown a film of a young woman's talking head in which repeated utterances of the syllable [ba] had been dubbed on to lip movements for [ga].
Book ChapterDOI

Active Appearance Models

TL;DR: A novel method of interpreting images using an Active Appearance Model (AAM), a statistical model of the shape and grey-level appearance of the object of interest which can generalise to almost any valid example.
Book

Unmasking the face

Paul Ekman