Topic
Viseme
About: Viseme is a research topic. Over the lifetime, 865 publications have been published within this topic receiving 17889 citations.
Papers published on a yearly basis
Papers
More filters
•
TL;DR: It is concluded that broadly speaking, speakers have the same repertoire of mouth gestures, where they differ is in the use of the gestures.
Abstract: In machine lip-reading, which is identification of speech from visual-only information, there is evidence to show that visual speech is highly dependent upon the speaker [1]. Here, we use a phoneme-clustering method to form new phoneme-to-viseme maps for both individual and multiple speakers. We use these maps to examine how similarly speakers talk visually. We conclude that broadly speaking, speakers have the same repertoire of mouth gestures, where they differ is in the use of the gestures.
15 citations
••
TL;DR: The findings demonstrate that the human face is slower than that of rhesus macaques and the authors' closest living relative, the chimpanzee and support the assertion that human facial musculature and speech co-evolved.
Abstract: Background
While humans (like other primates) communicate with facial expressions, the evolution of speech added a new function to the facial muscles (facial expression muscles). The evolution of speech required the development of a coordinated action between visual (movement of the lips) and auditory signals in a rhythmic fashion to produce “visemes” (visual movements of the lips that correspond to specific sounds). Visemes depend upon facial muscles to regulate shape of the lips, which themselves act as speech articulators. This movement necessitates a more controlled, sustained muscle contraction than that produced during spontaneous facial expressions which occur rapidly and last only a short period of time. Recently, it was found that human tongue musculature contains a higher proportion of slow-twitch myosin fibers than in rhesus macaques, which is related to the slower, more controlled movements of the human tongue in the production of speech. Are there similar unique, evolutionary physiologic biases found in human facial musculature related to the evolution of speech?
15 citations
15 Sep 2015
TL;DR: This paper used a phoneme-clustering method to form new phoneme toviseme maps for both individual and multiple speakers and used these maps to examine how similarly speakers talk visually.
Abstract: In machine lip-reading, which is identification of speech from visual-only information, there is evidence to show that visual speech is highly dependent upon the speaker (Cox et al, 2008). Here, we use a phoneme-clustering method to form new phoneme-to-viseme maps for both individual and multiple speakers. We use these maps to examine how similarly speakers talk visually. We conclude that broadly speaking, speakers have the same repertoire of mouth gestures, where they differ is in the use of the gestures.
15 citations
01 Jan 1999
TL;DR: This paper shows that a language model for speech recognition that can detect and correct speech repairs in English works equally as well on a Japanese corpus of spontaneous speech.
Abstract: One of the characteristics of spontaneous speech is the abundance of speech repairs, in which speakers go back and repeat or change something they have just said. In other work [7], we proposed a language model for speech recognition that can detect and correct speech repairs in English. In this paper, we show that this model works equally as well on a Japanese corpus of spontaneous speech. The structure of the model captures the language independent aspect of speech repairs, while machine training techniques on an annotated corpus learn the language dependent aspects.
15 citations