scispace - formally typeset
Topic

Viseme

About: Viseme is a research topic. Over the lifetime, 865 publications have been published within this topic receiving 17889 citations.


Papers
Journal ArticleDOI
TL;DR: A new parameter space for sculpting facial expressions for synthetic faces is proposed, which not only offers insight into improving realism of animated faces, but also gives a new way of generating convincing speech animation and blending between several expressions.
Abstract: Facial animation has been a topic of intensive research for more than three decades. Still, designing realistic facial animations remains a challenging task. Several models and tools have been developed so far to automate the design of faces and facial animations synchronized with speech, emotions, and gestures. In this article, we give a brief overview of the existing parameterized facial animation systems. We then turn our attention to facial expression analysis, which we believe is the key to improving realism in animated faces. We report the results of our research on the analysis of facial motion capture data. We use an optical tracking system that extracts the 3D positions of markers attached at specific feature point locations. We capture the movements of these face markers for a talking person. We then form a vector space representation by applying principal component analysis to this data. We call this space “expression and viseme space.” As a result, we propose a new parameter space for sculpting facial expressions for synthetic faces. Such a representation not only offers insight into improving the realism of animated faces, but also gives a new way of generating convincing speech animation and blending between several expressions. Expressive facial animation finds a variety of applications ranging from virtual environments to entertainment and games. With the advances in Internet technology, the development of online sales assistants, Web navigation aides, and Web-based interactive tutors is more promising than ever before. We overview the recent advances in the field of facial animation on the Web, with a detailed look at the requirements for Web-based facial animation systems and various applications. © 2003 Wiley Periodicals, Inc. Int J Imaging Syst Technol 13: 65–73, 2003; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ima.10043
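The “expression and viseme space” described above can be sketched as PCA over flattened 3D marker trajectories. This is a minimal illustration under stated assumptions, not the authors' code; the function name, marker count, and toy data are invented for the example:

```python
import numpy as np

def expression_space(frames, n_components=5):
    """Project per-frame marker positions into a low-dimensional
    expression/viseme space via PCA (hypothetical helper).

    frames: (n_frames, n_markers * 3) array of flattened 3D marker positions.
    Returns the mean face, the principal-motion basis, and per-frame coordinates.
    """
    mean = frames.mean(axis=0)
    centered = frames - mean
    # SVD of the centered data yields the principal directions of facial motion.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    basis = vt[:n_components]        # (n_components, n_markers * 3)
    coords = centered @ basis.T      # per-frame expression parameters
    return mean, basis, coords

# Toy stand-in data: 100 frames of 10 markers (30 coordinates per frame).
rng = np.random.default_rng(0)
frames = rng.normal(size=(100, 30))
mean, basis, coords = expression_space(frames, n_components=5)
recon = coords @ basis + mean        # approximate reconstruction from the space
```

Sculpting an expression then amounts to choosing a coordinate vector in this space and reconstructing the marker positions from the basis.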

17 citations

Book ChapterDOI
01 Apr 2012

17 citations

Proceedings ArticleDOI
26 Nov 2010
TL;DR: A novel approach to speech recognition with enhanced performance over traditional methods, employing neural networks with LPC, MFCC, and PLP parameters.
Abstract: Many multimedia applications and entertainment industry products like games, cartoons, and film dubbing require speech-driven face animation and audio-video synchronization. An Automatic Speech Recognition (ASR) system alone does not give good results in noisy environments. An Audio-Visual Speech Recognition system plays a vital role in such harsh environments, as it uses both audio and visual information. In this paper, we have proposed a novel approach with enhanced performance over the traditional methods reported so far. Our algorithm works on the basis of acoustic and visual parameters to achieve better results. We have tested our system for the English language using LPC, MFCC, and PLP parameters of the speech. Lip parameters such as lip width and lip height are extracted from the video, and both the acoustic and visual parameters are used to train systems such as Artificial Neural Networks (ANN), Vector Quantization (VQ), Dynamic Time Warping (DTW), and Support Vector Machines (SVM). We have employed neural networks in our research work with LPC, MFCC, and PLP parameters. Results show that our system gives a very good response for the tested vowels.
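One of the techniques the abstract lists, vector quantization over fused acoustic and lip features, can be sketched as a small k-means codebook trainer. This is an illustrative toy, not the paper's implementation; the function name, feature dimensions (13 stand-in MFCC coefficients plus lip width and height), and random data are assumptions:

```python
import numpy as np

def train_codebook(features, k=4, iters=20, seed=0):
    """Toy vector-quantization codebook (k-means) over fused
    audio+visual feature vectors; illustrative only."""
    rng = np.random.default_rng(seed)
    # Initialize codewords from k distinct training vectors.
    codebook = features[rng.choice(len(features), k, replace=False)]
    for _ in range(iters):
        # Assign each feature vector to its nearest codeword.
        dists = np.linalg.norm(features[:, None] - codebook[None], axis=2)
        labels = dists.argmin(axis=1)
        # Move each codeword to the mean of its assigned vectors.
        for j in range(k):
            if np.any(labels == j):
                codebook[j] = features[labels == j].mean(axis=0)
    return codebook, labels

# Fuse per-frame acoustic and lip-geometry features (stand-in random data).
audio = np.random.default_rng(1).normal(size=(50, 13))   # e.g. 13 MFCCs/frame
visual = np.random.default_rng(2).normal(size=(50, 2))   # lip width, lip height
fused = np.concatenate([audio, visual], axis=1)
codebook, labels = train_codebook(fused, k=4)
```

At recognition time, each incoming fused vector would be replaced by the index of its nearest codeword, giving a compact symbol sequence for the downstream recognizer.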

17 citations

Journal ArticleDOI
01 Apr 1986
TL;DR: The algorithms proposed here are composed of simple image processing; it is shown that they work well, and future progress in image processing will make it possible to realize them in real time.
Abstract: Though technology in speech recognition has progressed recently, Automatic Speech Recognition (ASR) is vulnerable to noise. Lip information is thought to be useful for speech recognition in noisy situations, such as in a factory or in a car. This paper describes speech recognition enhancement by lip information. Two types of usage are dealt with. One is the detection of the start and stop of speech from lip information; this is the simplest usage of lip information. The other is lip-pattern recognition, which is used for speech recognition together with sound information. The algorithms for both usages are proposed, and the experimental system shows that they work well. The algorithms proposed here are composed of simple image processing. Future progress in image processing will make it possible to realize them in real time.
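The first usage, detecting the start and stop of speech from lip information, can be sketched as thresholding the lip-opening height and requiring a minimum run of consecutive "open" frames. The threshold, run length, and function name here are illustrative assumptions, not values from the paper:

```python
def detect_speech_span(lip_heights, threshold=2.0, min_run=3):
    """Estimate (start, stop) frame indices of speech from per-frame
    lip-opening heights; a simplified sketch of lip-based endpoint
    detection. Returns (None, None) if no sustained opening is found."""
    active = [h > threshold for h in lip_heights]
    start = stop = None
    # Scan forward for the first run of min_run consecutive open frames.
    run = 0
    for i, a in enumerate(active):
        run = run + 1 if a else 0
        if run >= min_run:
            start = i - min_run + 1
            break
    # Scan backward for the last such run to find the stop frame.
    run = 0
    for i in range(len(active) - 1, -1, -1):
        run = run + 1 if active[i] else 0
        if run >= min_run:
            stop = i + min_run - 1
            break
    return start, stop

span = detect_speech_span([0, 0, 1, 3, 4, 5, 4, 3, 1, 0, 0])  # → (3, 7)
```

Requiring a minimum run makes the detector robust to single-frame measurement noise, which matters given the simple image processing the paper relies on.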

17 citations

Book
01 Nov 2008
TL;DR: Speech sounds: a pictorial guide to typical and atypical speech, as discussed by the authors.
Abstract: Speech sounds: a pictorial guide to typical and atypical speech.

17 citations


Network Information
Related Topics (5)
Vocabulary
44.6K papers, 941.5K citations
78% related
Feature vector
48.8K papers, 954.4K citations
76% related
Feature extraction
111.8K papers, 2.1M citations
75% related
Feature (computer vision)
128.2K papers, 1.7M citations
74% related
Unsupervised learning
22.7K papers, 1M citations
73% related
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2023    7
2022    12
2021    13
2020    39
2019    19
2018    22