scispace - formally typeset

Viseme

About: Viseme is a research topic. Over the lifetime, 865 publications have been published within this topic receiving 17889 citations.


Papers
Book ChapterDOI
01 Jan 2009
TL;DR: Early speech perception studies sought to determine which speech sound contrasts infants could detect. Research has since shown that infants can discriminate a wide range of speech sounds and that, by 12 months, they categorically perceive speech sounds; segment units from the speech stream; learn about legal sound combinations, rhythm, and stress; and track statistical properties of the speech input.
Abstract: Speech perception proceeds by extracting acoustic cues and mapping them onto linguistic information. Early speech perception studies sought to determine which speech sound contrasts infants could detect. Over the past few decades, research has shown that young infants can discriminate a wide range of speech sounds, and by 12 months, infants categorically perceive speech sounds; segment units from the speech stream; learn about legal sound combinations, rhythm, and stress; and track statistical properties of the speech input. Infants then use this knowledge to begin extracting and learning words. This article reviews infant speech abilities over the first 2 years of life, discusses theoretical accounts, and outlines some challenges.

2 citations

Book
01 Jan 1999
TL;DR: The present work focuses on Speech and Voice Perception, Speech Production and Perception Models and their Applications to Synthesis, Recognition, and Coding.
Abstract: Section 1 - Fundamentals of Speech Analysis and Perception:
- Articulatory Constraints on Distinctive Features
- "Herr Muller vivra a Taranto con i suoi colleghi austriaci": Investigations on a Fragment of Italian Phonology
- Acoustic Analysis and Perception of Classes of Sounds (Vowels and Consonants)
- Speech and Voice Perception: Beyond Pattern Recognition
Section 2 - Speech Processing:
- Analysis in Automatic Recognition of Speech
- Speech Production and Perception Models and their Applications to Synthesis, Recognition, and Coding
Section 3 - Stochastic Models for Speech:
- Statistical Methods for Automatic Speech Recognition
- Statistical Modelling: from Speech Recognition to Text Translation
- Continuous Speech Recognition with Neural Networks: An Application to Railway Timetable Enquiries
- Multi-Level Multi-Decision Model for Automatic Speech Recognition and Understanding
- Generative Models for Automatic Speech Recognition, Understanding and Synthesis
- Speech Modelling Virtual Laboratory
Section 4 - Auditory and Neural Network Models for Speech:
- Auditory Modeling and Neural Networks
- Neural Networks for Automatic Speech Recognition: a Review
- Preprocessing and Classification of English Stops, Nasals and Fricatives
- Self-Organizing Feature Maps for Arabic Phonemes
Section 5 - Task-Oriented Applications of Automatic Speech Recognition and Synthesis:
- Towards Fully Automatic Speech Processing Techniques for Interactive Voice Servers
- Multi-modal Speech Synthesis with Applications
- Author Index

2 citations

01 Sep 2008
TL;DR: A parameterisation of lip movements is described which maintains the dynamic structure inherent in the task of producing speech sounds and is believed to be appropriate to various areas of speech modeling, in particular the synthesis of speech lip movements.
Abstract: In this paper we describe a parameterisation of lip movements which maintains the dynamic structure inherent in the task of producing speech sounds. A stereo capture system is used to reconstruct 3D models of a speaker producing sentences from the TIMIT corpus. This data is mapped into a space which maintains the relationships between samples and their temporal derivatives. By incorporating dynamic information within the parameterisation of lip movements we can model the cyclical structure, as well as the causal nature of speech movements as described by an underlying visual speech manifold. It is believed that such a structure will be appropriate to various areas of speech modeling, in particular the synthesis of speech lip movements.
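The paper's exact parameterisation is not reproduced here, but the core idea of pairing each sample with its temporal derivative can be sketched as below. This is a minimal illustration using finite differences; the function name, frame dimensionality, and toy trajectory are all invented for the example, not taken from the paper.

```python
import numpy as np

def augment_with_deltas(frames, dt=1.0):
    """Append finite-difference temporal derivatives to each frame, so the
    representation captures both the lip pose and its motion. Uses central
    differences in the interior and one-sided differences at the endpoints."""
    frames = np.asarray(frames, dtype=float)
    deltas = np.gradient(frames, dt, axis=0)   # per-frame derivative estimate
    return np.concatenate([frames, deltas], axis=1)

# Toy example: 4 frames of a 2-D lip-parameter trajectory.
traj = [[0.0, 1.0],
        [1.0, 1.5],
        [2.0, 2.0],
        [3.0, 2.5]]
aug = augment_with_deltas(traj)
print(aug.shape)  # (4, 4): each frame now carries its derivative as well
```

Embedding samples jointly with their derivatives is what lets a model represent the cyclical, causal structure of the movements rather than isolated static poses.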

2 citations

Book ChapterDOI
Guy Mercier, A. Cozannet, J. Vaissière
01 Jan 1988
TL;DR: A description of the speaker-dependent continuous speech understanding system KEAL-NEVEZH, an extension of the KEAL system, connected to ALOEMDA, an active chart parser modifying its strategy and linguistic capabilities.
Abstract: A description of the speaker-dependent continuous speech understanding system KEAL-NEVEZH is given. An unknown utterance is recognized by means of the following procedures: acoustic analysis, phonetic segmentation and identification, and word and sentence analysis. This new system is an extension of the KEAL system, connected to ALOEMDA, an active chart parser modifying its strategy and linguistic capabilities.

2 citations

Proceedings ArticleDOI
01 Nov 2011
TL;DR: This work presents a method to synchronize the image and the speech, using Microsoft's Speech Application Programming Interface (SAPI) as the speech synthesis tool.
Abstract: Synchronization between speech and mouth shape draws on several technologies, including computer vision, speech synthesis, and speech recognition. We present a method to synchronize the image and the speech, using Microsoft's Speech Application Programming Interface (SAPI) as the speech synthesis tool. Speech animation has two components: the speech and the image. The speech output is obtained from Text-to-Speech (TTS), and the viseme images are generated with the software FaceGen Modeller; three key pictures are imported into this software to calibrate and generate the face model. A viseme event handler written in C# connects each viseme to its mouth-shape image, so that as the images are loaded sequentially, each viseme is matched with the correct image. The main applications of speech synthesis are assistive devices, e.g. screen readers for people with visual impairment; a mute person can also use this technology to talk to others. In recent years, speech synthesis has been applied widely in service robotics and in entertainment productions such as language learning, education, video games, animations, and music videos.
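The viseme-to-image dispatch the abstract describes can be sketched as follows. SAPI numbers visemes 0-21 (0 is silence), but the image filenames, the handler signature, and the simulated event stream below are invented for illustration; the paper's actual handler is written in C# against SAPI's viseme event.

```python
# Hypothetical mapping from SAPI viseme IDs to mouth-shape images.
# The filenames are placeholders; a real system would map all 22 IDs.
VISEME_IMAGES = {
    0: "silence.png",    # rest position
    1: "ae_ax_ah.png",   # open vowels
    2: "aa.png",
    21: "p_b_m.png",     # closed lips
}

def on_viseme(viseme_id, display=print):
    """Handler invoked once per TTS viseme event: look up the mouth image
    for the viseme and show it, falling back to the rest position."""
    image = VISEME_IMAGES.get(viseme_id, VISEME_IMAGES[0])
    display(f"showing {image}")
    return image

# Simulated viseme event stream for a short utterance; in the real system
# these events are raised by the TTS engine as it speaks.
events = [21, 1, 2, 0]
shown = [on_viseme(v, display=lambda s: None) for v in events]
print(shown)
```

The essential design point is that the TTS engine, not the animation code, drives the timing: each viseme event arrives already synchronized with the audio, so swapping images in the handler keeps mouth shape and speech aligned for free.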

2 citations


Network Information
Related Topics (5)
Vocabulary
44.6K papers, 941.5K citations
78% related
Feature vector
48.8K papers, 954.4K citations
76% related
Feature extraction
111.8K papers, 2.1M citations
75% related
Feature (computer vision)
128.2K papers, 1.7M citations
74% related
Unsupervised learning
22.7K papers, 1M citations
73% related
Performance
Metrics
No. of papers in the topic in previous years
Year  Papers
2023  7
2022  12
2021  13
2020  39
2019  19
2018  22