
Viseme

About: Viseme is a research topic. Over the lifetime, 865 publications have been published within this topic receiving 17889 citations.


Papers
Patent
08 Jul 2014
TL;DR: A speech recognition device controls one or more pieces of equipment by speech recognition. It comprises a speech acquisition unit that acquires speech uttered by a user, a speech recognition processing unit that converts the acquired speech into character information, and a recognition result determination unit that decides, on the basis of that character information, whether the utterance is addressed to the equipment.
Abstract: A speech recognition device controls one or a plurality of pieces of equipment by speech recognition and is characterized by being provided with: a speech acquisition unit that acquires speech information showing speech uttered by a user; a speech recognition processing unit that recognizes, as character information, speech information acquired by the speech acquisition unit; and a recognition result determination unit that determines whether or not an utterance is to the equipment on the basis of the character information recognized by the speech recognition processing unit.
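The three-unit architecture in this abstract (acquisition, recognition, determination) can be sketched as a simple pipeline. This is a hypothetical illustration, not code from the patent: the keyword table, function names, and the stubbed acquisition/recognition steps are all assumptions.

```python
# Illustrative keyword table: which words mark an utterance as addressed
# to a given piece of equipment (an assumption, not from the patent).
EQUIPMENT_KEYWORDS = {"light": ["light", "lamp"], "tv": ["tv", "television"]}

def acquire_speech() -> bytes:
    # Stand-in for the speech acquisition unit (e.g. a microphone buffer).
    return b"..."

def recognize(audio: bytes) -> str:
    # Stand-in for the speech recognition processing unit, which turns
    # the acquired speech into character information.
    return "turn on the light"

def target_equipment(text: str):
    # Recognition result determination unit: decide from the recognized
    # character information whether the utterance targets some equipment.
    for device, words in EQUIPMENT_KEYWORDS.items():
        if any(w in text.lower() for w in words):
            return device
    return None

print(target_equipment(recognize(acquire_speech())))  # prints: light
```

The separation mirrors the claim structure: each unit has one responsibility, so the determination step can be swapped (keyword match here, a classifier in practice) without touching acquisition or recognition.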

3 citations

Proceedings ArticleDOI
15 Aug 2008
TL;DR: Reviews prior work in digital speech signal processing and shows how existing speech processing techniques can be applied to the proposed algorithms for speech-driven lip motion animation in Japanese-style anime.
Abstract: In the production of Japanese-style anime, lip movement during speech is usually reduced to a simple 'open' and 'close' of the mouth because of the high production cost. In this paper we present an approach to speech-driven lip animation for Japanese-style anime. We first review previous work in digital speech signal processing and show how existing speech processing techniques apply to our task. We then propose our algorithms for speech-driven lip motion animation. Finally, experimental results are presented.
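Since the paper targets a binary 'open'/'close' mouth driven by speech, one minimal realization is short-time energy thresholding on the audio. This is a sketch under assumed parameters (frame length, threshold, sample rate); the paper's actual algorithm is not specified here.

```python
import math

def mouth_states(samples, frame_len=160, threshold=0.02):
    """Return an 'open'/'close' label per frame from RMS energy.

    samples: audio samples in [-1, 1]; frame_len of 160 corresponds to
    20 ms at 8 kHz (an illustrative choice, as is the threshold).
    """
    states = []
    for i in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[i:i + frame_len]
        rms = math.sqrt(sum(s * s for s in frame) / frame_len)
        states.append("open" if rms > threshold else "close")
    return states

# Silence followed by a loud vowel-like burst:
signal = [0.0] * 160 + [0.5 * math.sin(2 * math.pi * 220 * t / 8000)
                        for t in range(160)]
print(mouth_states(signal))  # prints: ['close', 'open']
```

In practice the frame rate would be matched to the animation's frame rate, with hysteresis or smoothing to avoid the mouth flickering between states.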

3 citations

Proceedings Article
01 Jan 1998
TL;DR: SIVHA, a high-quality Spanish speech synthesis system for severely disabled persons, is controlled by their eye movements: it follows the patient's eye-gaze across the screen and constructs the text from the selected words.
Abstract: This paper presents SIVHA, a high-quality Spanish speech synthesis system for severely disabled persons, controlled by their eye movements. The system follows the patient's eye-gaze across the screen and constructs the text from the selected words. When the user considers the message complete, its synthesis can be ordered. The system is divided into three modules: the first determines the point on the screen the user is looking at, the second is an interface for constructing the sentences, and the third is the synthesizer itself.

3 citations

Journal ArticleDOI
TL;DR: Describes a speech–text synchroniser intended to teach the deaf the frequency and intensity characteristics of syllables, so that they can relate speech to text and thus make sense of otherwise confusing lip reading.
Abstract: Describes a speech–text synchroniser intended to teach the deaf the frequency and intensity characteristics of syllables, so that they can relate speech to text and thus make sense of otherwise confusing lip reading.

3 citations

Proceedings ArticleDOI
10 Nov 2003
TL;DR: According to the frame rate to be rendered, intermediate frames are interpolated between key frames so that the animation looks more natural and realistic than results driven by text or speech alone.
Abstract: Creating a realistic 3D talking face has long been a challenge. Previous work drives the talking face with either text or speech alone, and the resulting animation is not very realistic or natural-looking. In the proposed approach, text and speech jointly drive the 3D talking face. The text is translated into a sequence of viseme transcriptions, and a timing vector for the sequence is extracted from the corresponding speech after it is segmented into a phonetic sequence. A muscle-based viseme vector is defined for each static viseme; then, using the timing vector and the static viseme sequence, dynamic visemes are generated through a time-related dominance function. Finally, according to the frame rate to be rendered, intermediate frames are interpolated between key frames, making the animation look more natural and realistic than results driven by text or speech alone.
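The dominance-function step in this abstract can be sketched as follows: each static viseme contributes a bell-shaped weight centred on its time in the utterance, and the animated parameters at any instant are the normalized weighted blend. The Gaussian shape, the width value, and the two-parameter viseme vectors below are illustrative assumptions, not the paper's exact formulation.

```python
import math

def dominance(t, center, width=0.08):
    # Bell-shaped dominance of a viseme around its centre time (seconds).
    return math.exp(-((t - center) / width) ** 2)

def blend(t, visemes):
    """visemes: list of (center_time, param_vector); returns the blended
    parameter vector at time t, weights normalized to sum to 1."""
    weights = [dominance(t, c) for c, _ in visemes]
    total = sum(weights) or 1.0
    dim = len(visemes[0][1])
    return [sum(w * v[i] for w, (_, v) in zip(weights, visemes)) / total
            for i in range(dim)]

# Two static visemes (e.g. jaw-open, lip-round) centred at 0.0 s and 0.2 s,
# as a timing vector extracted from the speech might place them:
seq = [(0.0, [1.0, 0.0]), (0.2, [0.0, 1.0])]
fps = 25
# Intermediate frames interpolated between the key frames at the render rate:
frames = [blend(i / fps, seq) for i in range(6)]  # covers 0.0 .. 0.2 s
```

Because neighbouring dominance curves overlap, each frame is influenced by both surrounding visemes, which is what produces coarticulation-like transitions instead of abrupt switches between key poses.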

3 citations


Network Information
Related Topics (5)
Vocabulary
44.6K papers, 941.5K citations
78% related
Feature vector
48.8K papers, 954.4K citations
76% related
Feature extraction
111.8K papers, 2.1M citations
75% related
Feature (computer vision)
128.2K papers, 1.7M citations
74% related
Unsupervised learning
22.7K papers, 1M citations
73% related
Performance Metrics
No. of papers in the topic in previous years

Year  Papers
2023  7
2022  12
2021  13
2020  39
2019  19
2018  22