
Viseme

About: Viseme is a research topic. Over its lifetime, 865 publications have been published within this topic, receiving 17,889 citations.


Papers
Proceedings ArticleDOI
31 Aug 2015
TL;DR: A lip-reading approach that recognizes the content of an utterance from images is studied and applied to utterance training in Japanese and English.
Abstract: Speech recognition technology is spreading with personal digital assistants such as smartphones. However, we are concerned about the decline in the recognition rate in places with multiple voices and considerable noise. Therefore, we have been studying lip reading, which recognizes the content of an utterance from images. Based on this research, we created a database of utterances by Japanese television announcers and English teachers for utterance training in Japanese and English. Furthermore, applying the technology we developed, we propose a method of utterance training using specific equipment. First, we compared the student's utterance with data in the lip-movement database. Second, we evaluated the effectiveness of the utterance-training equipment.
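As a rough illustration of the comparison step described in this abstract, the sketch below scores a student's utterance against reference entries in a lip-movement database using dynamic time warping over per-frame lip feature vectors. The DTW scoring, the 8-dimensional toy features, and the `dtw_distance` and `database` names are illustrative assumptions, not the authors' method.

```python
import numpy as np

def dtw_distance(ref, student):
    """Dynamic time warping distance between two sequences of
    per-frame lip feature vectors (shape: [frames, features])."""
    n, m = len(ref), len(student)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(ref[i - 1] - student[j - 1])
            cost[i, j] = d + min(cost[i - 1, j],      # skip a reference frame
                                 cost[i, j - 1],      # skip a student frame
                                 cost[i - 1, j - 1])  # match the two frames
    return cost[n, m]

# Toy database: compare a student's clip against each reference utterance
# and report the closest match (lower distance = closer lip motion).
database = {"ka": np.random.rand(30, 8), "sa": np.random.rand(28, 8)}
student_clip = np.random.rand(32, 8)
best = min(database, key=lambda k: dtw_distance(database[k], student_clip))
print(best)
```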
Patent
01 Jul 2008
TL;DR: In this article, a method of providing an animated image of emotions based on a text message was presented, comprising generating a phoneme event, a wave event, or an index event from the typed text message using a text-to-speech engine.
Abstract: A method of providing an image of emotions animation based on a text message, comprising: generating a phoneme event, a wave event or an index event based on the typed text message utilizing a text-to-speech engine; mapping the current phoneme stored via said phoneme event into a viseme according to a mapping table from phoneme to viseme; calculating the needed number of face/lip frames based on the length of the current phoneme's wave data stored via said wave event; and retrieving said needed number of face/lip frames from a model file based on said viseme for output.
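A minimal sketch of the claimed pipeline, mapping a phoneme to a viseme and then computing the number of face/lip frames from the phoneme's wave duration, is shown below. The `PHONEME_TO_VISEME` table, the 25 fps rate, and the toy `model` structure are illustrative assumptions; the patent does not specify them.

```python
FPS = 25  # assumed animation frame rate (illustrative)

# Tiny phoneme-to-viseme mapping table (illustrative subset, not the patent's).
PHONEME_TO_VISEME = {
    "p": "bilabial", "b": "bilabial", "m": "bilabial",
    "f": "labiodental", "v": "labiodental",
    "aa": "open", "iy": "spread", "uw": "rounded",
    "sil": "neutral",
}

def frames_for_phoneme(phoneme, wave_duration_s, model):
    """Map a phoneme to its viseme, compute how many face/lip frames are
    needed to cover the phoneme's wave duration, and pull that many frames
    from a viseme-indexed model."""
    viseme = PHONEME_TO_VISEME.get(phoneme, "neutral")
    n_frames = max(1, round(wave_duration_s * FPS))
    return [model[viseme][i % len(model[viseme])] for i in range(n_frames)]

# Toy "model file": each viseme maps to a short list of pre-rendered frame names.
model = {v: [f"{v}_{i}.png" for i in range(4)]
         for v in set(PHONEME_TO_VISEME.values())}
print(frames_for_phoneme("aa", 0.2, model))  # 5 frames at 25 fps
```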
Proceedings ArticleDOI
11 Nov 2010
TL;DR: The paper proposes a visual speech feature for speaking-mouth images from video that combines lip-shape cues, computed as Euclidean distances between feature points around the inner and outer lip, with a local teeth-texture cue.
Abstract: The paper proposes a visual speech feature for speaking-mouth images from video that combines cues of lip shape and local teeth texture. The geometric feature we propose is based on computing the Euclidean distance between each pair of feature points around the inner and outer lip. The local texture, using the G and B components as a baseline, is employed to calculate color moments that describe the visibility of the teeth. Weighted fusion is used to combine the two features. The k-means algorithm is utilized to analyze feature performance by evaluating the clustering results. The results show that deriving the local texture from the G and B color components to model teeth visibility works better than the alternatives, and that our feature perceives visemes better than PCA and the geometric feature alone.
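The sketch below mirrors the feature construction described above: pairwise Euclidean distances between lip landmarks for the geometric part, and the first three color moments of the G and B channels for the teeth-texture part, fused with fixed weights. The landmark layout, the 0.6/0.4 weights, and the normalization are illustrative assumptions rather than the paper's exact settings.

```python
import numpy as np

def lip_feature(landmarks, mouth_roi, w_geo=0.6, w_tex=0.4):
    """Combine a geometric lip-shape feature with a G/B color-moment
    texture feature (weights are illustrative)."""
    # Geometric part: pairwise Euclidean distances between lip landmarks
    # (inner and outer contour points, shape [n_points, 2]).
    diffs = landmarks[:, None, :] - landmarks[None, :, :]
    dists = np.linalg.norm(diffs, axis=-1)
    geo = dists[np.triu_indices(len(landmarks), k=1)]

    # Texture part: first three color moments (mean, std, skewness) of the
    # green and blue channels inside the mouth region, a proxy for teeth
    # visibility.
    tex = []
    for ch in (1, 2):  # G and B channels of an RGB image
        x = mouth_roi[..., ch].astype(float).ravel()
        mu, sigma = x.mean(), x.std()
        skew = ((x - mu) ** 3).mean() / (sigma ** 3 + 1e-8)
        tex.extend([mu, sigma, skew])
    tex = np.array(tex)

    # Weighted fusion of the normalized parts.
    geo = geo / (np.linalg.norm(geo) + 1e-8)
    tex = tex / (np.linalg.norm(tex) + 1e-8)
    return np.concatenate([w_geo * geo, w_tex * tex])

# Toy usage with random landmarks and a random mouth patch.
pts = np.random.rand(20, 2)                                 # 20 lip points
roi = np.random.randint(0, 256, (32, 48, 3), dtype=np.uint8)
print(lip_feature(pts, roi).shape)                          # (196,) = 190 + 6
```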
01 Jan 2001
TL;DR: Comparisons of mouth shapes generated from the artificially generated control points with control points estimated from video not used to train the HMMs indicate that the process produces accurate control points for the trisemes tested.
Abstract: This paper addresses a problem often encountered when estimating control points used in visual speech synthesis. First, Hidden Markov Models (HMMs) are estimated for each viseme present in stored video data. Second, models are generated for each triseme (a viseme in context with the previous and following visemes) in the training set. Next, a decision tree is used to cluster and relate states in the HMMs that are similar in a contextual and statistical sense. The tree is also used to estimate HMMs for any trisemes that are not present in the stored video data when control points for such trisemes are required for synthesizing the lip motion for a sentence. Finally, the HMMs are used to generate sequences of visual speech control points for those trisemes not occurring in the stored data. Comparisons of mouth shapes generated from the artificially generated control points with control points estimated from video not used to train the HMMs indicate that the process estimates accurate control points for the trisemes tested. This paper thus establishes a useful method for synthesizing realistic audio-synchronized video facial features.
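As a simplified sketch of the context handling described above, the code below expands a viseme sequence into triseme labels and backs off to a context-independent viseme model when a triseme was not seen in training. The paper's decision-tree state clustering is replaced here by a plain dictionary fallback for brevity, and all names are illustrative.

```python
def trisemes(viseme_seq, pad="sil"):
    """Expand a viseme sequence into triseme labels
    (previous-current+next context)."""
    padded = [pad] + list(viseme_seq) + [pad]
    return [f"{padded[i-1]}-{padded[i]}+{padded[i+1]}"
            for i in range(1, len(padded) - 1)]

def control_points(triseme, trained_models, fallback_models):
    """Return control-point trajectories for a triseme, backing off to a
    context-independent viseme model when the triseme never occurred in the
    training video (the paper uses decision-tree state clustering for this
    back-off; a dictionary fallback stands in for it here)."""
    if triseme in trained_models:
        return trained_models[triseme]
    center = triseme.split("-")[1].split("+")[0]
    return fallback_models[center]

print(trisemes(["sil", "p", "aa", "t"]))
# ['sil-sil+p', 'sil-p+aa', 'p-aa+t', 'aa-t+sil']
```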

Network Information

Related Topics (5)
Vocabulary: 44.6K papers, 941.5K citations, 78% related
Feature vector: 48.8K papers, 954.4K citations, 76% related
Feature extraction: 111.8K papers, 2.1M citations, 75% related
Feature (computer vision): 128.2K papers, 1.7M citations, 74% related
Unsupervised learning: 22.7K papers, 1M citations, 73% related
Performance Metrics

No. of papers in the topic in previous years:
Year: Papers
2023: 7
2022: 12
2021: 13
2020: 39
2019: 19
2018: 22