Topic

Viseme

About: Viseme is a research topic. Over its lifetime, 865 publications have been published within this topic, receiving 17,889 citations.


Papers
Journal ArticleDOI
01 Nov 1935 - Nature
TL;DR: In this paper, a sound-film recording of the word "ash" shows the end of the vowel and the beginning of the consonant overlapping: the vibratory bits that characterise the vowel can be traced, in ever-diminishing strength, into the third line of the reproduction.
Abstract: The piece of sound film reproduced in Fig. 1 shows the last part of the vowel and the first part of the consonant in a registration of "ash". The vibratory bits that characterise the vowel can be traced in ever-diminishing strength to the third line in the reproduction. The mixture of regular and irregular vibrations that characterise the consonant can be traced back to the middle of the first line. The end of the vowel and the beginning of the consonant are seen to overlap.

2 citations

Journal Article
TL;DR: This paper presents a real-time speech-driven talking avatar that can speak with live speech input, with many potential applications in videophones, virtual conferences, audio/video chats and entertainment.
Abstract: This paper presents a real-time speech-driven talking avatar. Unlike most talking avatars, in which the speech-synchronized facial animation is generated offline, this talking avatar is able to speak with live speech input. This life-like talking avatar has many potential applications in videophones, virtual conferences, audio/video chats and entertainment. Since phonemes are the smallest units of pronunciation, a real-time phoneme recognizer was built. Synchronization between the live input speech and the facial motion is handled by a phoneme recognition and output algorithm. Coarticulation effects are included in a dynamic viseme generation algorithm that computes the facial animation parameters (FAPs) from the input phonemes. The MPEG-4 compliant avatar model is driven by the generated FAPs. Tests show that the avatar motion is synchronized and natural, with MOS values of 3.42 and 3.5.

2 citations
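The pipeline described in the entry above, a phoneme recognizer feeding a dynamic viseme generator that accounts for coarticulation, can be sketched in a few lines. This is a minimal illustration assuming a hypothetical phoneme-to-viseme table and a single linear carryover weight; the paper's actual recognizer, FAP computation, and MPEG-4 model are not reproduced here:

```python
# Minimal sketch: map recognized phonemes to visemes and blend neighbours
# to approximate coarticulation. The table and carryover weight below are
# illustrative assumptions, not the paper's implementation.

# Hypothetical phoneme-to-viseme classes.
PHONEME_TO_VISEME = {
    "p": "bilabial", "b": "bilabial", "m": "bilabial",
    "f": "labiodental", "v": "labiodental",
    "aa": "open", "iy": "spread", "uw": "rounded",
    "sil": "neutral",
}

def coarticulated_visemes(phonemes, carryover=0.3):
    """Return per-frame (viseme, weight) pairs.

    On a viseme change, the outgoing viseme keeps `carryover` of its
    influence for one frame, giving a crude coarticulation blend.
    """
    frames, prev = [], None
    for ph in phonemes:
        viseme = PHONEME_TO_VISEME.get(ph, "neutral")
        if prev is not None and prev != viseme:
            frames.append([(prev, carryover), (viseme, 1.0 - carryover)])
        else:
            frames.append([(viseme, 1.0)])
        prev = viseme
    return frames

# Example: a live recognizer emits phonemes for "ma".
print(coarticulated_visemes(["sil", "m", "aa"]))
```

In the paper itself these weights would be turned into MPEG-4 FAPs; the blend here only shows where coarticulation enters the phoneme-to-viseme mapping.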

Proceedings ArticleDOI
12 Oct 2020
TL;DR: This work proposes a method for exaggerated visual-speech feedback in computer-assisted pronunciation training (CAPT) that outperforms the non-exaggerated version in helping learners identify and improve their pronunciation.
Abstract: To provide more discriminative feedback that helps second-language (L2) learners identify their mispronunciations, we propose a method for exaggerated visual-speech feedback in computer-assisted pronunciation training (CAPT). The speech exaggeration is realized by an emphatic speech generation neural network based on Tacotron, while the visual exaggeration is accomplished by ADC Viseme Blending, namely increasing the Amplitude of movement, extending the phone's Duration and enhancing the color Contrast. User studies show that the exaggerated feedback outperforms the non-exaggerated version in helping learners identify and improve their pronunciation.

2 citations
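The ADC Viseme Blending idea in the entry above, increasing Amplitude, extending Duration and enhancing Contrast, reduces to scaling three per-viseme parameters. A minimal sketch follows, assuming hypothetical keyframe fields and gain values; none of these numbers come from the paper:

```python
# Minimal sketch of ADC-style exaggeration on one viseme keyframe.
# Field names and gains are hypothetical placeholders.

from dataclasses import dataclass

@dataclass
class VisemeKeyframe:
    amplitude: float    # mouth-movement magnitude, normalized to [0, 1]
    duration_ms: float  # how long the viseme is held
    contrast: float     # lip-color contrast, normalized to [0, 1]

def adc_exaggerate(kf, amp_gain=1.5, dur_gain=1.3, con_gain=1.4):
    """Amplify movement, extend duration, and boost contrast, clamped to [0, 1]."""
    return VisemeKeyframe(
        amplitude=min(1.0, kf.amplitude * amp_gain),
        duration_ms=kf.duration_ms * dur_gain,
        contrast=min(1.0, kf.contrast * con_gain),
    )

normal = VisemeKeyframe(amplitude=0.5, duration_ms=120.0, contrast=0.6)
print(adc_exaggerate(normal))  # a larger, longer, higher-contrast viseme
```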

Journal IssueDOI
TL;DR: This work proposes a set of algorithms to efficiently produce speech animation for 3D cartoon characters based on blendshapes, a linear interpolation technique widely used in facial animation practice.
Abstract: We propose a set of algorithms to efficiently produce speech animation for 3D cartoon characters. Our prototype system is based on blendshapes, a linear interpolation technique that is widely used in facial animation practice. In our system, a few base target shapes of the character, a prerecorded voice, and its transcription are required as input. We describe a simple technique that amplifies the target shapes from a few inputs using a generic database of viseme mouth shapes. We also introduce additional lip-synch editing parameters that allow designers to quickly tune the lip movements. Based on these, we implement our prototype system as a Maya plug-in. The demonstration movies created with this system illustrate the practicality of our approach. Copyright © 2008 John Wiley & Sons, Ltd.

2 citations
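Blendshape animation, the linear interpolation technique the entry above builds on, computes the deformed mesh as the base plus a weighted sum of target-shape deltas. A minimal sketch with toy vertex data; the shapes and weights are illustrative only, not from the paper:

```python
# Minimal blendshape sketch: result = base + sum_i w_i * (target_i - base).
# The 4-vertex "mouth" and the weights are toy data.

import numpy as np

def blend(base, targets, weights):
    """Linearly interpolate between the base mesh and the target shapes."""
    result = base.copy()
    for target, w in zip(targets, weights):
        result += w * (target - base)
    return result

base = np.zeros((4, 3))                       # rest pose
viseme_aa = np.array([[0.0, -1.0, 0.0]] * 4)  # jaw-open target
viseme_uw = np.array([[0.0, 0.0, 1.0]] * 4)   # lip-protrusion target

# Halfway into an open vowel with slight rounding.
print(blend(base, [viseme_aa, viseme_uw], [0.5, 0.2]))
```

Each viseme mouth shape from the generic database would act as one such target, with the lip-synch editing parameters steering the weights over time.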


Network Information
Related Topics (5)
Vocabulary: 44.6K papers, 941.5K citations (78% related)
Feature vector: 48.8K papers, 954.4K citations (76% related)
Feature extraction: 111.8K papers, 2.1M citations (75% related)
Feature (computer vision): 128.2K papers, 1.7M citations (74% related)
Unsupervised learning: 22.7K papers, 1M citations (73% related)
Performance Metrics
No. of papers in the topic in previous years
Year  Papers
2023  7
2022  12
2021  13
2020  39
2019  19
2018  22