Topic
Viseme
About: Viseme is a research topic. Over its lifetime, 865 publications have been published on this topic, receiving 17,889 citations.
Papers published on a yearly basis
Papers
TL;DR: The best multimodal system, which combines the two acoustic cues with the visual cue, improves recognition of the place-of-articulation (POA) and manner-of-articulation (MOA) categories by 3% and of vowels by 2%.
10 citations
11 Apr 2002
TL;DR: A new system for the recognition of visual speech based on support vector machines, which have proved to be powerful classifiers in other visual tasks, is proposed; it offers the advantage of easy generalization to large-vocabulary recognition tasks because it uses viseme models rather than entire-word models.
Abstract: Speech recognition based on visual information is an emerging research field. We propose here a new system for the recognition of visual speech based on support vector machines, which proved to be powerful classifiers in other visual tasks. We use support vector machines to recognize the mouth shape corresponding to different phones produced. To model the temporal character of the speech we employ the Viterbi decoding in a network of support vector machines. The recognition rate obtained is higher than those reported earlier when the same features were used. The proposed solution offers the advantage of an easy generalization to large vocabulary recognition tasks due to the use of viseme models, as opposed to entire word models.
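The temporal modelling described above, Viterbi decoding over a network of frame-wise classifiers, can be sketched as follows. This is a minimal illustration only: the state names, per-frame scores, and transition probabilities below are invented, and the paper's actual SVM outputs and network topology are not reproduced.

```python
import math

def viterbi(frame_scores, trans, init):
    """frame_scores: list of {state: log-likelihood}, one dict per video frame.
    trans: {(prev_state, cur_state): log transition probability}.
    init: {state: log initial probability}.
    Returns the most likely state (viseme) sequence."""
    states = list(init)
    # best[t][s] = (best log-prob of any path ending in state s at frame t,
    #               backpointer to the previous state on that path)
    best = [{s: (init[s] + frame_scores[0][s], None) for s in states}]
    for t in range(1, len(frame_scores)):
        col = {}
        for s in states:
            prev, score = max(
                ((p, best[t - 1][p][0] + trans[(p, s)]) for p in states),
                key=lambda x: x[1])
            col[s] = (score + frame_scores[t][s], prev)
        best.append(col)
    # Backtrace from the best final state.
    last = max(states, key=lambda s: best[-1][s][0])
    path = [last]
    for t in range(len(frame_scores) - 1, 0, -1):
        path.append(best[t][path[-1]][1])
    return path[::-1]

# Toy example with two viseme states "A" and "B" over three frames;
# in the paper, the per-frame scores would come from SVM outputs.
frames = [{"A": math.log(0.9), "B": math.log(0.1)},
          {"A": math.log(0.4), "B": math.log(0.6)},
          {"A": math.log(0.1), "B": math.log(0.9)}]
trans = {("A", "A"): math.log(0.7), ("A", "B"): math.log(0.3),
         ("B", "A"): math.log(0.3), ("B", "B"): math.log(0.7)}
init = {"A": math.log(0.5), "B": math.log(0.5)}
print(viterbi(frames, trans, init))  # → ['A', 'B', 'B']
```

The self-transition probabilities (0.7 here) make the decoder prefer staying in the same viseme across frames, which smooths out isolated misclassifications from the per-frame classifier.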
10 citations
TL;DR: The authors use a structured approach for devising speaker-dependent viseme classes, which enables the creation of a set of phoneme-to-viseme maps where each has a different quantity of visemes ranging from two to 45.
Abstract: In machine lip-reading there is continued debate and research around the correct classes to be used for recognition. In this paper we use a structured approach for devising speaker-dependent viseme classes, which enables the creation of a set of phoneme-to-viseme maps where each has a different quantity of visemes ranging from two to 45. Viseme classes are based upon the mapping of articulated phonemes, which have been confused during phoneme recognition, into viseme groups. Using these maps, with the LiLIR dataset, we show the effect of changing the viseme map size in speaker-dependent machine lip-reading, measured by word recognition correctness and so demonstrate that word recognition with phoneme classifiers is not just possible, but often better than word recognition with viseme classifiers. Furthermore, there are intermediate units between visemes and phonemes which are better still.
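A phoneme-to-viseme map of the kind discussed above is a many-to-one mapping from phoneme labels to viseme classes. The sketch below uses a generic articulation-based grouping for illustration; it is NOT one of the speaker-dependent maps derived from phoneme confusions in the paper, whose classes range from two to 45 visemes.

```python
# Illustrative many-to-one phoneme-to-viseme map (assumed groupings,
# not the paper's speaker-dependent classes).
PHONEME_TO_VISEME = {
    "p": "V_bilabial", "b": "V_bilabial", "m": "V_bilabial",
    "f": "V_labiodental", "v": "V_labiodental",
    "t": "V_alveolar", "d": "V_alveolar", "s": "V_alveolar", "z": "V_alveolar",
    "k": "V_velar", "g": "V_velar",
}

def phonemes_to_visemes(phonemes):
    """Collapse a phoneme sequence into viseme labels; phonemes not in
    the map fall back to a catch-all class."""
    return [PHONEME_TO_VISEME.get(p, "V_other") for p in phonemes]

# "bat" and "mat" start with different phonemes but the same viseme:
# this many-to-one collapse is why word recognition with viseme
# classifiers can lose information relative to phoneme classifiers.
print(phonemes_to_visemes(["b", "a", "t"]))  # → ['V_bilabial', 'V_other', 'V_alveolar']
print(phonemes_to_visemes(["m", "a", "t"]))  # → ['V_bilabial', 'V_other', 'V_alveolar']
```

Changing the number of distinct viseme classes in such a map (from two up to one class per phoneme) is exactly the map-size parameter whose effect on word recognition correctness the paper measures.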
10 citations
TL;DR: This paper shows that visemes, which were defined over a century ago, are unlikely to be optimal for a modern computer lip-reading system, and that computer lip-reading is not heavily constrained by video resolution, pose, lighting and other practical factors.
Abstract: In the quest for greater computer lip-reading performance there are a number of tacit assumptions which are either present in the datasets (high resolution for example) or in the methods (recognition of spoken visual units called visemes for example). Here we review these and other assumptions and show the surprising result that computer lip-reading is not heavily constrained by video resolution, pose, lighting and other practical factors. However, the working assumption that visemes, which are the visual equivalent of phonemes, are the best unit for recognition does need further examination. We conclude that visemes, which were defined over a century ago, are unlikely to be optimal for a modern computer lip-reading system.
10 citations