Word error rate

About: Word error rate is a research topic. Over its lifetime, 11,939 publications have been published within this topic, receiving 298,031 citations.

Papers published on a yearly basis (chart; x-axis: year, 1969 to 2023)

Papers

Proceedings Article

Phonetic Speaker Recognition with Support Vector Machines

William M. Campbell¹, Joseph P. Campbell¹, Douglas A. Reynolds¹, Douglas A. Jones¹, Tim Leek¹, +1 more

Massachusetts Institute of Technology¹

09 Dec 2003

TL;DR: A new phone-based SVM speaker recognition approach that halves the error rate of conventional phone-based approaches is introduced, and a new kernel based upon a linearization of likelihood ratio scoring is derived.

Abstract: A recent area of significant progress in speaker recognition is the use of high level features—idiolect, phonetic relations, prosody, discourse structure, etc. A speaker not only has a distinctive acoustic sound but uses language in a characteristic manner. Large corpora of speech data available in recent years allow experimentation with long term statistics of phone patterns, word patterns, etc. of an individual. We propose the use of support vector machines and term frequency analysis of phone sequences to model a given speaker. To this end, we explore techniques for text categorization applied to the problem. We derive a new kernel based upon a linearization of likelihood ratio scoring. We introduce a new phone-based SVM speaker recognition approach that halves the error rate of conventional phone-based approaches.

150 citations

Proceedings Article

Maximum Entropy Based Restoration of Arabic Diacritics

Imed Zitouni¹, Jeffrey Sorensen¹, Ruhi Sarikaya¹

IBM¹

17 Jul 2006

TL;DR: A maximum entropy approach for restoring diacritics in a document is proposed; it can easily integrate and make effective use of diverse types of information, and the proposed model integrates a wide array of lexical, segment-based and part-of-speech tag features.

Abstract: Short vowels and other diacritics are not part of written Arabic scripts. Exceptions are made for important political and religious texts and in scripts for beginning students of Arabic. Scripts without diacritics have considerable ambiguity because many words with different diacritic patterns appear identical in a diacritic-less setting. We propose in this paper a maximum entropy approach for restoring diacritics in a document. The approach can easily integrate and make effective use of diverse types of information; the model we propose integrates a wide array of lexical, segment-based and part-of-speech tag features. The combination of these feature types leads to a state-of-the-art diacritization model. Using a publicly available corpus (LDC's Arabic Treebank Part 3), we achieve a diacritic error rate of 5.1%, a segment error rate of 8.5%, and a word error rate of 17.3%. In a case-ending-less setting, we obtain a diacritic error rate of 2.2%, a segment error rate of 4.0%, and a word error rate of 7.2%.

149 citations

Patent

Speech recognition system having word-based and phoneme-based recognition means

Hiroshi Kanazawa¹, Yoichi Takebayashi¹

Toshiba¹

21 Dec 1992 - Journal of the Acoustical Society of America

TL;DR: A speech recognition system includes a parameter extracting section for extracting a speech parameter of input speech, a first recognizing section for performing recognition processing by word-based matching, and a second recognizing section for performing word recognition by matching in units of word constituent elements.

Abstract: A speech recognition system includes a parameter extracting section for extracting a speech parameter of input speech, a first recognizing section for performing recognition processing by word-based matching, and a second recognizing section for performing word recognition by matching in units of word constituent elements. The first word recognizing section segments the speech parameter in units of words to extract a word speech pattern and performs word recognition by matching the word speech pattern with a predetermined word reference pattern. The second word recognizing section performs recognition in units of word constituent elements by using the extracted speech parameter and performs word recognition on the basis of candidates of an obtained word constituent element series. The speech recognition system further includes a recognition result output section for obtaining a recognition result on the basis of the word recognition results obtained by the first and second recognizing sections and outputting the obtained recognition result. The speech recognition system further includes a word reference pattern learning section for performing learning of a word reference pattern on the basis of the recognition result obtained by the recognizing result output section and the word speech pattern.

148 citations

Proceedings Article

Multi-resolution RASTA filtering for TANDEM-based ASR

Hynek Hermansky, Petr Fousek

04 Sep 2005

TL;DR: A new speech representation based on multiple filtering of temporal trajectories of speech energies in frequency sub-bands is proposed and tested; because the applied filters have zero-mean impulse responses, the technique is inherently robust to linear distortions.

Abstract: A new speech representation based on multiple filtering of temporal trajectories of speech energies in frequency sub-bands is proposed and tested. The technique extends earlier works on delta features and RASTA filtering by processing temporal trajectories by a bank of band-pass filters with varying resolutions. In initial tests on the OGI Digits database the technique yields about 30% relative improvement in word error rate over the conventional PLP features. Since the applied filters have zero-mean impulse responses, the technique is inherently robust to linear distortions.

147 citations
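
The abstract above reports "about 30% relative improvement in word error rate". As a rough illustration of what such a figure means, the sketch below computes word error rate in the usual speech recognition sense (word-level edit distance divided by the number of reference words) and the relative improvement between two error rates. The function names and the example numbers are hypothetical and are not taken from the paper; the paper's own scoring setup is not described on this page.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def relative_improvement(baseline_wer: float, new_wer: float) -> float:
    """Fraction by which new_wer is lower than baseline_wer."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - new_wer) / baseline_wer


if __name__ == "__main__":
    # Hypothetical transcripts: one substitution and one deletion.
    print(word_error_rate("one two three four", "one too three"))
    # Hypothetical error rates: 10% baseline, 7% with the new features.
    print(relative_improvement(0.10, 0.07))
```

Under these assumed numbers, a drop from 10% to 7% word error rate is a 3-point absolute reduction but a 30% relative improvement, which is the kind of figure the abstract quotes.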

Proceedings Article

Word Sense Disambiguation vs. Statistical Machine Translation

Marine Carpuat, Dekai Wu

25 Jun 2005

TL;DR: It is found that word sense disambiguation does not yield significantly better translation quality than the statistical machine translation system alone.

Abstract: We directly investigate a subject of much recent debate: do word sense disambiguation models help statistical machine translation quality? We present empirical results casting doubt on this common, but unproved, assumption. Using a state-of-the-art Chinese word sense disambiguation model to choose translation candidates for a typical IBM statistical MT system, we find that word sense disambiguation does not yield significantly better translation quality than the statistical machine translation system alone. Error analysis suggests several key factors behind this surprising finding, including inherent limitations of current statistical MT architectures.

146 citations

Metrics

Papers: 12,777
Citations: 335,740

No. of papers in the topic in previous years
Year	Papers
2023	271
2022	562
2021	640
2020	643
2019	633
2018	528
