Topic
Word error rate
About: Word error rate is a research topic. Over its lifetime, 11939 publications have been published within this topic, receiving 298031 citations.
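The metric that names this topic can be made concrete. Below is a minimal sketch of word error rate as word-level Levenshtein distance, i.e. (substitutions + deletions + insertions) divided by the number of reference words, computed by dynamic programming; the example sentences are invented for illustration.

```python
def wer(reference, hypothesis):
    """Word error rate: (substitutions + deletions + insertions) / reference length,
    computed as word-level Levenshtein distance via dynamic programming."""
    ref, hyp = reference.split(), hypothesis.split()
    # d[i][j] = edit distance between the first i reference words
    # and the first j hypothesis words
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i          # i deletions
    for j in range(len(hyp) + 1):
        d[0][j] = j          # j insertions
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            sub = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,        # deletion
                          d[i][j - 1] + 1,        # insertion
                          d[i - 1][j - 1] + sub)  # substitution / match
    return d[len(ref)][len(hyp)] / len(ref)

print(wer("the cat sat on the mat", "the cat sat on mat"))  # 1 deletion / 6 words
```

Note that WER can exceed 100% when the hypothesis contains many insertions, which is why heavily degraded systems (like some reported in the papers below) can show error rates above 50%.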
Papers published on a yearly basis
Papers
01 Jan 2014
TL;DR: Two novel frontends for robust language identification (LID) using a convolutional neural network trained for automatic speech recognition (ASR) and the CNN is used to obtain the posterior probabilities for i-vector training and extraction instead of a universal background model (UBM).
Abstract: This paper proposes two novel frontends for robust language identification (LID) using a convolutional neural network (CNN) trained for automatic speech recognition (ASR). In the CNN/i-vector frontend, the CNN is used to obtain the posterior probabilities for i-vector training and extraction instead of a universal background model (UBM). The CNN/posterior frontend is somewhat similar to a phonetic system in that the occupation counts of (tied) triphone states (senones) given by the CNN are used for classification. They are compressed to a low-dimensional vector using probabilistic principal component analysis (PPCA). Evaluated on heavily degraded speech data, the proposed frontends provide significant improvements of up to 50% in average equal error rate compared to a UBM/i-vector baseline. Moreover, the proposed frontends are complementary and give significant gains of up to 20% relative to the best single system when combined.
75 citations
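The first step of the CNN/posterior frontend described above can be sketched as accumulating per-frame senone posteriors into an utterance-level occupation-count vector. The posterior values below are invented for illustration (in the paper they come from an ASR-trained CNN), and the subsequent PPCA compression is omitted.

```python
def occupation_counts(frame_posteriors):
    """Sum posterior probabilities over frames, yielding one occupation
    count per senone for the whole utterance."""
    n_senones = len(frame_posteriors[0])
    counts = [0.0] * n_senones
    for frame in frame_posteriors:
        for s, p in enumerate(frame):
            counts[s] += p
    return counts

# Three invented frames over three senones; each frame's posteriors sum to 1.
posteriors = [[0.7, 0.2, 0.1],
              [0.1, 0.8, 0.1],
              [0.2, 0.2, 0.6]]
print(occupation_counts(posteriors))  # ≈ [1.0, 1.2, 0.8] up to float rounding
```

In the paper, this high-dimensional count vector (one entry per tied triphone state) is then reduced with PPCA before classification.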
TL;DR: New approaches to improve sparse application-specific language models by combining domain-dependent and out-of-domain data are investigated, including a back-off scheme that effectively leads to context-dependent multiple interpolation weights, and a likelihood-based similarity weighting scheme that uses data discriminatively to train a task-specific language model.
Abstract: Standard statistical language modeling techniques suffer from sparse-data problems when applied to real tasks in speech recognition, where large amounts of domain-dependent text are not available. We investigate new approaches to improve sparse application-specific language models by combining domain-dependent and out-of-domain data, including a back-off scheme that effectively leads to context-dependent multiple interpolation weights, and a likelihood-based similarity weighting scheme that uses data discriminatively to train a task-specific language model. Experiments with both approaches on a spontaneous speech recognition task (Switchboard) lead to a reduced word error rate over a domain-specific n-gram language model, giving a larger gain than that obtained with previous brute-force data combination approaches.
75 citations
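The baseline that the paper above improves on is plain linear interpolation of an in-domain and an out-of-domain model. A toy sketch with unigram models, assuming made-up corpora and a fixed weight value; the paper's back-off scheme instead makes the weight depend on the n-gram context.

```python
from collections import Counter

def unigram_model(corpus):
    """Maximum-likelihood unigram model over a whitespace-tokenized corpus."""
    counts = Counter(corpus.split())
    total = sum(counts.values())
    return lambda w: counts[w] / total if total else 0.0

# Invented corpora: a tiny in-domain sample and a broad out-of-domain sample.
in_domain = unigram_model("call transfer my call please transfer")
out_domain = unigram_model("the weather is nice the forecast is nice")

def interpolated(w, lam=0.7):
    # P(w) = lam * P_in(w) + (1 - lam) * P_out(w); lam is chosen arbitrarily here
    return lam * in_domain(w) + (1 - lam) * out_domain(w)

print(interpolated("call"))  # mixes sparse in-domain mass with broad-coverage mass
```

A single global `lam` wastes the out-of-domain data on contexts the in-domain model already covers well, which is the motivation for the context-dependent interpolation weights the paper proposes.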
01 Jan 2003
TL;DR: A method based on the Minimum Description Length principle is used to split words statistically into subword units, allowing efficient language modeling and an unlimited vocabulary; the resulting model outperforms both word- and syllable-based trigram models.
Abstract: We study continuous speech recognition based on subword units found in an unsupervised fashion. For agglutinative languages like Finnish, traditional word-based n-gram language modeling does not work well due to the huge number of distinct word forms. We use a method based on the Minimum Description Length principle to split words statistically into subword units, allowing efficient language modeling and an unlimited vocabulary. Perplexity and speech recognition experiments on Finnish speech data show that the resulting model outperforms both word- and syllable-based trigram models. Compared to the word trigram model, the out-of-vocabulary rate is reduced from 20% to 0% and the word error rate from 56% to 32%.
75 citations
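Why subword units drive the out-of-vocabulary rate to 0%, as in the paper above, can be illustrated with a toy greedy segmenter: any word spellable from the subword inventory is representable, so no word form is ever rejected. The unit inventory below is invented for illustration; the paper learns its units with a Minimum Description Length criterion rather than fixing them by hand.

```python
def segment(word, units):
    """Greedy longest-match segmentation of a word into subword units."""
    parts, i = [], 0
    while i < len(word):
        for length in range(len(word) - i, 0, -1):  # try the longest unit first
            if word[i:i + length] in units:
                parts.append(word[i:i + length])
                i += length
                break
        else:
            return None  # unreachable here: every single character is a unit
    return parts

# Invented inventory: a few multi-character units plus all single letters,
# so every word has at least the fallback spelling letter by letter.
units = {"talo", "ssa", "kin", "auto"} | set("abcdefghijklmnopqrstuvwxyzäö")
print(segment("talossakin", units))  # ['talo', 'ssa', 'kin']
```

With single characters always available as fallback units, the lexicon covers every possible word form, which is the mechanism behind the 20% → 0% OOV reduction reported for Finnish.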
TL;DR: The Hidden-Articulator Markov model (HAMM) as discussed by the authors is an extension of the articulatory-feature model introduced by Erler in 1996, which integrates articulatory information into speech recognition.
75 citations