Topic

Word error rate

About: Word error rate is a research topic. Over the lifetime, 11939 publications have been published within this topic receiving 298031 citations.

...read moreread less

Papers published on a yearly basis

1 / 2

Papers

PDF

Open Access

More filters

Patent•DOI•

Speaker adaptation based on lateral tying for large-vocabulary continuous speech recognition

[...]

Jerome R. Bellegarda¹, John W. Butzberger¹, Yen-Lu Chow¹•Institutions (1)

Apple Inc.¹

13 Feb 1996-Journal of the Acoustical Society of America

TL;DR: A system and method for performing speaker adaptation in a speech recognition system which includes a set of reference models corresponding to speech data from a plurality of speakers that is adapted to account for speech pattern idiosyncrasies of the new speaker, thereby reducing the error rate of thespeech recognition system.

...read moreread less

Abstract: A system and method for performing speaker adaptation in a speech recognition system which includes a set of reference models corresponding to speech data from a plurality of speakers. The speech data is represented by a plurality of acoustic models and corresponding sub-events, and each sub-event includes one or more observations of speech data. A degree of lateral tying is computed between each pair of sub-events, wherein the degree of tying indicates the degree to which a first observation in a first sub-event contributes to the remaining sub-events. When adaptation data from a new speaker becomes available, a new observation from adaptation data is assigned to one of the sub-events. Each of the sub-events is then populated with the observations contained in the assigned sub-event based on the degree of lateral tying that was computed between each pair of sub-events. The reference models corresponding to the populated sub-events are then adapted to account for speech pattern idiosyncrasies of the new speaker, thereby reducing the error rate of the speech recognition system.

...read moreread less

141 citations

Journal Article•DOI•

The Probability of Error Due to Intersymbol Interference and Gaussian Noise in Digital Communication Systems

[...]

O. Shimbo, M. Celebiler

01 Apr 1971-IEEE Transactions on Communications

TL;DR: A Gram-Charlier expansion is used to compute the error rate in the presence of intersymbol interference and additive Gaussian noise and the method presented is very useful for numerical computations.

...read moreread less

Abstract: The error rate or the probability of error is an important parameter in the design of digital communication systems. In this paper a Gram-Charlier expansion is used to compute the error rate in the presence of intersymbol interference and additive Gaussian noise. The method presented is very useful for numerical computations. We also present expressions for the truncation errors. Rigorous proofs are presented in the Appendix.

...read moreread less

140 citations

Posted Content•

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context

[...]

Wei Han¹, Zhengdong Zhang¹, Yu Zhang¹, Jiahui Yu², Chung-Cheng Chiu¹, James Qin¹, Anmol Gulati¹, Ruoming Pang¹, Yonghui Wu¹ - Show less +5 more•Institutions (2)

Google¹, Adobe Systems²

07 May 2020-arXiv: Audio and Speech Processing

TL;DR: This paper proposes a simple scaling method that scales the widths of ContextNet that achieves good trade-off between computation and accuracy and demonstrates that on the widely used LibriSpeech benchmark, ContextNet achieves a word error rate of 2.1%/4.6%.

...read moreread less

Abstract: Convolutional neural networks (CNN) have shown promising results for end-to-end speech recognition, albeit still behind other state-of-the-art methods in performance. In this paper, we study how to bridge this gap and go beyond with a novel CNN-RNN-transducer architecture, which we call ContextNet. ContextNet features a fully convolutional encoder that incorporates global context information into convolution layers by adding squeeze-and-excitation modules. In addition, we propose a simple scaling method that scales the widths of ContextNet that achieves good trade-off between computation and accuracy. We demonstrate that on the widely used LibriSpeech benchmark, ContextNet achieves a word error rate (WER) of 2.1%/4.6% without external language model (LM), 1.9%/4.1% with LM and 2.9%/7.0% with only 10M parameters on the clean/noisy LibriSpeech test sets. This compares to the previous best published system of 2.0%/4.6% with LM and 3.9%/11.3% with 20M parameters. The superiority of the proposed ContextNet model is also verified on a much larger internal dataset.

...read moreread less

140 citations

Journal Article•DOI•

Biometric Recognition Based on Free-Text Keystroke Dynamics

[...]

Ahmed Awad E. Ahmed¹, Issa Traore¹•Institutions (1)

University of Victoria¹

01 Apr 2014-IEEE Transactions on Systems, Man, and Cybernetics

TL;DR: This paper presents a new approach for the free text analysis of keystroke that combines monograph and digraph analysis, and uses a neural network to predict missing digraphs based on the relation between the monitored keystrokes.

...read moreread less

Abstract: Accurate recognition of free text keystroke dynamics is challenging due to the unstructured and sparse nature of the data and its underlying variability As a result, most of the approaches published in the literature on free text recognition, except for one recent one, have reported extremely high error rates In this paper, we present a new approach for the free text analysis of keystrokes that combines monograph and digraph analysis, and uses a neural network to predict missing digraphs based on the relation between the monitored keystrokes Our proposed approach achieves an accuracy level comparable to the best results obtained through related techniques in the literature, while achieving a far lower processing time Experimental evaluation involving 53 users in a heterogeneous environment yields a false acceptance ratio (FAR) of 00152% and a false rejection ratio (FRR) of 482%, at an equal error rate (EER) of 246% Our follow-up experiment, in a homogeneous environment with 17 users, yields FAR=0% and FRR=501%, at EER=213%

...read moreread less

138 citations

Proceedings Article•DOI•

Short-time Gaussianization for robust speaker verification

[...]

Bing Xiang¹, Upendra V. Chaudhari¹, Jiri Navratil¹, Ganesh N. Ramaswamy¹, Ramesh A. Gopinath¹ - Show less +1 more•Institutions (1)

IBM¹

13 May 2002

TL;DR: It is shown that one of the recent techniques used for speaker recognition, feature warping can be formulated within the framework of Gaussianization, and around 20% relative improvement in both equal error rate (EER) and minimum detection cost function (DCF) is obtained on NIST 2001 cellular phone data evaluation.

...read moreread less

Abstract: In this paper, a novel approach for robust speaker verification, namely short-time Gaussianization, is proposed. Short-time Gaussianization is initiated by a global linear transformation of the features, followed by a short-time windowed cumulative distribution function (CDF) matching. First, the linear transformation in the feature space leads to local independence or decorrelation. Then the CDF matching is applied to segments of speech localized in time and tries to warp a given feature so that its CDF matches normal distribution. It is shown that one of the recent techniques used for speaker recognition, feature warping [l] can be formulated within the framework of Gaussianization. Compared to the baseline system with cepstral mean subtraction (CMS), around 20% relative improvement in both equal error rate(EER) and minimum detection cost function (DCF) is obtained on NIST 2001 cellular phone data evaluation.

...read moreread less

138 citations

Collapse

Network Information

Performance

Metrics

12,777

Papers

335,740

Citations

No. of papers in the topic in previous years
Year	Papers
2023	271
2022	562
2021	640
2020	643
2019	633
2018	528

Word error rate

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics