Topic

Word error rate

About: Word error rate is a research topic. Over the lifetime, 11939 publications have been published within this topic receiving 298031 citations.

...read moreread less

Papers published on a yearly basis

1 / 2

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Robust combination of neural networks and hidden Markov models for speech recognition

[...]

Edmondo Trentin¹, Marco Gori•Institutions (1)

University of Siena¹

01 Nov 2003-IEEE Transactions on Neural Networks

TL;DR: Experimental results in speaker-independent, continuous speech recognition over Italian digit-strings validate the novel hybrid framework, allowing for improved recognition performance over HMMs with mixtures of Gaussian components, as well as over Bourlard and Morgan's paradigm.

...read moreread less

Abstract: Acoustic modeling in state-of-the-art speech recognition systems usually relies on hidden Markov models (HMMs) with Gaussian emission densities. HMMs suffer from intrinsic limitations, mainly due to their arbitrary parametric assumption. Artificial neural networks (ANNs) appear to be a promising alternative in this respect, but they historically failed as a general solution to the acoustic modeling problem. This paper introduces algorithms based on a gradient-ascent technique for global training of a hybrid ANN/HMM system, in which the ANN is trained for estimating the emission probabilities of the states of the HMM. The approach is related to the major hybrid systems proposed by Bourlard and Morgan and by Bengio, with the aim of combining their benefits within a unified framework and to overcome their limitations. Several viable solutions to the "divergence problem"-that may arise when training is accomplished over the maximum-likelihood (ML) criterion-are proposed. Experimental results in speaker-independent, continuous speech recognition over Italian digit-strings validate the novel hybrid framework, allowing for improved recognition performance over HMMs with mixtures of Gaussian components, as well as over Bourlard and Morgan's paradigm. In particular, it is shown that the maximum a posteriori (MAP) version of the algorithm yields a 46.34% relative word error rate reduction with respect to standard HMMs.

...read moreread less

76 citations

Proceedings Article•DOI•

A projection extension algorithm for statistical machine translation

[...]

Christoph Tillmann¹•Institutions (1)

IBM¹

11 Jul 2003

TL;DR: A phrase- based unigram model for statistical machine translation that uses a much simpler set of model parameters than similar phrase-based models that has been successfully test on a Chinese-English and an Arabic-English translation task.

...read moreread less

Abstract: In this paper, we describe a phrase-based unigram model for statistical machine translation that uses a much simpler set of model parameters than similar phrase-based models. The units of translation are blocks -- pairs of phrases. During decoding, we use a block unigram model and a word-based trigram language model. During training, the blocks are learned from source interval projections using an underlying high-precision word alignment. The system performance is significantly increased by applying a novel block extension algorithm using an additional high-recall word alignment. The blocks are further filtered using unigram-count selection criteria. The system has been successfully test on a Chinese-English and an Arabic-English translation task.

...read moreread less

76 citations

Journal Article•DOI•

Word boundary detection with mel-scale frequency bank in noisy environment

[...]

Gin-Der Wu¹, Chin-Teng Lin¹•Institutions (1)

National Chiao Tung University¹

01 Sep 2000-IEEE Transactions on Speech and Audio Processing

TL;DR: An adaptive time-frequency (ATF) parameter is proposed for extracting both the time and frequency features of noisy speech signals and a new word boundary detection algorithm is proposed by using a neural fuzzy network for identifying islands of word signals in a noisy environment.

...read moreread less

Abstract: This paper addresses the problem of automatic word boundary detection in the presence of noise. We first propose an adaptive time-frequency (ATF) parameter for extracting both the time and frequency features of noisy speech signals. The ATF parameter extends the TF parameter proposed by Junqua et al. (1994) from single band to multiband spectrum analysis, where the frequency bands help to make the distinction of speech and noise signals clear. The ATF parameter can extract useful frequency information by adaptively choosing proper bands of the mel-scale frequency bank. The ATF parameter increased the recognition rate by about 3% of a TF-based robust algorithm which has been shown to outperform several commonly used algorithms for word boundary detection in the presence of noise. The ATF parameter also reduced the recognition error rate due to endpoint detection to about 20%. Based on the ATF parameter, we further propose a new word boundary detection algorithm by using a neural fuzzy network (called SONFIN) for identifying islands of word signals in a noisy environment. Due to the self-learning ability of SONFIN, the proposed algorithm avoids the need of empirically determining thresholds and ambiguous rules in normal word boundary detection algorithms. As compared to normal neural networks, the SONFIN can always find itself an economic network size in high learning speed. Our results also showed that the SONFIN's performance is not significantly affected by the size of training set. The ATF-based SONFIN achieved higher recognition rate than the TF-based robust algorithm by about 5%. It also reduced the recognition error rate due to endpoint detection to about 10%, compared to an average of approximately 30% obtained with the TF-based robust algorithm, and 50% obtained with the modified version of the Lamel et al. (1981) algorithm.

...read moreread less

76 citations

Journal Article•DOI•

On a New Class of Bounds on Bayes Risk in Multihypothesis Pattern Recognition

[...]

P.A. Devijver

01 Jan 1974-IEEE Transactions on Computers

TL;DR: A new distance is proposed which permits tighter bounds to be set on the error probability of the Bayesian decision rule and which is shown to be closely related to several certainty or separability measures.

...read moreread less

Abstract: An important measure concerning the use of statistical decision schemes is the error probability associated with the decision rule. Several methods giving bounds on the error probability are presently available, but, most often, the bounds are loose. Those methods generally make use of so-cailed distances between statistical distributions. In this paper a new distance is proposed which permits tighter bounds to be set on the error probability of the Bayesian decision rule and which is shown to be closely related to several certainty or separability measures. Among these are the nearest neighbor error rate and the average conditional quadratic entropy of Vajda. Moreover, our distance bears much resemblance to the information theoretic concept of equivocation. This relationship is discussed. Comparison is made between the bounds on the Bayes risk obtained with the Bhattacharyya coefficient, the equivocation, and the new measure which we have named the Bayesian distance.

...read moreread less

76 citations

Journal Article•DOI•

Word age-of-acquisition, reading latencies and auditory recognition

[...]

Kenneth Gilhooly¹, Robert H. Logie¹•Institutions (1)

University of Aberdeen¹

01 Sep 1981-Current Psychology

TL;DR: In this paper, the effects of age-of-acquisition on word naming speed and auditory recognition of words presented at a low volume were investigated and the results are interpreted as supporting the view that the age of acquisition variable mainly affects word production and has little effect on word recognition processes.

...read moreread less

Abstract: This paper reports two experiments concerning the effects of word age-of-acquisition on word naming speed and auditory recognition of words presented at a low volume. The first experiment found significant facilitating effects of word age-of-acquisition in word naming even when word length, frequency and familiarity were taken into account. The second experiment found no evidence of age-of-acquisition effects in auditory word recognition. The results are interpreted as supporting the view that the age-of-acquisition variable mainly affects word production and has little effect on word recognition processes.

...read moreread less

76 citations

Collapse

Network Information

Performance

Metrics

12,777

Papers

335,740

Citations

No. of papers in the topic in previous years
Year	Papers
2023	271
2022	562
2021	640
2020	643
2019	633
2018	528

Word error rate

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics