TRAPS - classifiers of temporal patterns.

Open AccessProceedings Article

TRAPS - classifiers of temporal patterns.

Chats0

TLDR

The work proposes a radically different set of features for ASR where TempoRAl Patterns of spectral energies are used in place of the conventional spectral patterns.

Abstract:

The work proposes a radically di erent set of features for ASR where TempoRAl Patterns of spectral energies are used in place of the conventional spectral patterns The approach has several inherent advantages, among them robustness to stationary or slowly varying disturbances

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Acoustic Modeling Using Deep Belief Networks

Abdelrahman Mohamed, +2 more

- 01 Jan 2012 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: It is shown that better phone recognition on the TIMIT dataset can be achieved by replacing Gaussian mixture models by deep neural networks that contain many layers of features and a very large number of parameters.

...read moreread less

Journal ArticleDOI

An overview of noise-robust automatic speech recognition

Jinyu Li, +3 more

- 01 Apr 2014 -

IEEE Transactions on Audio, Speech, and ...

TL;DR: A thorough overview of modern noise-robust techniques for ASR developed over the past 30 years is provided and methods that are proven to be successful and that are likely to sustain or expand their future applicability are emphasized.

...read moreread less

Journal ArticleDOI

Automatic speech recognition and speech variability: A review

Mohamed Faouzi BenZeghiba, +12 more

- 01 Oct 2007 -

Speech Communication

TL;DR: Current advances related to automatic speech recognition (ASR) and spoken language systems and deficiencies in dealing with variation naturally present in speech are outlined.

...read moreread less

Proceedings ArticleDOI

Probabilistic and Bottle-Neck Features for LVCSR of Meetings

Frantisek Grezl, +3 more

TL;DR: This work is exploring the possibility of obtaining the features directly from neural net without the necessity of converting output probabilities to features suitable for subsequent GMM-HMM system.

...read moreread less

Proceedings Article

Improved Bottleneck Features Using Pretrained Deep Neural Networks.

Dong Yu, +1 more

TL;DR: This paper shows how the use of unsupervised pretraining of a DNN enhances the network’s discriminative power and improves the bottleneck features it generates, and shows that a neural networktrained to predict context-dependent senone targets produces better bottleneck features than one trained to predict monophone states.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Perceptual linear predictive (PLP) analysis of speech

Hynek Hermansky

- 01 Apr 1990 -

Journal of the Acoustical Society of Ame...

TL;DR: A new technique for the analysis of speech, the perceptual linear predictive (PLP) technique, which uses three concepts from the psychophysics of hearing to derive an estimate of the auditory spectrum, and yields a low-dimensional representation of speech.

...read moreread less

Book

Connectionist Speech Recognition: A Hybrid Approach

Hervé Bourlard, +1 more

TL;DR: Connectionist Speech Recognition: A Hybrid Approach describes the theory and implementation of a method to incorporate neural network approaches into state-of-the-art continuous speech recognition systems based on Hidden Markov Models (HMMs) to improve their performance.

...read moreread less

Journal ArticleDOI

How do humans process and recognize speech

Jont B. Allen

- 01 Oct 1994 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: Until the performance of automatic speech recognition (ASR) hardware surpasses human performance in accuracy and robustness, the authors stand to gain by understanding the basic principles behind human Speech recognition (HSR).

...read moreread less

Book

Three areas of experimental phonetics

Peter Ladefoged

Proceedings ArticleDOI

A mew ASR approach based on independent processing and recombination of partial frequency bands

Hervé Bourlard, +1 more

TL;DR: The preliminary results presented in this paper show that such an approach, even using quite simple recombination strategies, can yield at least comparable performance on clean speech while providing better robustness in the case of noisy speech.

...read moreread less