An approach to continuous speech recognition based on layered self-adjusting decoding graph

doi:10.1109/ICASSP.1997.598875

Proceedings ArticleDOI

An approach to continuous speech recognition based on layered self-adjusting decoding graph

Qiru Zhou, +1 more

- Vol. 3, pp 1779-1782

Chats0

TLDR

An approach to continuous speech recognition based on a layered self-adjusting decoding graph that utilizes a scaffolding layer to support fast network expansion and releasing and introduces self- adjusting capability in dynamic decoding on a general re-entrant decoding network.

Abstract:

In this paper, an approach to continuous speech recognition based on a layered self-adjusting decoding graph is described. It utilizes a scaffolding layer to support fast network expansion and releasing. A two level hashing structure is also described. It introduces self-adjusting capability in dynamic decoding on a general re-entrant decoding network. In stack decoding, the scaffolding layer in the proposed approach enables the decoder to look several layers into the future so that long span inter-word context dependency can be exactly preserved. Experimental results indicate that highly efficient decoding can be achieved with a significant savings on recognition resources.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Dynamic programming search for continuous speech recognition

Hermann Ney, +1 more

- 01 Sep 1999 -

IEEE Signal Processing Magazine

TL;DR: A unifying view of the dynamic programming approach to the search problem from the statistical point-of-view is given and how the search space results from the acoustic and language models required by the statistical approach are shown.

...read moreread less

Journal ArticleDOI

Progress in dynamic programming search for LVCSR

Hermann Ney, +1 more

TL;DR: This work attempts to review the use of dynamic programming search strategies for large-vocabulary continuous speech recognition (LVCSR) by searching using a lexical tree, language-model look-ahead and word-graph generation.

...read moreread less

Proceedings Article

A Reverse Turing Test using speech

Greg Kochanski, +2 more

TL;DR: This paper describes a Reverse Turing Test using speech and presents a test that depends on the fact that human recognition of distorted speech is far more robust than automatic speech recognition techniques.

...read moreread less

Journal ArticleDOI

Robust decision tree state tying for continuous speech recognition

W. Reichl, +1 more

- 01 Sep 2000 -

IEEE Transactions on Speech and Audio Pr...

TL;DR: A new two-level segmental clustering approach is devised which combines the decision tree based state tying with agglomerative clustering of rare acoustic phonetic events.

...read moreread less

Proceedings ArticleDOI

Progress in dynamic programming search for LVCSR

Hermann Ney, +1 more

TL;DR: This work attempts to review the use of dynamic programming search strategies for large-vocabulary continuous speech recognition (LVCSR) by searching using a lexical tree, language-model look-ahead and word-graph generation.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Book

Principles of Artificial Intelligence

Nils J. Nilsson

TL;DR: This classic introduction to artificial intelligence describes fundamental AI ideas that underlie applications such as natural language processing, automatic programming, robotics, machine vision, automatic theorem proving, and intelligent data retrieval.

...read moreread less

Proceedings ArticleDOI

A one pass decoder design for large vocabulary recognition

JJ Odell, +3 more

TL;DR: This paper shows that time-synchronous one-pass decoding using cross-word triphones and a trigram language model can be implemented using a dynamically built tree-structured network and is relatively efficient in implementation.

...read moreread less

Journal ArticleDOI

A frame-synchronous network search algorithm for connected word recognition

Chin-Hui Lee, +1 more

- 01 Nov 1989 -

IEEE Transactions on Acoustics, Speech, ...

TL;DR: A description is given of an implementation of a novel frame-synchronous network search algorithm for recognizing continuous speech as a connected sequence of words according to a specified grammar that is inherently based on hidden Markov model (HMM) representations.

...read moreread less

Proceedings ArticleDOI

An efficient A* stack decoder algorithm for continuous speech recognition with a stochastic language model

Douglas B. Paul

TL;DR: A modified version of the algorithm which includes the full (forward) decoder, cross-word acoustic models and longer-span language models is described, which has been demonstrated to have a low probability of search error and to be very efficient.

...read moreread less

Proceedings ArticleDOI

An efficient A* stack decoder algorithm for continuous speech recognition with a stochastic language model

Douglas B. Paul

TL;DR: A modified version of the Viterbi A* search algorithm is described which includes the full (forward) decoder, cross-word acoustic models and longer-span language models and is demonstrated to have a low probability of search error and to be very efficient.

...read moreread less

An approach to continuous speech recognition based on layered self-adjusting decoding graph

Citations

Dynamic programming search for continuous speech recognition

Progress in dynamic programming search for LVCSR

A Reverse Turing Test using speech

Robust decision tree state tying for continuous speech recognition

Progress in dynamic programming search for LVCSR

References

Principles of Artificial Intelligence

A one pass decoder design for large vocabulary recognition

A frame-synchronous network search algorithm for connected word recognition

An efficient A* stack decoder algorithm for continuous speech recognition with a stochastic language model

An efficient A* stack decoder algorithm for continuous speech recognition with a stochastic language model

Related Papers (5)

Decision tree state tying based on segmental clustering for acoustic modeling

Estimation of probabilities from sparse data for the language model component of a speech recognizer

Tree-based state tying for high accuracy acoustic modelling

Adding linguistic constraints to document image decoding: comparing the iterated complete path and stack algorithms

A one pass decoder design for large vocabulary recognition