Showing papers by "John Makhoul" published in 1992

Proceedings Article

BBN BYBLOS and HARC February 1992 ATIS benchmark results

Francis Kubala, C. Barry, Madeleine Bates, Robert J. Bobrow, Pascale Fung, Robert Ingria, John Makhoul, Long Nguyen, Richard Schwartz, David Stallard

23 Feb 1992

TL;DR: Results from the February '92 evaluation on the ATIS travel planning domain for HARC, the BBN spoken language system (SLS), are presented, and the individual performance of BYBLOS, the speech recognition (SPREC) component, is discussed.

Abstract: We present results from the February '92 evaluation on the ATIS travel planning domain for HARC, the BBN spoken language system (SLS). In addition, we discuss in detail the individual performance of BYBLOS, the speech recognition (SPREC) component. In the official scoring, conducted by NIST, BBN's HARC system produced a weighted SLS score of 43.7 on all 687 evaluable utterances in the test set. This was the lowest error achieved by any of the 7 systems evaluated. For the SPREC evaluation BBN's BYBLOS system achieved a word error rate of 6.2% on the same 687 utterances and 9.4% on the entire test set of 971 utterances. These results were significantly better than any other speech system evaluated.
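
For readers unfamiliar with the metric, the word error rate quoted in this abstract is conventionally the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the recognizer output, divided by the number of reference words. The sketch below illustrates that conventional definition only. The abstract does not describe the NIST scoring procedure, so the function name and the sample sentences are hypothetical.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical utterance, not taken from the ATIS test set.
rate = word_error_rate("show me flights to boston", "show me flight to boston")
print(f"{rate:.1%}")
```

One substituted word out of five reference words gives a rate of 20%; because insertions count as errors, the rate can exceed 100% on badly misrecognized input.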

17 citations

Proceedings Article

Continuous speech recognition using segmental neural nets

S. Austin, G. Zavaliagkos, John Makhoul, Richard Schwartz

07 Jun 1992

TL;DR: A hybrid SNN/HMM system has been developed to combine the advantages of both types of approaches and use is made of the N-best paradigm to generate likely phonetic segmentations, which are then scored by the SNN.

Abstract: The authors present the concept of a 'segmental neural net' (SNN) for phonetic modeling in continuous speech recognition (CSR) and show how this can be used, together with a hidden Markov model (HMM) system, to improve CSR. The SNN is a segment-based model that uses a neural network to correlate features of the speech signal throughout the duration of a phonetic segment. The problem of handling phonetic segments of varying length is solved by applying a warping function which provides the neural network inputs with a fixed-length representation of the segment. This method of modeling speech differs from that of HMMs, which assume that speech frames are conditionally independent. To take advantage of the training and decoding speed of HMMs, a hybrid SNN/HMM system has been developed to combine the advantages of both types of approaches. In this hybrid system, use is made of the N-best paradigm to generate likely phonetic segmentations, which are then scored by the SNN. The HMM and SNN scores are then combined to optimize performance.
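
The fixed-length warping step mentioned in this abstract can be illustrated with a small sketch. The abstract does not say which warping function the authors use, so the version below assumes plain linear interpolation along the time axis. The function name `warp_segment`, the output length of 5 frames, and the toy feature values are all hypothetical.

```python
from typing import List, Sequence


def warp_segment(frames: Sequence[Sequence[float]], num_out: int = 5) -> List[List[float]]:
    """Resample a variable-length segment to num_out frames (assumed linear warp)."""
    if not frames:
        raise ValueError("segment must contain at least one frame")
    if num_out < 1:
        raise ValueError("num_out must be at least 1")

    n = len(frames)
    warped = []
    for k in range(num_out):
        # Map output index k onto a fractional position in the input segment.
        pos = k * (n - 1) / (num_out - 1) if num_out > 1 else 0.0
        lo = int(pos)
        hi = min(lo + 1, n - 1)
        frac = pos - lo
        # Interpolate each feature dimension between the two neighbouring frames.
        warped.append([(1 - frac) * a + frac * b for a, b in zip(frames[lo], frames[hi])])
    return warped


# Toy segments of 3 and 7 one-dimensional frames map to the same fixed length.
short = warp_segment([[0.0], [2.0], [4.0]])
long = warp_segment([[float(i)] for i in range(7)])
print(len(short), len(long))
```

Whatever the segment duration, the network would then see `num_out` frames, which is the property the abstract relies on.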

15 citations

Proceedings Article

Design and Performance of HARC, the BBN Spoken Language Understanding System

Madeleine Bates, Robert J. Bobrow, Pascale Fung, Robert Ingria, Francis Kubala, John Makhoul, Long Nguyen, Richard Schwartz, David Stallard

01 Jan 1992

8 citations

Proceedings Article

Robust continuous speech recognition

John Makhoul, Richard Schwartz

23 Feb 1992

TL;DR: Important goals of this work are to achieve the highest possible word recognition accuracy in continuous speech and to develop methods for the rapid adaptation of phonetic models to the voice of a new speaker.

Abstract: The primary objective of this basic research program is to develop robust methods and models for speaker-independent acoustic recognition of spontaneously-produced, continuous speech. The work has focussed on developing accurate and detailed models of phonemes and their coarticulation for the purpose of large-vocabulary continuous speech recognition. Important goals of this work are to achieve the highest possible word recognition accuracy in continuous speech and to develop methods for the rapid adaptation of phonetic models to the voice of a new speaker.

7 citations

Proceedings Article

A Hybrid Neural Net System for State-of-the-Art Continuous Speech Recognition

G. Zavaliagkos¹, Ying Zhao, Richard Schwartz, John Makhoul

Northeastern University¹

30 Nov 1992

TL;DR: A hybrid system that integrates HMM technology with neural networks and presents the concept of a "Segmental Neural Net" (SNN) for phonetic modeling in CSR, which overcomes the well-known conditional-independence limitation of HMMs.

Abstract: Until recently, state-of-the-art, large-vocabulary, continuous speech recognition (CSR) has employed Hidden Markov Modeling (HMM) to model speech sounds. In an attempt to improve over HMM we developed a hybrid system that integrates HMM technology with neural networks. We present the concept of a "Segmental Neural Net" (SNN) for phonetic modeling in CSR. By taking into account all the frames of a phonetic segment simultaneously, the SNN overcomes the well-known conditional-independence limitation of HMMs. In several speaker-independent experiments with the DARPA Resource Management corpus, the hybrid system showed a consistent improvement in performance over the baseline HMM system.

4 citations

Proceedings Article

Improving state-of-the-art continuous speech recognition systems using the N-best paradigm with neural networks

S. Austin, G. Zavaliagkos¹, John Makhoul, R. Schwartz

Northeastern University¹

23 Feb 1992

TL;DR: A hybrid SNN/HMM system that combines the speed and performance of the HMM system with the segmental modeling capabilities of SNNs is described and discriminative training using N-best is demonstrated to improve performance.

Abstract: In an effort to advance the state of the art in continuous speech recognition employing hidden Markov models (HMM), Segmental Neural Nets (SNN) were introduced recently to ameliorate the well-known limitations of HMMs, namely, the conditional-independence limitation and the relative difficulty with which HMMs can handle segmental features. We describe a hybrid SNN/HMM system that combines the speed and performance of our HMM system with the segmental modeling capabilities of SNNs. The integration of the two acoustic modeling techniques is achieved successfully via the N-best rescoring paradigm. The N-best lists are used not only for recognition, but also during training. This discriminative training using N-best is demonstrated to improve performance. When tested on the DARPA Resource Management speaker-independent corpus, the hybrid SNN/HMM system decreases the error by about 20% compared to the state-of-the-art HMM system.
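
The N-best rescoring idea in this abstract can be sketched as follows: a first-pass recognizer proposes a short list of hypotheses, a second model scores each one, and the list is reordered by a combined score. The abstract does not give the combination rule, so the weighted sum below is an assumption. The class `Hypothesis`, the weights, and all scores are hypothetical illustration values, not results from the paper.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Hypothesis:
    words: str
    hmm_score: float  # log-domain score from the first-pass HMM system
    snn_score: float  # log-domain score from the segmental neural net


def rescore(nbest: List[Hypothesis], hmm_weight: float = 1.0,
            snn_weight: float = 1.0) -> List[Hypothesis]:
    """Reorder an N-best list by a weighted sum of the two scores (assumed rule)."""
    def combined(h: Hypothesis) -> float:
        return hmm_weight * h.hmm_score + snn_weight * h.snn_score

    # Higher (less negative) log scores are better, so sort in descending order.
    return sorted(nbest, key=combined, reverse=True)


# Made-up list in which the HMM's first choice is not the combined first choice.
nbest = [
    Hypothesis("show all ships in port", hmm_score=-10.0, snn_score=-9.0),
    Hypothesis("show all chips in port", hmm_score=-9.5, snn_score=-12.0),
]
best = rescore(nbest)[0]
print(best.words)
```

In a setup like this the weights would normally be tuned on held-out data, which is one plausible reading of the abstract's remark that N-best lists are also used during training.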

4 citations

Proceedings Article

BBN real-time speech recognition demonstrations

S. Austin, Rusty Bobrow, Dan Ellard, Robert Ingria, John Makhoul, Long Nguyen, Pat Peterson, P. Placeway, Richard Schwartz

23 Feb 1992

TL;DR: Typically, real-time speech recognition -- if achieved at all -- is accomplished either by greatly simplifying the processing to be done, or by the use of special-purpose hardware.

Abstract: Typically, real-time speech recognition -- if achieved at all -- is accomplished either by greatly simplifying the processing to be done, or by the use of special-purpose hardware. Each of these approaches has obvious problems. The former results in a substantial loss in accuracy, while the latter often results in obsolete hardware being developed at great expense and delay.

1 citation