Showing papers by "Michael S. Phillips" published in 1991

Proceedings Article

Integration of speech recognition and natural language processing in the MIT VOYAGER system

Victor W. Zue¹, James Glass¹, David Goodine¹, Hong Leung¹, Michael S. Phillips¹, Joseph Polifroni¹, Stephanie Seneff¹

Massachusetts Institute of Technology¹

14 Apr 1991

TL;DR: Recent attempts at improving the integration between the speech recognition and natural language components are described, using the generation capability of the natural language component to produce a word-pair language model to constrain the recognizer's search space, thus improving the coverage of the overall system.

Abstract: The MIT VOYAGER speech understanding system is an urban exploration and navigation system that interacts with the user through spoken dialogue, text, and graphics. The authors describe recent attempts at improving the integration between the speech recognition and natural language components. They used the generation capability of the natural language component to produce a word-pair language model to constrain the recognizer's search space, thus improving the coverage of the overall system. They also implemented a strategy in which the recognizer generates the top N word strings and passes them along to the natural language component for filtering. Results on performance evaluation are presented.

66 citations
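
The two integration strategies named in the VOYAGER abstract above (a word-pair language model derived from generated sentences, and filtering of the recognizer's top N word strings) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the function names, the `<s>` boundary token, and the toy sentences are hypothetical, and a simple word-pair check stands in for the natural language component, which in the actual system is a parser.

```python
from typing import Callable, Iterable, Optional, Set, Tuple

BOUNDARY = "<s>"  # hypothetical sentence-boundary token, not from the paper


def _pairs(sentence: str) -> list:
    """Return the adjacent word pairs of a sentence, including boundaries."""
    words = [BOUNDARY] + sentence.lower().split() + [BOUNDARY]
    return list(zip(words, words[1:]))


def build_word_pairs(sentences: Iterable[str]) -> Set[Tuple[str, str]]:
    """Collect every adjacent word pair seen in a set of generated sentences."""
    allowed = set()
    for sentence in sentences:
        allowed.update(_pairs(sentence))
    return allowed


def is_allowed(hypothesis: str, allowed: Set[Tuple[str, str]]) -> bool:
    """A word string satisfies the word-pair model if all its pairs are allowed."""
    return all(pair in allowed for pair in _pairs(hypothesis))


def first_accepted(nbest: Iterable[str],
                   accepts: Callable[[str], bool]) -> Optional[str]:
    """Return the highest-ranked hypothesis the filter accepts, else None."""
    for hypothesis in nbest:  # nbest is assumed to be ordered best-first
        if accepts(hypothesis):
            return hypothesis
    return None


# Toy sentences standing in for the output of the generation component.
generated = [
    "where is the library",
    "where is the nearest bank",
    "show me the nearest library",
]
pairs = build_word_pairs(generated)
```

Note that `is_allowed("where is the nearest library", pairs)` holds even though that exact sentence was never generated, because each of its adjacent pairs occurs somewhere in the generated set. A word-pair constraint therefore accepts more strings than the sentences it was built from.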

Proceedings Article

Full integration of speech and language understanding in the MIT spoken language system.

David Goodine, Stephanie Seneff, Lynette Hirschman, Michael S. Phillips

01 Jan 1991

24 citations

Proceedings Article

Development and preliminary evaluation of the MIT ATIS system

Stephanie Seneff, James Glass, David Goddeau, David Goodine, Lynette Hirschman, Hong Leung, Michael S. Phillips, Joseph Polifroni, Victor W. Zue

19 Feb 1991

TL;DR: A status report on the MIT ATIS system, which now has a speech-input mode based on the MIT SUMMIT system using context-independent phone models and a word-pair grammar with perplexity 92 (on the June-90 test set).

Abstract: This paper represents a status report on the MIT ATIS system. The most significant new achievement is that we now have a speech-input mode. It is based on the MIT SUMMIT system using context independent phone models, and includes a word-pair grammar with perplexity 92 (on the June-90 test set). In addition, we have completely redesigned the back-end component, in order to emphasize portability and extensibility. The parser now produces an intermediate semantic frame representation, which serves as the focal point for all back-end operations, such as history management, text generation, and SQL query generation. Most of those aspects of the system that are tied to a particular domain are now entered through a set of tables associated with a small artificial language for decoding them. We have also improved the display of the database table, making it considerably easier for a subject to comprehend the information given. We report here on the results of the official DARPA February-91 evaluation, as well as on results of an evaluation on data collected at MIT, for both speech input and text input.

19 citations

Proceedings Article

Automatic learning of lexical representations for sub-word unit based speech recognition systems.

Michael S. Phillips, James Glass, Victor W. Zue

01 Jan 1991

11 citations

Proceedings Article

Modelling context dependency in acoustic-phonetic and lexical representations

Michael S. Phillips, James Glass, Victor W. Zue

19 Feb 1991

TL;DR: More complex, context-dependent models of lexical labels, along with an improved corrective training procedure for adapting pronunciation arc weights and a larger set of training data, have reduced the error rate by almost a factor of two on the Resource Management task.

Abstract: In 1989, our group first reported on the development of SUMMIT, a segment-based speaker-independent continuous-speech recognition system [13]. The initial version of SUMMIT made use of fairly simple context-independent models for the lexical labels. Recently, we have begun to incorporate more complex models of lexical labels that take into account a variety of contextual factors. These changes, along with an improved corrective training procedure for adapting pronunciation arc weights and a larger set of training data, have resulted in the reduction of error rate by almost a factor of two on the Resource Management task.

9 citations

Proceedings Article

Integrating syntax and semantics into spoken language understanding

Lynette Hirschman, Stephanie Seneff, David Goodine, Michael S. Phillips

19 Feb 1991

TL;DR: Combining the acoustic score with a parse probability normalized for number of terminals, together with an explicit rejection criterion, improved the overall score of the MIT VOYAGER system by more than 33%; experiments on a fully integrated system in which the parser predicts possible next words for the recognizer are underway.

Abstract: This paper describes several experiments combining natural language and acoustic constraints to improve overall performance of the MIT VOYAGER spoken language system. This system couples the SUMMIT speech recognition system with the TINA language understanding system to answer spoken queries about navigational assistance in the Cambridge, MA, area. The overall goal of our research is to combine acoustic, syntactic and semantic knowledge sources. Our first experiment showed improvement by combining acoustic score and parse probability normalized for number of terminals. Results were further improved by the use of an explicit rejection criterion based on normalized parse probabilities. The use of the combined parse/acoustic score, together with the rejection criterion, gave an improvement in overall score of more than 33% on both training and test data, where score is defined as percent correct minus percent incorrect. Experiments on a fully integrated system which uses the parser to predict possible next words to the recognizer are now underway.

8 citations
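
The scoring ideas in the abstract above can be sketched as follows, under clearly stated assumptions. The abstract gives only the ingredients (acoustic score, parse probability normalized for number of terminals, a rejection criterion on the normalized parse probability, and a score defined as percent correct minus percent incorrect). The additive combination, the `weight` parameter, the threshold value, and all names below are hypothetical choices for illustration, not the authors' formulation.

```python
from typing import List, NamedTuple, Optional


class Hypothesis(NamedTuple):
    text: str
    acoustic_score: float   # assumed to be a log-domain score, higher is better
    parse_logprob: float    # assumed log probability of the parse
    n_terminals: int        # number of terminals (words) in the parse


def normalized_parse_logprob(h: Hypothesis) -> float:
    """Parse log probability divided by the number of terminals."""
    return h.parse_logprob / h.n_terminals


def combined_score(h: Hypothesis, weight: float = 1.0) -> float:
    """Assumed combination: acoustic score plus weighted normalized parse score."""
    return h.acoustic_score + weight * normalized_parse_logprob(h)


def choose(hyps: List[Hypothesis], reject_below: float) -> Optional[str]:
    """Pick the best non-rejected hypothesis, or None if all are rejected."""
    viable = [h for h in hyps if normalized_parse_logprob(h) >= reject_below]
    if not viable:
        return None  # the system declines to answer
    return max(viable, key=combined_score).text


def overall_score(n_correct: int, n_incorrect: int, n_total: int) -> float:
    """Percent correct minus percent incorrect, as defined in the abstract."""
    return 100.0 * (n_correct - n_incorrect) / n_total
```

Because the metric subtracts the percentage of incorrect answers, a query that is rejected instead of answered wrongly raises the score, assuming rejected queries count as neither correct nor incorrect. That is one reading of why an explicit rejection criterion helps under this definition.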

Spoken language systems for human/machine interfaces.

Victor W. Zue, James Glass, Dave Goddeau, David Goodine, Lynette Hirschman, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff

01 Jan 1991

TL;DR: A lug nut is held and guided by a plastic cage formed by a flanged bush, whose cylindrical part has longitudinal apertures in which the lugs slide axially and bosses at right angles to the aperture adjacent the flange.

Abstract: A lug nut is held and guided by a plastic cage formed by a flanged bush, whose cylindrical part has longitudinal apertures in which the lugs slide axially and bosses at right angles to the apertures adjacent the flange. This assembly can be inserted from one side of a sheet into a corresponding slotted hole in the sheet, and on turning through 90 DEG is locked angularly by seating the bosses in the slots. The bush is of nylon and has an annular rib on a surface of the flange to seal the device against the sheet. The bush can have a radial protuberance and ramps on the bosses to prevent inadvertent extraction from the hole. The flange can overlap the cylindrical wall of the bush radially inwardly to form a guiding and sealing hole for the bolt to be engaged in the lug nut.

7 citations

Proceedings Article

The MIT ATIS system; preliminary development, spontaneous speech data collection, and performance evaluation.

Victor W. Zue, James Glass, David Goodine, Lynette Hirschman, Hong C. Leung, Michael S. Phillips, Joseph Polifroni, Stephanie Seneff

01 Jan 1991

5 citations

Collection and Analyses of WSJ-CSR Data at MIT

Michael S. Phillips, James Glass, Joseph Polifroni, Victor W. Zue

01 Jan 1991

TL;DR: The purpose of this paper is to document the authors' involvement in the development of the WSJ-CSR corpus, from recording and transcription to analyses and distribution, and to present the results of an experiment investigating the preprocessing of the prompt text.

Abstract: Recently, the DARPA community started a new data collection initiative in the Wall Street Journal (WSJ) domain to support research and development of very large vocabulary continuous speech recognition (CSR) systems. Since August 1991, our group has actively participated in the development of the WSJ-CSR corpus. The purpose of this paper is to document our involvement in this process, from recording and transcription to analyses and distribution. We will also present the results of an experiment investigating the preprocessing of the prompt text.

4 citations

Proceedings Article

Talking To Your Database: Interactive Spoken Language Interfaces

James Glass¹, David Goodine, L. Hirschman, Hong Leung, Michael S. Phillips, J. Polifroni, Stephanie Seneff, Victor W. Zue

Massachusetts Institute of Technology¹

31 Oct 1991

TL;DR: Spoken language interfaces offer significant benefits over conventional user interfaces for certain classes of applications, particularly hands-busy or eyes-busy applications, where typed input and/or visual displays may not be possible or convenient.

Abstract: This paper describes research on spoken language interfaces for interactive problem solving. A spoken language interface combines speech recognition technology with language understanding technology to provide an application-specific interface. The interface converts acoustic input (speech) into a series of words which are interpreted to produce the appropriate response and/or action. The system response may be spoken or it may be in the form of a display, as appropriate to the needs of the user. Spoken language interfaces offer significant benefits over conventional user interfaces for certain classes of applications, particularly hands-busy or eyes-busy applications, where typed input and/or visual displays may not be possible or convenient. To illustrate this, we present two examples of spoken language interfaces developed at MIT: an interactive system for urban navigation, VOYAGER; and an air travel planning system, ATIS. The VOYAGER system currently runs in a few times real time and is able to provide answers for more than 50% of user queries for untrained users.
