scispace - formally typeset
G

Gerald M. McCobb

Researcher at Nuance Communications

Publications -  28
Citations -  1473

Gerald M. McCobb is an academic researcher from Nuance Communications. The author has contributed to research in topics: Grammar & Markup language. The author has an hindex of 19, co-authored 28 publications receiving 1473 citations. Previous affiliations of Gerald M. McCobb include IBM.

Papers
More filters
Patent

Ordering recognition results produced by an automatic speech recognition engine for a multimodal application

TL;DR: In this article, a method is described for ordering recognition results produced by an automatic speech recognition (ASR) engine for a multimodal application implemented with a grammar of the multimodAL application in the ASR engine.
Patent

Dynamic switching between local and remote speech rendering

TL;DR: In this paper, a multimodal browser for rendering a multi-modal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodi-al document, and a voice browser component, which can determine which of a plurality of speech processing configuration is used by the host in rendering the voice-based content.
Patent

Creating a mixed-initiative grammar from directed dialog grammars

TL;DR: In this article, a method of building a mixed-initiative grammar can include receiving one or more conjoin phrases, wherein each conjoin phrase is associated with a selected one of the plurality of directed dialog grammars, and receiving a user input specifying a selected grammar generation technique.
Patent

Method of enhancing voice interactions using visual messages

TL;DR: In this paper, a method for enhancing voice interactions within a portable multimodal computing device using visual messages is presented, where the message is a prompt for the speech input and/or a confirmation of the input.
Patent

Method and system for voice-enabled autofill

TL;DR: In this paper, a computer-implemented method and system are provided for filling a graphic-based form field in response to a speech utterance, which includes generating a grammar corresponding to the form field, the grammar being based on a user profile and comprising a semantic interpretation string.