Author

Gerald M. McCobb

Other affiliations: IBM
Bio: Gerald M. McCobb is an academic researcher from Nuance Communications. The author has contributed to research on topics including Grammar and Markup language. The author has an h-index of 19 and has co-authored 28 publications receiving 1,473 citations. Previous affiliations of Gerald M. McCobb include IBM.

Papers
Patent
04 Feb 2008
TL;DR: In this article, a method is described for ordering recognition results produced by an automatic speech recognition (ASR) engine for a multimodal application, implemented with a grammar of the multimodal application in the ASR engine.
Abstract: A method is described for ordering recognition results produced by an automatic speech recognition (ASR) engine for a multimodal application implemented with a grammar of the multimodal application in the ASR engine, with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine through a VoiceXML interpreter. The method includes: receiving, in the VoiceXML interpreter from the multimodal application, a voice utterance; determining, by the VoiceXML interpreter using the ASR engine, a plurality of recognition results in dependence upon the voice utterance and the grammar; determining, by the VoiceXML interpreter according to semantic interpretation scripts of the grammar, a weight for each recognition result; and sorting, by the VoiceXML interpreter, the plurality of recognition results in dependence upon the weight for each recognition result.
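The sorting step this abstract describes can be pictured as a simple rescoring of an n-best list. A minimal sketch, where the data shapes, names, and example weights are assumptions for illustration, not the patented implementation:

```python
# Illustrative only: each recognition result pairs recognized text with a
# weight that, per the abstract, a semantic interpretation script in the
# grammar would assign; the interpreter then sorts the n-best list by weight.

def sort_recognition_results(results):
    """results: list of (text, weight) tuples; returns highest weight first."""
    return sorted(results, key=lambda r: r[1], reverse=True)

nbest = [("call home", 0.4), ("call Holmes", 0.7), ("call Rome", 0.2)]
print(sort_recognition_results(nbest))
# -> [('call Holmes', 0.7), ('call home', 0.4), ('call Rome', 0.2)]
```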

189 citations

Patent
08 Dec 2004
TL;DR: In this paper, a multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component, which can determine which of a plurality of speech processing configurations is used by the host in rendering the voice-based content.
Abstract: A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any, of the multimodal document. The voice browser component can determine which of a plurality of speech processing configurations is used by the host in rendering the voice-based content. The determination can be based upon the resources of the host running the application. The determination also can be based upon a processing instruction contained in the application.
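The configuration-selection logic in this abstract can be sketched as a short dispatch. The resource threshold, configuration names, and processing-instruction key below are illustrative assumptions, not the patent's actual values:

```python
def choose_speech_config(host):
    """Pick a speech-processing configuration for the host rendering the
    voice-based content. An explicit processing instruction in the
    application wins; otherwise fall back to the host's resources."""
    if host.get("processing_instruction"):
        return host["processing_instruction"]
    if host.get("ram_mb", 0) >= 256:    # enough memory for an embedded engine
        return "local"
    return "distributed"                # offload recognition to a server

print(choose_speech_config({"ram_mb": 64}))
# -> distributed
```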

178 citations

Patent
21 Oct 2005
TL;DR: In this article, a method of building a mixed-initiative grammar can include receiving one or more conjoin phrases, wherein each conjoin phrase is associated with a selected one of a plurality of directed dialog grammars, and receiving a user input specifying a selected grammar generation technique.
Abstract: A method of building a mixed-initiative grammar can include receiving one or more conjoin phrases, wherein each conjoin phrase is associated with a selected one of the plurality of directed dialog grammars, and receiving a user input specifying a selected grammar generation technique. The mixed-initiative grammar can be automatically generated, in accordance with the selected grammar generation technique, such that the mixed-initiative grammar specifies an allowable ordering of sets when interpreting a user spoken utterance and whether duplicative phrases are allowable within the user spoken utterance.
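One way to picture the "allowable ordering of sets" the abstract mentions is as an enumeration of slot-set permutations joined by a conjoin phrase. A toy sketch, where the function name, inputs, and generation strategy are all assumptions:

```python
from itertools import permutations

def build_mixed_initiative_phrases(slot_sets, conjoin="and"):
    """Enumerate every allowable ordering of the directed-dialog slot sets,
    joining them with the conjoin phrase (illustrative only)."""
    return [f" {conjoin} ".join(order) for order in permutations(slot_sets)]

print(build_mixed_initiative_phrases(["from Boston", "to Denver"]))
# -> ['from Boston and to Denver', 'to Denver and from Boston']
```

A real mixed-initiative grammar would be emitted in a grammar format such as SRGS rather than as flat strings; the permutation step is the part this sketch shows.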

122 citations

Patent
20 May 2003
TL;DR: In this paper, a method for enhancing voice interactions within a portable multimodal computing device using visual messages is presented, where the message is a prompt for the speech input and/or a confirmation of the input.
Abstract: A method for enhancing voice interactions within a portable multimodal computing device using visual messages. A multimodal interface can be provided that includes an audio interface and a visual interface. A speech input can then be received and a voice recognition task can be performed upon at least a portion of the speech input. At least one message within the multimodal interface can be visually presented, wherein the message is a prompt for the speech input and/or a confirmation of the speech input.

99 citations

Patent
09 Aug 2005
TL;DR: In this paper, a computer-implemented method and system are provided for filling a graphic-based form field in response to a speech utterance, which includes generating a grammar corresponding to the form field, the grammar being based on a user profile and comprising a semantic interpretation string.
Abstract: A computer-implemented method and system are provided for filling a graphic-based form field in response to a speech utterance. The computer-implemented method includes generating a grammar corresponding to the form field, the grammar being based on a user profile and comprising a semantic interpretation string. The method further includes creating an auto-fill event based upon the at least one grammar and responsive to the speech utterance, the auto-fill event causing the filling of the form field with data corresponding to the user profile. The system includes a grammar-generating module for generating a grammar corresponding to the form field, the grammar being based on a user profile and comprising a semantic interpretation string. The system also includes an event module for creating an auto-fill event based upon the at least one grammar and responsive to the speech utterance, the event causing the filling of the form field with data corresponding to the user profile.
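The auto-fill flow might be approximated as a lookup from a spoken phrase to profile data, with the "auto-fill event" firing when the phrase matches. A minimal sketch, assuming a dictionary-shaped profile and form; none of these names come from the patent:

```python
def make_autofill_grammar(profile):
    """Map spoken phrases to profile data; each value plays the role of a
    semantic interpretation string resolving to the user's profile."""
    return {
        "my home address": profile.get("home_address"),
        "my work address": profile.get("work_address"),
    }

def autofill(form, field, utterance, grammar):
    """Fire an auto-fill event: fill the form field when the utterance
    matches a grammar phrase with associated profile data."""
    value = grammar.get(utterance.lower())
    if value is not None:
        form[field] = value
    return form

grammar = make_autofill_grammar({"home_address": "1 Main St"})
print(autofill({}, "address", "My home address", grammar))
# -> {'address': '1 Main St'}
```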

95 citations


Cited by
Patent
11 Jan 2011
TL;DR: In this article, an intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.
Abstract: An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone, and the like, or any combination thereof. In one embodiment, the system is based on sets of interrelated domains and tasks, and employs additional functionally powered by external services with which the system can interact.

1,462 citations

Patent
28 Sep 2012
TL;DR: In this article, a virtual assistant uses context information to supplement natural language or gestural input from a user. Context helps to clarify the user's intent, reduces the number of candidate interpretations of the user's input, and reduces the need for the user to provide excessive clarification input.
Abstract: A virtual assistant uses context information to supplement natural language or gestural input from a user. Context helps to clarify the user's intent and to reduce the number of candidate interpretations of the user's input, and reduces the need for the user to provide excessive clarification input. Context can include any available information that is usable by the assistant to supplement explicit user input to constrain an information-processing problem and/or to personalize results. Context can be used to constrain solutions during various phases of processing, including, for example, speech recognition, natural language processing, task flow processing, and dialog generation.
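The idea of context constraining candidate interpretations can be sketched as a rescoring pass over an interpretation list. The flat bonus, data shapes, and example intents below are illustrative assumptions, not the implementation the patent describes:

```python
def rescore_with_context(interpretations, context_intents, bonus=0.2):
    """interpretations: (intent, score) pairs from earlier processing.
    Boost any candidate whose intent matches the current context, then
    re-rank, narrowing the field without extra clarification input."""
    rescored = [(intent, score + (bonus if intent in context_intents else 0.0))
                for intent, score in interpretations]
    return sorted(rescored, key=lambda pair: pair[1], reverse=True)

candidates = [("send message", 0.6), ("play music", 0.5)]
best = rescore_with_context(candidates, context_intents={"play music"})
print(best[0][0])
# -> play music
```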

593 citations

Patent
08 Sep 2006
TL;DR: In this paper, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontology includes at least one active processing element that models a domain.
Abstract: A method and apparatus are provided for building an intelligent automated assistant. Embodiments of the present invention rely on the concept of “active ontologies” (e.g., execution environments constructed in an ontology-like manner) to build and run applications for use by intelligent automated assistants. In one specific embodiment, a method for building an automated assistant includes interfacing a service-oriented architecture that includes a plurality of remote services to an active ontology, where the active ontology includes at least one active processing element that models a domain. At least one of the remote services is then registered for use in the domain.

389 citations

Patent
Aram Lindahl1
24 May 2012
TL;DR: In this paper, an electronic device may capture a voice command from a user, store contextual information about the state of the device when the command is received, and transmit both to computing equipment such as a desktop computer or a remote server.
Abstract: An electronic device may capture a voice command from a user. The electronic device may store contextual information about the state of the electronic device when the voice command is received. The electronic device may transmit the voice command and the contextual information to computing equipment such as a desktop computer or a remote server. The computing equipment may perform a speech recognition operation on the voice command and may process the contextual information. The computing equipment may respond to the voice command. The computing equipment may also transmit information to the electronic device that allows the electronic device to respond to the voice command.

385 citations

Patent
30 Sep 2011
TL;DR: In this article, the authors present a method for automatically determining that an electronic device is in a vehicle, without user input and without regard to whether a digital assistant application has been separately invoked, and responsively invoking a listening mode of a virtual assistant implemented by the device.
Abstract: The method includes automatically, without user input and without regard to whether a digital assistant application has been separately invoked by a user, determining that the electronic device is in a vehicle. In some implementations, determining that the electronic device is in a vehicle comprises detecting that the electronic device is in communication with the vehicle (e.g., via a wired or wireless communication techniques and/or protocols). The method also includes, responsive to the determining, invoking a listening mode of a virtual assistant implemented by the electronic device. In some implementations, the method also includes limiting the ability of a user to view visual output presented by the electronic device, provide typed input to the electronic device, and the like.

367 citations