Patent•

Method and system of interpreting and presenting web content using a voice browser

Ramesh Sarukkai•
21 Aug 2001
TL;DR: A highly distributed, scalable, and efficient voice browser system seamlessly integrates audio from a variety of sources in a unified manner, such as audio advertisements recorded by sponsors, audio data collected by broadcast groups, and text-to-speech generated audio.
Abstract: A highly distributed, scalable, and efficient voice browser system provides the ability to seamlessly integrate a variety of audio into the system in a unified manner. The audio rendered to the user comes from various sources, such as, for example, audio advertisements recorded by sponsors, audio data collected by broadcast groups, and text-to-speech generated audio. In an embodiment, the voice browser architecture integrates a variety of components, including various telephony platforms (e.g., PSTN, VoIP), scalable architecture, rapid context switching, and backend web content integration, and provides access to information audibly.
Citations
Patent•
11 Jan 2011
TL;DR: An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions.
Abstract: An intelligent automated assistant system engages with the user in an integrated, conversational manner using natural language dialog, and invokes external services when appropriate to obtain information or perform various actions. The system can be implemented using any of a number of different platforms, such as the web, email, smartphone, and the like, or any combination thereof. In one embodiment, the system is based on sets of interrelated domains and tasks, and employs additional functionality powered by external services with which the system can interact.

1,462 citations

Patent•
TL;DR: Systems and methods receive speech and non-speech communications of natural language questions and commands, transcribe the speech and non-speech communications to textual messages, and execute the questions and/or commands.
Abstract: Systems and methods are provided for receiving speech and non-speech communications of natural language questions and/or commands, transcribing the speech and non-speech communications to textual messages, and executing the questions and/or commands. The invention applies context, prior information, domain knowledge, and user-specific profile data to achieve a natural environment for one or more users presenting questions or commands across multiple domains. The systems and methods create, store, and use extensive personal profile information for each user, thereby improving the reliability of determining the context of the speech and non-speech communications and presenting the expected results for a particular question or command.

1,164 citations

Patent•
29 Aug 2006
TL;DR: A mobile system includes speech-based and non-speech-based interfaces for telematics applications, and identifies and uses context, prior information, domain knowledge, and user-specific profile data to achieve a natural environment for users who submit requests and/or commands in multiple domains.
Abstract: A mobile system is provided that includes speech-based and non-speech-based interfaces for telematics applications. The mobile system identifies and uses context, prior information, domain knowledge, and user-specific profile data to achieve a natural environment for users that submit requests and/or commands in multiple domains. The invention creates, stores, and uses extensive personal profile information for each user, thereby improving the reliability of determining the context and presenting the expected results for a particular question or command. The invention may organize domain-specific behavior and information into agents that are distributable or updateable over a wide area network.

716 citations

Patent•
28 Sep 2012
TL;DR: A virtual assistant uses context information to supplement natural language or gestural input from a user. Context helps to clarify the user's intent, reduces the number of candidate interpretations of the user's input, and reduces the need for the user to provide excessive clarification input.
Abstract: A virtual assistant uses context information to supplement natural language or gestural input from a user. Context helps to clarify the user's intent and to reduce the number of candidate interpretations of the user's input, and reduces the need for the user to provide excessive clarification input. Context can include any available information that is usable by the assistant to supplement explicit user input to constrain an information-processing problem and/or to personalize results. Context can be used to constrain solutions during various phases of processing, including, for example, speech recognition, natural language processing, task flow processing, and dialog generation.

593 citations

Patent•
11 Dec 2007
TL;DR: A conversational, natural language voice user interface may provide an integrated voice navigation services environment in which the user can speak conversationally, using natural language, to issue queries, commands, or other requests relating to the navigation services provided in the environment.
Abstract: A conversational, natural language voice user interface may provide an integrated voice navigation services environment. The voice user interface may enable a user to make natural language requests relating to various navigation services, and further, may interact with the user in a cooperative, conversational dialogue to resolve the requests. Through dynamic awareness of context, available sources of information, domain knowledge, user behavior and preferences, and external systems and devices, among other things, the voice user interface may provide an integrated environment in which the user can speak conversationally, using natural language, to issue queries, commands, or other requests relating to the navigation services provided in the environment.

450 citations

References
Patent•
15 Oct 2001
TL;DR: Apparatus and methods for remote commerce, such as telemarketing and electronic commerce, intelligently select and proffer products, services or information to a user or customer in real time via electronic communication, such as through a telephone, videophone or other computer link.
Abstract: Apparatus and methods are provided for effecting remote commerce, such as in telemarketing (either inbound or outbound) and in electronic commerce, which are particularly adapted for the intelligent selection and proffer of products, services or information to a user or customer. In one aspect of the invention, goods, services or information are provided to the user via electronic communication, such as through a telephone, videophone or other computer link, as determined by the steps of first, establishing communication via the electronic communications device between the user and the system to effect a primary transaction or primary interaction, second, obtaining data with respect to the primary transaction or primary interaction, including at least in part a determination of the identity of the user or prospective customer, third, obtaining at least a second data element relating to the user, fourth, utilizing the primary transaction or primary interaction data along with the at least second data element as factors in determining at least one good, service or item of information for prospective upsell to the user or prospective customer, and offering the item to the prospective customer. In the preferred embodiment, the selection of the proffer of goods, services or information comprises an upsell with respect to the primary transaction or primary interaction data. The offer of the upsell is preferably generated and offered in real time, that is, during the course of the communication initiated with the primary transaction or primary interaction.

1,009 citations

Patent•
25 Jan 2002
TL;DR: A system and method provide universal access to voice-based documents containing information formatted using MIME and HTML standards, using customized extensions for voice information access and navigation.
Abstract: A system and method provides universal access to voice-based documents containing information formatted using MIME and HTML standards using customized extensions for voice information access and navigation. These voice documents are linked using HTML hyper-links that are accessible to subscribers using voice commands, touch-tone inputs and other selection means. These voice documents and components in them are addressable using HTML anchors embedding HTML universal resource locators (URLs) rendering them universally accessible over the Internet. This collection of connected documents forms a voice web. The voice web includes subscriber-specific documents including speech training files for speaker dependent speech recognition, voice print files for authenticating the identity of a user and personal preference and attribute files for customizing other aspects of the system in accordance with a specific subscriber.

983 citations

Patent•
07 Apr 1999
TL;DR: A wireless communications device with a markup language based man-machine interface provides a user interface for telecommunications functionality, including dialing telephone numbers, answering telephone calls, creating, sending, and receiving messages, and establishing configuration settings, defined in a markup language such as HTML and accessed through a browser program executed by the device.
Abstract: A wireless communications device with a markup language based man-machine interface provides a user interface for telecommunications functionality, including dialing telephone numbers, answering telephone calls, creating messages, sending messages, receiving messages, establishing configuration settings defined in markup language such as HTML, and accessed through a browser program executed by the wireless communication device. This feature enables direct access to Internet and World Wide Web content, such as Web pages, to be directly integrated with telecommunication functions of the device, and allows Web content to be seamlessly integrated with other data types, since all data presented to the user via the user interface is presented via markup language-based pages. The browser processes an extended form of HTML that provides new tags and attributes that enhance the navigational, logical, and display capabilities of conventional HTML, and particularly adapt HTML to be displayed and used on wireless communication devices with small screen displays.

775 citations

Patent•
05 Nov 1997
TL;DR: A unified messaging system provides a multimedia mailbox that lets a subscriber access stored multimedia messages, such as voicemail messages, facsimile messages, combined voice and facsimile messages, and video messages, not only through a public switched telephone network using a telephone but also over a data network, such as the Internet or an intranet, using a personal computer.
Abstract: A unified messaging system that provides a multimedia mailbox. The system allows a subscriber to access stored multimedia messages, such as voicemail messages, facsimile messages, combined voice and facsimile messages and video messages, not only through a public switched telephone network using a telephone but also over a data network, such as the Internet or an intranet, using a personal computer. The system provides voicemail access over the telephone network, indicating message number, etc., with the ability to play messages to the telephone user as desired. For text-type messages, such as facsimile and e-mail, the system converts the text into speech and plays the speech to the telephone user. The system allows a personal computer user to obtain the data network access using an Internet browser. The browser is used to access a home page of the system and get information about the messages stored, and is used to download (get) and play the messages at the personal computer via data streaming in the case of voice or video messages, or view the messages in the case of text-type messages, such as facsimile and e-mail. The user can also perform the other typical messaging functions over the data network connection that are provided for telephone access, such as viewing a message list, saving and deleting messages, group list administration and other administration tasks.

646 citations

Patent•
David Ladd, Gregory Johnson•
23 Jul 1999
TL;DR: A voice browser processes a markup language document using a network fetcher unit that retrieves information from an information source, a parser unit communicatively coupled to the network fetcher that parses the retrieved information based on predetermined syntax, an interpreter unit, and a state machine.
Abstract: A voice browser to process a markup language document. A voice browser includes a network fetcher unit to retrieve information from a destination of an information source. A parser unit is communicatively coupled to the network fetcher to parse the retrieved information based on predetermined syntax. The parser unit generates a tree structure representing the hierarchy of the retrieved information. An interpreter unit and a state machine are also used. The method includes the steps of retrieving and parsing a markup language document to determine at least one user input, determining whether the user input corresponds to a predetermined grammar, and using the predetermined grammar when the user input corresponds to the predetermined grammar. The method of determining a grammar is based upon phonetic rules and pronunciation. The grammar is sent to a speech recognition engine and compared to a user input.

539 citations