Voice browser

About: Voice browser is a research topic. Over its lifetime, 140 publications have been published within this topic, receiving 5476 citations.
25 Jan 2002
Abstract: A system and method provides universal access to voice-based documents containing information formatted using MIME and HTML standards using customized extensions for voice information access and navigation. These voice documents are linked using HTML hyper-links that are accessible to subscribers using voice commands, touch-tone inputs and other selection means. These voice documents and components in them are addressable using HTML anchors embedding HTML universal resource locators (URLs) rendering them universally accessible over the Internet. This collection of connected documents forms a voice web. The voice web includes subscriber-specific documents including speech training files for speaker dependent speech recognition, voice print files for authenticating the identity of a user and personal preference and attribute files for customizing other aspects of the system in accordance with a specific subscriber.

983 citations
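
To make the idea of a "voice web" concrete, here is a minimal sketch of voice documents linked by hyperlinks that a caller can follow by spoken command or touch-tone key. It is an illustration only, not the patented method: the `VOICE_WEB` table, the `follow` helper, the `say`/`key` fields and the example.com URLs are all hypothetical.

```python
# Hypothetical in-memory "voice web": each document has a prompt and
# a list of links reachable by a spoken word or a touch-tone digit.
VOICE_WEB = {
    "http://example.com/home": {
        "prompt": "Main menu. Say news or weather.",
        "links": [
            {"say": "news", "key": "1", "href": "http://example.com/news"},
            {"say": "weather", "key": "2", "href": "http://example.com/weather"},
        ],
    },
    "http://example.com/news": {"prompt": "Today's headlines.", "links": []},
    "http://example.com/weather": {"prompt": "Today's forecast.", "links": []},
}


def follow(url, user_input):
    """Return the linked URL selected by voice or key input.

    If nothing matches, stay on the current document (an assumed policy).
    """
    token = user_input.strip().lower()
    for link in VOICE_WEB[url]["links"]:
        if token in (link["say"], link["key"]):
            return link["href"]
    return url


next_url = follow("http://example.com/home", "News")
print(VOICE_WEB[next_url]["prompt"])
```

A real system would replace the string comparison with speech recognition and DTMF detection; the lookup structure is the only part sketched here.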

David Ladd, Gregory Johnson
23 Jul 1999
Abstract: A voice browser to process a markup language document. A voice browser includes a network fetcher unit to retrieve information from a destination of an information source. A parser unit is communicatively coupled to the network fetcher to parse the retrieved information based on predetermined syntax. The parser unit generates a tree structure representing the hierarchy of the retrieved information. An interpreter unit and a state machine are also used. The method includes the steps of retrieving and parsing a markup language document to determine at least one user input, determining whether the user input corresponds to a predetermined grammar, and using the predetermined grammar when the user input corresponds to the predetermined grammar. The method of determining a grammar is based upon phonetic rules and pronunciation. The grammar is sent to a speech recognition engine and compared to a user input.

539 citations
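
The parse-then-match flow described in the abstract above can be sketched roughly as follows. This is a simplified illustration under stated assumptions: the markup vocabulary (`dialog`, `input`, `option`) and the helper names are invented for the example, the document is supplied as a string rather than fetched over a network, and the grammar is a plain set of phrases rather than one derived from phonetic rules.

```python
import xml.etree.ElementTree as ET

# Hypothetical markup; element names are illustrative, not from the patent.
DOCUMENT = """
<dialog>
  <prompt>Which city?</prompt>
  <input name="city">
    <option>boston</option>
    <option>chicago</option>
  </input>
</dialog>
"""


def parse_document(text):
    """Parse markup into an element tree (the 'parser unit' role)."""
    return ET.fromstring(text.strip())


def collect_grammar(tree):
    """Map each input name to the set of phrases it accepts."""
    return {
        node.get("name"): {opt.text.strip().lower() for opt in node.findall("option")}
        for node in tree.iter("input")
    }


def match_input(grammar, field, utterance):
    """Return the normalized utterance if the grammar accepts it, else None."""
    phrase = utterance.strip().lower()
    return phrase if phrase in grammar.get(field, set()) else None


tree = parse_document(DOCUMENT)
grammar = collect_grammar(tree)
result = match_input(grammar, "city", " Boston ")
```

In a real voice browser the grammar would be handed to a speech recognition engine; here a text comparison stands in for that step.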

Ramesh Sarukkai
21 Aug 2001
Abstract: A highly distributed, scalable, and efficient voice browser system provides the ability to seamlessly integrate a variety of audio into the system in a unified manner. The audio rendered to the user comes from various sources, such as, for example, audio advertisements recorded by sponsors, audio data collected by broadcast groups, and text to speech generated audio. In an embodiment, voice browser architecture integrates a variety of components including: various telephony platforms (e.g. PSTN, VOIP), scalable architecture, rapid context switching, and backend web content integration and provides access to information audibly.

247 citations

01 Sep 2006
Abstract: A distributed voice applications system includes a voice applications rendering agent and at least one voice applications agent that is configured to provide voice applications to an individual user. A management system may control and direct the voice applications rendering agent to create voice applications that are personalized for individual users based on user characteristics, information about the environment in which the voice applications will be performed, prior user interactions and other information. The voice applications agent and components of customized voice applications may be resident on a local user device which includes a voice browser and speech recognition capabilities. The local device, voice applications rendering agent and management system may be interconnected via a communications network.

207 citations

08 Dec 2004
Abstract: A multimodal browser for rendering a multimodal document on an end system defining a host can include a visual browser component for rendering visual content, if any, of the multimodal document, and a voice browser component for rendering voice-based content, if any, of the multimodal document. The voice browser component can determine which of a plurality of speech processing configurations is used by the host in rendering the voice-based content. The determination can be based upon the resources of the host running the application. The determination also can be based upon a processing instruction contained in the application.

178 citations
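
The configuration choice described in the abstract above (by host resources or by a processing instruction) might look something like the sketch below. Everything specific here is an assumption made for illustration: the `Host` fields, the `"local"`/`"remote"` configuration names, the 512 MB threshold, and the rule that an explicit instruction takes priority over resource checks.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Host:
    memory_mb: int               # hypothetical resource measure
    has_local_recognizer: bool   # hypothetical capability flag


def choose_speech_config(host: Host, instruction: Optional[str] = None) -> str:
    """Pick a speech processing configuration for the voice browser component."""
    # Assumed priority: an explicit processing instruction wins.
    if instruction in ("local", "remote"):
        return instruction
    # Otherwise fall back to the host's resources (threshold is illustrative).
    if host.has_local_recognizer and host.memory_mb >= 512:
        return "local"
    return "remote"


print(choose_speech_config(Host(memory_mb=1024, has_local_recognizer=True)))
```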

Network Information
Related Topics (5)
Speech Application Language Tags

12 papers, 288 citations

81% related
Java Speech Markup Language

2 papers, 8 citations

78% related
Speech Recognition Grammar Specification

9 papers, 58 citations

78% related
Speech Synthesis Markup Language

37 papers, 392 citations

76% related
Audio search engine

12 papers, 895 citations

75% related
Topic's top 5 most impactful authors

Victor S. Moore

6 papers, 124 citations

Wendi L. Nusbickel

4 papers, 107 citations

Sandeep Sibal

4 papers, 143 citations

Inderpal Singh Mumick

4 papers, 143 citations

Nitendra Rajput

3 papers, 43 citations