scispace - formally typeset
Search or ask a question
Topic

Voice browser

About: Voice browser is a research topic. Over the lifetime, 140 publications have been published within this topic receiving 5476 citations.


Papers
More filters
Proceedings ArticleDOI
08 May 2007
TL;DR: The results show that the use of context can potentially save browsing time and substantially improve browsing experience of visually disabled people.
Abstract: Web sites are designed for graphical mode of interaction. Sighted users can "cut to the chase" and quickly identify relevant information in Web pages. On the contrary, individuals with visual disabilities have to use screen-readers tobrowse the Web. As screen-readers process pages sequentially and read through everything, Web browsing can become strenuous and time-consuming. Although, the use ofshortcuts and searching offers some improvements, the problem still remains. In this paper, we address the problemof information overload in non-visual Web access using thenotion of context. Our prototype system, CSurf, embodyingour approach, provides the usual features of a screen-reader.However, when a user follows a link, CSurf captures thecontext of the link using a simple topic-boundary detectiontechnique, and uses it to identify relevant information onthe next page with the help of a Support Vector Machine, astatistical machine-learning model. Then, CSurf reads the Web page starting from the most relevant section, identifiedby the model. We conducted a series experiments to evaluate the performance of CSurf against the state-of-the-artscreen-reader, JAWS. Our results show that the use of context can potentially save browsing time and substantiallyimprove browsing experience of visually disabled people.

100 citations

PatentDOI
TL;DR: In this article, a voice browsing system maintains a database containing a list of information sources such as web sites, connected to a network, each of the information sources is assigned a rank number which is listed in the database along with the record for the information source.
Abstract: The present invention relates to a system for acquiring information from sources on a network, such as the Internet. A voice browsing system maintains a database containing a list of information sources, such as web sites, connected to a network. Each of the information sources is assigned a rank number which is listed in the database along with the record for the information source. In response to a speech command received from a user, a network interface system accesses the information source with the highest rank number in order to retrieve information requested by the user.

99 citations

Patent
30 Mar 2000
TL;DR: In this paper, an interactive voice response system includes a server and a set of mobile clients, each mobile client includes a microphone, a speaker or headset, a processor and a voice browser.
Abstract: An interactive voice response system includes a server and a set of mobile clients. The server and clients include RF transceivers for exchanging messages over an RF channel. Each mobile client includes a microphone, a speaker or headset, a processor and a voice browser. The voice browser interprets voice pages received from the server. Upon receiving a particular voice page from the server, the voice browser outputs via the speaker voice prompts specified by the voice page. A speech recognition engine used by the voice browser converts voice responses from a user into a text response. The voice browser then performs an action based on the text response. The action taken may be to request a new voice page from the server, or to continue to interpret the current voice page.

94 citations

Patent
09 Jan 2003
TL;DR: In this paper, a voice browser dialog enabler for multimodal dialog uses a multimodAL markup document with fields have markup-based forms associated with each field and defining fragments.
Abstract: A voice browser dialog enabler for multimodal dialog uses a multimodal markup document with fields have markup-based forms associated with each field and defining fragments. A voice browser driver resides on a communication device and provides the fragments and identifiers that identify the fragments. A voice browser implementation resides on a remote voice server and receives the fragments from the driver and downloads a plurality of speech grammars. Input speech is matched against those speech grammars associated with the corresponding identifiers received in a recognition request from the voice browser driver.

73 citations

Patent
22 Feb 2000
Abstract: The present invention relates to a voice browser (110) and a method at a voice browser, the voice browser (110) being arranged at a server (120) connected to the Internet (130) and responsive to Dual Tone MultiFrequency (DTMF) tones received from a telecommunications network (150). The voice browser is responsive to different sets of predetermined DTMF tones, one set dedicated for voice browser functions and another set dedicated for HTML application functions. The voice browser (110) synchronises the possible DTMF tones that can be accepted for a certain browsed part of an HTML page.

71 citations


Network Information
Related Topics (5)
XML
26.6K papers, 393.3K citations
69% related
Hidden Markov model
28.3K papers, 725.3K citations
67% related
Metadata
43.9K papers, 642.7K citations
67% related
Web page
50.3K papers, 975.1K citations
67% related
Speaker recognition
14.9K papers, 310K citations
66% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20183
20174
20162
20153
20141
20133