scispace - formally typeset
Search or ask a question
Topic

Speech Application Language Tags

About: Speech Application Language Tags is a research topic. Over the lifetime, 12 publications have been published within this topic receiving 288 citations. The topic is also known as: SALT.

Papers
More filters
PatentDOI
Kuansan Wang1
TL;DR: In this paper, a language model consisting of an N-gram language model and a context-free grammar language model is used to store information related to words and semantic information to be recognized.
Abstract: A speech understanding system includes a language model comprising a combination of an N-gram language model and a context-free grammar language model. The language model stores information related to words and semantic information to be recognized. A module is adapted to receive input from a user and capture the input for processing. The module is further adapted to receive SALT application program interfaces pertaining to recognition of the input. The module is configured to process the SALT application program interfaces and the input to ascertain semantic information pertaining to a first portion of the input and output a semantic object comprising text and semantic information for the first portion by accessing the language model, wherein performing recognition and outputting the semantic object are performed while capturing continues for subsequent portions of the input.

245 citations

Patent
Kuansan Wang1
03 Jul 2003
TL;DR: In this article, a system and method for combining VoiceXML with an speech application development tool such as SALT is presented, where a SALT module includes speech application language tags to execute instructions associated with the VXML module.
Abstract: The present invention provides a system and method for combining VoiceXML with an speech application development tool such as SALT. In one aspect of the present invention, a VoiceXML module includes VoiceXML executable instructions. A SALT module includes speech application language tags to execute instructions associated with the VoiceXML module.

19 citations

Book ChapterDOI
J. Larson1
01 Jan 2006
TL;DR: Speech Application Language Tags (SALT) is a small number of XML elements that may be embedded into host programming languages to speech-enable applications to develop telephony and multimodal applications.
Abstract: Enabling users to speak and listen to a computer will greatly enhance users' ability to access computers at any time from nearly any place. Speech Application Language Tags (SALT) is a small number of XML elements that may be embedded into host programming languages to speech-enable applications. SALT may be used to develop telephony (speech input and output only) applications and multimodal applications (speech input and output, as well as keyboard and mouse input and display output). SALT and the host programming language provide control structures not available in VoiceXML, the current standard language for developing speech applications.

10 citations

Proceedings ArticleDOI
10 Mar 2006
TL;DR: VTQuest is engineered based on Java 2 Platform, Enterprise Edition (J2EE) using Scalable Vector Graphics (SVG) and Speech Application Language Tags (SALT) and SVG enables zooming into the maps without losing image quality.
Abstract: Finding our way out at a large university campus is a problem. We developed VTQuest, http://sunfish.cs.vt.edu/VTQuestV, as a web-based software system to solve this problem for the campus of Virginia Tech (http://www.vt.edu/). VTQuest enables (a) multimodal interaction with voice, mouse, and keyboard, (b) browsing the campus map, (c) locating a building by name, abbreviation, category, or within a distance on the campus map, (d) locating a room on the floor plan of a building, and (e) obtaining walking directions from one building to another. VTQuest provides these capabilities for 103 buildings with floor plans for most of the buildings. VTQuest is engineered based on Java 2 Platform, Enterprise Edition (J2EE) using Scalable Vector Graphics (SVG) and Speech Application Language Tags (SALT). SVG enables zooming into the maps without losing image quality. The voice interface offers a variety of features including an extensive grammar and out-of-turn interaction.

6 citations

Journal ArticleDOI
TL;DR: The technologies currently available for speech interaction with computers are reviewed and how the future of web navigation may benefit from these technologies are suggested.
Abstract: INTRODUCTION Since the computer as we know it was first developed, man has dreamt of systems that can be operated by voice command. Science fiction has portrayed a future where users communicate with computers by voice, and generations have grown up expecting a new dawn whereby we speak and the computer speaks back. The days of communicating naturally with computers by speech, however, are not so far off. Recent technological developments have led to a new way of thinking about how we should interact with computers in general, with the development of the XML specification and the use of XML to develop applications for many types of data. In particular, Voice XML and Speech Application Language Tags (SALT) (http://www.saltforum.org/) are technologies that could revolutionise navigation of computer systems. Speech technology seems an ideal complement to web appliances. Advances in speech processing have already yielded for users a far more natural sounding voice reading web information. The incorporation of speech recognition can be an important source of user acceptance of any system. Teaching through the use of web pages has provided an alternative way of learning for many people including those who have a disability. A severely physically disabled student may find it advantageous to study at home, but if the principal teaching medium is the web and the student is unable to independently navigate the pages of the site, then a barrier to learning exists. The provision of services such as audio transcription may depend on the number of students who require the service, and providing 100 students per year with an audio recording of the same course does have advantage in scale compared to a situation where there are only a few students in a course. This paper reviews the technologies currently available for speech interaction with computers and suggests how the future of web navigation may benefit from these technologies. The paper also discusses the results obtained from evaluation of a voice navigatable website aimed at disabled users. DISABILITY Speech recognition is being used to give some severely disabled people a means of remote control of their environments--a way to choose independently among several activities such as watching television and listening to the radio. For people with physical disabilities affecting the use of their hands, productivity is far more personal. Speech recognition can enable some people to perform daily living tasks without assistance. For others, such as the growing number of professionals afflicted with Repetitive Stress Syndrome (RSI), speech recognition represents a means of getting or keeping employment. Reasons that people with a disability use voice recognition include limited use of hands, difficulty with spelling, inability to touch type, and need of a fast way of working. Therefore a user needing to work "hands-free" will need a reliable method of controlling menus: a means of moving and clicking the mouse using voice commands and a correction mechanism that works completely by voice. Users with spelling difficulties need predictive correction, a text to speech feature to help with proofreading, and a facility to "play back" what was dictated (i.e. their voice). People who wish to speed up typing are likely to be most interested in speed of recognition and easy correction using the mouse. Many visually impaired users are attracted to the idea of operating their computer by voice, necessitated either by keyboarding difficulties or lack of touch-typing proficiency. Voice in/Voice out is the use of a computer with voice recognition as a means of information input, combined with synthetic speech that enables the user to know what is on the computer screen. Many problems are inherent in using voice recognition with speech output. The main problems are: * Hearing the words or phrases echoed back is often not enough for the user to be sure there are no errors in the recognition or formatting of the text. …

5 citations

Network Information
Related Topics (5)
Web navigation
14.9K papers, 389.6K citations
80% related
User modeling
10.7K papers, 278K citations
79% related
Web modeling
21.8K papers, 467.5K citations
79% related
Web standards
15.7K papers, 393.4K citations
78% related
XML
26.6K papers, 393.3K citations
78% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
20151
20101
20071
20062
20052
20043