Institution

Nuance Communications

Company, Vienna, Austria
About: Nuance Communications is a company based in Vienna, Austria. It is known for research contributions in the topics of speech processing and voice activity detection. The organization has 1518 authors who have published 1701 publications receiving 54891 citations. It is also known as ScanSoft and ScanSoft Inc.


Papers
Patent
01 Dec 1999
TL;DR: This patent describes a system that lets a user audibly and interactively browse a network of audio information, integrating the world wide web with the entire telephone network so that both are browsable from any telephone set.
Abstract: The present invention allows a user to audibly and interactively browse through a network of audio information, forming a seamless integration of the world wide web and the entire telephone network browsable from any telephone set. Preferably, a browser controller (102) allows the user (100) to receive audio information and to transmit verbal instructions. The browser controller (102) links the user (100) to voice pages (108, 112, 114, 116, 118, 120), which can be any telephone station (108, 112, 114, 116) or world wide web page (120), in response to voice commands. Upon linking, certain information is played with an audio indicia which identifies a linking capability. If the user (100) repeats the information set off by the audio indicia, the telephone number or URL of the selected link is transmitted to the browser controller (112). The browser controller (112) establishes a new link with the identified telephone number or URL, and if successful, disconnects the previous link. The originator (100) no longer needs to know of the existence of the receiver nor the telephone number or URL of the receiver because this invention provides a method to browse the entire telephone network and world wide web and to connect to a receiver by saying the name of the hyperlink. This brings the power of the world wide web to the telephone network. In effect, this invention takes the PSTN from its current state as a set of more than 800 million nodes including means to make pairwise connections and converts it to a highly interconnected browsable web, as well as integrating it with the entire world wide web.

254 citations

Patent
27 Aug 2010
TL;DR: In this article, improved capabilities are described for multiple web-based content category searching for web content on a mobile communication facility comprising capturing speech presented by a user using a resident capture facility on the mobile communication facility, transmitting at least a portion of the captured speech as data through a wireless communication facility to a speech recognition facility, generating speech-to-text results for the captured speech utilizing the speech recognition facility, and transmitting the text results and a plurality of formatting rules specifying how search text may be used to form a query for a search capability on the mobile communications facility, wherein each formatting
Abstract: In embodiments of the present invention improved capabilities are described for multiple web-based content category searching for web content on a mobile communication facility comprising capturing speech presented by a user using a resident capture facility on the mobile communication facility; transmitting at least a portion of the captured speech as data through a wireless communication facility to a speech recognition facility; generating speech-to-text results for the captured speech utilizing the speech recognition facility; and transmitting the text results and a plurality of formatting rules specifying how search text may be used to form a query for a search capability on the mobile communications facility, wherein each formatting rule is associated with a category of content to be searched.

248 citations

Patent
10 Oct 2003
TL;DR: In this paper, the error rate of a pronunciation guesser that guesses the phonetic spelling of words used in speech recognition is improved by causing its training to weigh letter-to-phoneme mappings used as data in such training as a function of the frequency of the words in which such mappings occur.
Abstract: The error rate of a pronunciation guesser that guesses the phonetic spelling of words used in speech recognition is improved by causing its training to weigh letter-to-phoneme mappings used as data in such training as a function of the frequency of the words in which such mappings occur. Preferably the ratio of the weight to word frequency increases as word frequency decreases. Acoustic phoneme models for use in speech recognition with phonetic spellings generated by a pronunciation guesser that makes errors are trained against word models whose phonetic spellings have been generated by a pronunciation guesser that makes similar errors. As a result, the acoustic models represent blends of phoneme sounds that reflect the spelling errors made by the pronunciation guessers. Speech recognition enabled systems are made by storing in them both a pronunciation guesser and a corresponding set of such blended acoustic models.

241 citations

Patent
09 Apr 2007
TL;DR: In this article, a speech-enabled WWW based computing system allows a user to interact with content associated with a web page and select items of interest using speech as a mode of input.
Abstract: A speech-enabled WWW based computing system allows a user to interact with content associated with a web page and select items of interest using speech as a mode of input. Dynamic grammars can assist in the recognition operations to improve speed and comprehension.

239 citations

Journal Article
07 Oct 2005 - Science
TL;DR: Scanning near-field ultrasound holography (SNFUH), a nondestructive imaging method that provides depth information as well as spatial resolution at the 10- to 100-nanometer scale, has been used to image buried nanostructures, to perform subsurface metrology in microelectronic structures, and to image malaria parasites in red blood cells.
Abstract: A nondestructive imaging method, scanning near-field ultrasound holography (SNFUH), has been developed that provides depth information as well as spatial resolution at the 10- to 100-nanometer scale. In SNFUH, the phase and amplitude of the scattered specimen ultrasound wave, reflected in perturbation to the surface acoustic standing wave, are mapped with a scanning probe microscopy platform to provide nanoscale-resolution images of the internal substructure of diverse materials. We have used SNFUH to image buried nanostructures, to perform subsurface metrology in microelectronic structures, and to image malaria parasites in red blood cells.

236 citations


Authors

Showing all 1521 results

Name | H-index | Papers | Citations
Vinayak P. Dravid | 103 | 817 | 43612
Mehryar Mohri | 75 | 320 | 22868
Jinsong Wu | 70 | 566 | 16282
Horacio D. Espinosa | 67 | 315 | 16270
Shumin Zhai | 67 | 200 | 13447
Shang-Hua Teng | 66 | 265 | 16647
Dimitri Kanevsky | 62 | 362 | 14072
Marilyn A. Walker | 62 | 309 | 13429
Tara N. Sainath | 61 | 274 | 25183
Kenneth Church | 61 | 295 | 21179
John B Ketterson | 60 | 814 | 16929
Pascal Frossard | 59 | 637 | 22749
Michael Picheny | 57 | 244 | 11759
G. R. Scott Budinger | 56 | 196 | 12063
Jun Wu | 53 | 359 | 12110
Network Information
Related Institutions (5)
Google
39.8K papers, 2.1M citations

82% related

Microsoft
86.9K papers, 4.1M citations

82% related

Carnegie Mellon University
104.3K papers, 5.9M citations

80% related

Nokia
28.3K papers, 695.7K citations

79% related

Performance
Metrics
No. of papers from the Institution in previous years
Year | Papers
2022 | 3
2021 | 24
2020 | 42
2019 | 55
2018 | 41
2017 | 53