Institution

Nuance Communications

Company•Vienna, Austria•

About: Nuance Communications is a company organization based out in Vienna, Austria. It is known for research contribution in the topics: Speech processing & Voice activity detection. The organization has 1518 authors who have published 1701 publications receiving 54891 citations. The organization is also known as: ScanSoft & ScanSoft Inc..

...read moreread less

Topics: Speech processing, Voice activity detection, Speaker recognition, Signal, Acoustic model ...read more

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Patent•

System for and method of creating and browsing a voice web

[...]

Michael H. Cohen¹, Tracy D. Wax¹•Institutions (1)

Nuance Communications¹

01 Dec 1999

TL;DR: In this paper, a user can browse through a network of audio information, forming a seamless integration of the world wide web and the entire telephone network browsable from any telephone set.

...read moreread less

Abstract: The present invention allows a user to audibly and interactively browse through a network of audio information, forming a seamless integration of the world wide web and the entire telephone network browsable from any telephone set. Preferably a browser controllers (102) allows the user (100) to receive audio information and to transmit verbal instructions. The browser controller (102) links the user (100) to voice pages (108, 112, 114, 116, 118, 120), which can be any telephone station (108, 112, 114, 116) or world wide web page (120), in response to voice commands. Upon linking, certain information is played with an audio indicia which identifies a linking capability. If the user (100) repeats the information set off by the audio indicia, the telephone number or URL of the selected link is transmitted to the browser controller (112). The browser controller (112) establishes a new link with the identified telephone number or URL, and if successful, disconnects the previous link. The originator (100) no longer needs to know of the existence of the receiver nor the telephone number or URL of the receiver because this invention provides a method to browse the entire telephone network and world wide web and to connect to a receiver by saying the name of the hyperlink. This brings the power of the world wide web to the telephone network. In effect, this invention takes the PSTN from its current state as a set of more than 800 million nodes including means to make pairwise connections and converts it to a highly interconnected browsable web, as well as integrating it with the entire world wide web.

...read moreread less

254 citations

Patent•

Multiple web-based content category searching in mobile search application

[...]

Michael S. Phillips¹, John N. Nguyen¹•Institutions (1)

Nuance Communications¹

27 Aug 2010

TL;DR: In this article, improved capabilities are described for multiple web-based content category searching for web content on a mobile communication facility comprising capturing speech presented by a user using a resident capture facility on the mobile communication device, transmitting at least a portion of the captured speech as data through a wireless communication device to a speech recognition facility, generating speech-to-text results for the captured text utilizing the speech recognition device, and transmitting the text results and a plurality of formatting rules specifying how search text may be used to form a query for a search capability on mobile communications facility, wherein each formatting

...read moreread less

Abstract: In embodiments of the present invention improved capabilities are described for multiple web-based content category searching for web content on a mobile communication facility comprising capturing speech presented by a user using a resident capture facility on the mobile communication facility; transmitting at least a portion of the captured speech as data through a wireless communication facility to a speech recognition facility; generating speech-to-text results for the captured speech utilizing the speech recognition facility; and transmitting the text results and a plurality of formatting rules specifying how search text may be used to form a query for a search capability on the mobile communications facility, wherein each formatting rule is associated with a category of content to be searched.

...read moreread less

248 citations

Patent•

Training and using pronunciation guessers in speech recognition

[...]

Laurence S. Gillick, Steven Wegmann¹, Jonathan Yamron•Institutions (1)

Nuance Communications¹

10 Oct 2003

TL;DR: In this paper, the error rate of a pronunciation guesser that guesses the phonetic spelling of words used in speech recognition is improved by causing its training to weigh letter-to-phoneme mappings used as data in such training as a function of the frequency of the words in which such mappings occur.

...read moreread less

Abstract: The error rate of a pronunciation guesser that guesses the phonetic spelling of words used in speech recognition is improved by causing its training to weigh letter-to-phoneme mappings used as data in such training as a function of the frequency of the words in which such mappings occur. Preferably the ratio of the weight to word frequency increases as word frequencies decreases. Acoustic phoneme models for use in speech recognition with phonetic spellings generated by a pronunciation guesser that makes errors are trained against word models whose phonetic spellings have been generated by a pronunciation guesser that makes similar errors. As a result, the acoustic models represent blends of phoneme sounds that reflect the spelling errors made by the pronunciation guessers. Speech recognition enabled systems are made by storing in them both a pronunciation guesser and a corresponding set of such blended acoustic models.

...read moreread less

241 citations

Patent•

Internet based speech recognition system with dynamic grammars

[...]

Ian M. Bennett¹•Institutions (1)

Nuance Communications¹

09 Apr 2007

TL;DR: In this article, a speech-enabled WWW based computing system allows a user to interact with content associated with a web page and select items of interest using speech as a mode of input.

...read moreread less

Abstract: A speech-enabled WWW based computing system allows a user to interact with content associated with a web page and select items of interest using speech as a mode of input. Dynamic grammars can assist in the recognition operations to improve speed and comprehension.

...read moreread less

239 citations

Journal Article•DOI•

Nanoscale imaging of buried structures via scanning near-field ultrasound holography.

[...]

Gajendra S. Shekhawat¹, Vinayak P. Dravid², Vinayak P. Dravid¹•Institutions (2)

Nuance Communications¹, Northwestern University²

07 Oct 2005-Science

TL;DR: SNFUH has been developed that provides depth information as well as spatial resolution at the 10- to 100-nanometer scale and used to image buried nanostructures, to perform subsurface metrology in microelectronic structures, and to image malaria parasites in red blood cells.

...read moreread less

Abstract: A nondestructive imaging method, scanning near-field ultrasound holography (SNFUH), has been developed that provides depth information as well as spatial resolution at the 10- to 100-nanometer scale. In SNFUH, the phase and amplitude of the scattered specimen ultrasound wave, reflected in perturbation to the surface acoustic standing wave, are mapped with a scanning probe microscopy platform to provide nanoscale-resolution images of the internal substructure of diverse materials. We have used SNFUH to image buried nanostructures, to perform subsurface metrology in microelectronic structures, and to image malaria parasites in red blood cells.

...read moreread less

236 citations

Collapse

Authors

Showing all 1521 results

Name	H-index	Papers	Citations
Vinayak P. Dravid	103	817	43612
Mehryar Mohri	75	320	22868
Jinsong Wu	70	566	16282
Horacio D. Espinosa	67	315	16270
Shumin Zhai	67	200	13447
Shang-Hua Teng	66	265	16647
Dimitri Kanevsky	62	362	14072
Marilyn A. Walker	62	309	13429
Tara N. Sainath	61	274	25183
Kenneth Church	61	295	21179
John B Ketterson	60	814	16929
Pascal Frossard	59	637	22749
Michael Picheny	57	244	11759
G. R. Scott Budinger	56	196	12063
Jun Wu	53	359	12110

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

82% related

Microsoft

86.9K papers, 4.1M citations

82% related

Carnegie Mellon University

104.3K papers, 5.9M citations

80% related

Nokia

28.3K papers, 695.7K citations

38.6K papers, 1.3M citations

79% related

Performance

Metrics

1,704

Papers

56,595

Citations

No. of papers from the Institution in previous years
Year	Papers
2022	3
2021	24
2020	42
2019	55
2018	41
2017	53