Institution
Nuance Communications
Company•Vienna, Austria•
About: Nuance Communications is a company organization based out in Vienna, Austria. It is known for research contribution in the topics: Speech processing & Voice activity detection. The organization has 1518 authors who have published 1701 publications receiving 54891 citations. The organization is also known as: ScanSoft & ScanSoft Inc..
Papers published on a yearly basis
Papers
More filters
•
29 Mar 2010TL;DR: In this paper, the authors proposed a method for determining a noise reference signal for noise compensation and/or noise reduction, where a first audio signal is filtered using a first adaptive filter to obtain a first filtered audio signal.
Abstract: The invention provides a method for determining a noise reference signal for noise compensation and/or noise reduction. A first audio signal on a first signal path and a second audio signal on a second signal path are received. The first audio signal is filtered using a first adaptive filter to obtain a first filtered audio signal. The second audio signal is filtered using a second adaptive filter to obtain a second filtered audio signal. The first and the second filtered audio signal are combined to obtain the noise reference signal. The first and the second adaptive filter are adapted such as to minimize a wanted signal component in the noise reference signal.
90 citations
•
27 Oct 2005TL;DR: In this article, a method, system, and program provides for hands free contact database information entry at a communication device, which detects user initiation to record and extracts contact information from the text.
Abstract: A method, system, and program provides for hands free contact database information entry at a communication device. A recording system at a communication device detects a user initiation to record. Responsive to detecting the user initiation to record, the recording system records the ongoing conversation supported between the communication device and a second remote communication device. The recording system converts the recording of the conversation into text. Next, the recording system extracts contact information from the text. Then, the recording system stores the extracted contact information in an entry of the contact database, such that contact information is added to the contact database of the communication device without manual entry of the contact information by the user.
90 citations
•
24 Jul 2011TL;DR: In this article, a reduced virtual keyboard system for text input on electronic devices is described, where text input is performed by creating a tracing trajectory, and dynamic prediction solutions are created during the tracing process to avoid the need for a user to complete the entire word trajectory.
Abstract: A reduced virtual keyboard system for text input on electronic devices is disclosed. Text input is performed by creating a tracing trajectory. Dynamic prediction solutions are created during the tracing process, thus avoiding the need for a user to complete the entire word trajectory. The system also allows a mixture of tapping actions and sliding motions for the same word. The system may comprise a Long Words Dictionary database having first letters corresponding to predetermined keys of the keyboard. Alternatively, the system uses a Dictionary and a database management tool to find long words.
89 citations
•
23 Aug 2013TL;DR: In this article, a method of recognizing speech that comprises natural language and at least one word specified in at least 1 domain-specific vocabulary is provided, and the method comprises performing a first speech processing pass comprising identifying, in the speech, a first portion including the natural language, and a second part including the at least word specified by the domain specific vocabulary, and recognizing the first part including natural language.
Abstract: In some aspects, a method of recognizing speech that comprises natural language and at least one word specified in at least one domain-specific vocabulary is provided. The method comprises performing a first speech processing pass comprising identifying, in the speech, a first portion including the natural language and a second portion including the at least one word specified in the at least one domain-specific vocabulary, and recognizing the first portion including the natural language. The method further comprises performing a second speech processing pass comprising recognizing the second portion including the at least one word specified in the at least one domain-specific vocabulary.
89 citations
•
05 Nov 2012TL;DR: In this paper, the authors present techniques and methods for privacy-sensitive training data collection for updating acoustic models of speech recognition systems, where the system locally creates adaptation data from raw audio data, including derived statistics and/or acoustic model update parameters.
Abstract: Techniques disclosed herein include systems and methods for privacy-sensitive training data collection for updating acoustic models of speech recognition systems. In one embodiment, the system locally creates adaptation data from raw audio data. Such adaptation can include derived statistics and/or acoustic model update parameters. The derived statistics and/or updated acoustic model data can then be sent to a speech recognition server or third-party entity. Since the audio data and transcriptions are already processed, the statistics or acoustic model data is devoid of any information that could be human-readable or machine readable such as to enable reconstruction of audio data. Thus, such converted data sent to a server does not include personal or confidential information. Third-party servers can then continually update speech models without storing personal and confidential utterances of users.
89 citations
Authors
Showing all 1521 results
Name | H-index | Papers | Citations |
---|---|---|---|
Vinayak P. Dravid | 103 | 817 | 43612 |
Mehryar Mohri | 75 | 320 | 22868 |
Jinsong Wu | 70 | 566 | 16282 |
Horacio D. Espinosa | 67 | 315 | 16270 |
Shumin Zhai | 67 | 200 | 13447 |
Shang-Hua Teng | 66 | 265 | 16647 |
Dimitri Kanevsky | 62 | 362 | 14072 |
Marilyn A. Walker | 62 | 309 | 13429 |
Tara N. Sainath | 61 | 274 | 25183 |
Kenneth Church | 61 | 295 | 21179 |
John B Ketterson | 60 | 814 | 16929 |
Pascal Frossard | 59 | 637 | 22749 |
Michael Picheny | 57 | 244 | 11759 |
G. R. Scott Budinger | 56 | 196 | 12063 |
Jun Wu | 53 | 359 | 12110 |