Institution

Nuance Communications

Company, Vienna, Austria
About: Nuance Communications is a company based in Vienna, Austria. It is known for research contributions in the topics of speech processing and voice activity detection. The organization has 1518 authors who have published 1701 publications receiving 54891 citations. The organization is also known as ScanSoft and ScanSoft Inc.


Papers
Patent
06 Jan 2012
TL;DR: In this paper, the results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.
Abstract: Techniques for combining the results of multiple recognizers in a distributed speech recognition architecture. Speech data input to a client device is encoded and processed both locally and remotely by different recognizers configured to be proficient at different speech recognition tasks. The client/server architecture is configurable to enable network providers to specify a policy directed to a trade-off between reducing recognition latency perceived by a user and usage of network resources. The results of the local and remote speech recognition engines are combined based, at least in part, on logic stored by one or more components of the client/server architecture.

10 citations

Patent
20 Sep 2017
TL;DR: In this article, a system for automatically processing text comprising information regarding a patient encounter to prioritize medical billing codes derived from the text is presented, which includes at least one storage medium storing processor-executable instructions.
Abstract: Some aspects include a system for automatically processing text comprising information regarding a patient encounter to prioritize medical billing codes derived from the text. The system comprises at least one storage medium storing processor-executable instructions, and at least one processor configured to execute the processor-executable instructions to analyze the text to extract a plurality of facts from the text, assign a plurality of medical billing codes to the text based at least in part on the plurality of facts, using a model trained at least in part on feedback from a user, order the plurality of medical billing codes in a sequence beginning with a primary medical billing code corresponding to a primary diagnosis associated with the text, and present the ordered sequence of medical billing codes to the user for review.

10 citations

Proceedings ArticleDOI
25 Mar 2012
TL;DR: This paper introduces a segmentation process consisting of two phases: first, forced alignment is performed using an HMM-GMM model; the resulting segmentation is then locally refined using an SVM-based boundary model.
Abstract: Phonetic segmentation is an important step in the development of a concatenative TTS voice. This paper introduces a segmentation process consisting of two phases. First, forced alignment is performed using an HMM-GMM model. The resulting segmentation is then locally refined using an SVM based boundary model. Both the models are derived from multi-speaker data using a speaker adaptive training procedure. Evaluation results are obtained on the TIMIT corpus and on a proprietary single-speaker TTS corpus.

10 citations

Patent
22 May 2014
TL;DR: In this article, a method is presented for extracting a target speaker's speech, using a known speaker voiceprint, from an audio recording that includes both the target speaker's speech and the known speaker's speech.
Abstract: In many scenarios, speaker verification systems can be given a single-channel audio with recordings of multiple speakers. To perform accurate speaker verification, a system can isolate the speech of a speaker. In one embodiment, a method, and corresponding system, of speaker verification includes extracting a target speaker's speech, using a known speaker voiceprint, from an audio recording that includes the target speaker's speech and the known speaker's speech. The known speaker voiceprint can correspond to the known speaker. Extracting the target speaker's speech can include determining portions of the audio recording where the known speaker voiceprint matches the known speaker's speech above a particular threshold, and extracting the target speaker's speech from other portions of the audio recording. In this manner, speaker verification is performed on the target speaker's speech without interference from the known speaker's speech and allows for a more accurate verification.

9 citations

Patent
30 Sep 2011
TL;DR: In this paper, the authors present a technique for receiving a query from a user of a mobile device, and for conveying to the user not only search results, but also feedback relating to the query.
Abstract: Some embodiments of the invention provide techniques for receiving a query from a user of a mobile device, and for conveying to the user not only search results, but also feedback relating to the query. For example, the user may be prompted to elicit supplemental information relating to the query, or provided other feedback. The feedback may be conveyed in a manner which minimizes how much of the mobile device's display screen is dedicated to presenting the feedback.

9 citations


Authors

Showing all 1521 results

Name | H-index | Papers | Citations
Vinayak P. Dravid | 103 | 817 | 43612
Mehryar Mohri | 75 | 320 | 22868
Jinsong Wu | 70 | 566 | 16282
Horacio D. Espinosa | 67 | 315 | 16270
Shumin Zhai | 67 | 200 | 13447
Shang-Hua Teng | 66 | 265 | 16647
Dimitri Kanevsky | 62 | 362 | 14072
Marilyn A. Walker | 62 | 309 | 13429
Tara N. Sainath | 61 | 274 | 25183
Kenneth Church | 61 | 295 | 21179
John B Ketterson | 60 | 814 | 16929
Pascal Frossard | 59 | 637 | 22749
Michael Picheny | 57 | 244 | 11759
G. R. Scott Budinger | 56 | 196 | 12063
Jun Wu | 53 | 359 | 12110
Network Information
Related Institutions (5)
Google
39.8K papers, 2.1M citations

82% related

Microsoft
86.9K papers, 4.1M citations

82% related

Carnegie Mellon University
104.3K papers, 5.9M citations

80% related

Nokia
28.3K papers, 695.7K citations

79% related

Performance
Metrics
No. of papers from the Institution in previous years
Year | Papers
2022 | 3
2021 | 24
2020 | 42
2019 | 55
2018 | 41
2017 | 53