Home
/
Institutions
/
Nuance Communications

Institution

Nuance Communications

Company•Vienna, Austria•

About: Nuance Communications is a company organization based out in Vienna, Austria. It is known for research contribution in the topics: Speech processing & Voice activity detection. The organization has 1518 authors who have published 1701 publications receiving 54891 citations. The organization is also known as: ScanSoft & ScanSoft Inc..

...read moreread less

Topics: Speech processing, Voice activity detection, Speaker recognition, Signal, Acoustic model ...read more

Papers published on a yearly basis

2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1993
1992
1991

Papers

PDF

Open Access

More filters

Patent•

Text To Speech Synthesis for Texts with Foreign Language Inclusions

[...]

Johan Wouters¹, Christof Traber¹, David Hagstrand¹, Alexis Wilpert¹, Jürgen Keller¹, Igor Nozhov¹ - Show less +2 more•Institutions (1)

Nuance Communications¹

19 Nov 2012

TL;DR: In this paper, a speech output is generated from a text input written in a first language and containing inclusions in a second language, where words in the native language are pronounced with a native pronunciation and words in a foreign language are spoken with a proficient foreign pronunciation.

...read moreread less

Abstract: A speech output is generated from a text input written in a first language and containing inclusions in a second language. Words in the native language are pronounced with a native pronunciation and words in the foreign language are pronounced with a proficient foreign pronunciation. Language dependent phoneme symbols generated for words of the second language are replaced with language dependent phoneme symbols of the first language, where said replacing includes the steps of assigning to each language dependent phoneme symbol of the second language a language independent target phoneme symbol, mapping to each one language independent target phoneme symbol a language independent substitute phoneme symbol assignable to a language dependent substitute phoneme symbol of the first language, substituting the language dependent phoneme symbols of the second language by the language dependent substitute phoneme symbols of the first language.

...read moreread less

27 citations

Patent•

Methods and apparatus for displaying content

[...]

Vladimir Sejnoha¹, Victor Shine Chen¹, Steven Hatch¹, Gary B. Clayton¹•Institutions (1)

Nuance Communications¹

08 Sep 2010

TL;DR: In this article, a carousel having a plurality of slots may be displayed in a first portion of a display of display devices, and content that is dynamically generated based on user input may be shown in a second portion of the display, separate from the first portion.

...read moreread less

Abstract: Some embodiments relate to using a carousel to display content. In some embodiments, a carousel having a plurality of slots may be displayed in a first portion of a display of a display device, and in response to user selection of one of the plurality of slots, content that is dynamically generated based on user input may be displayed in a second portion of the display, separate from the first portion.

...read moreread less

27 citations

Patent•

Multi-modal mobile customer care system

[...]

David Andrew Mauro¹, Vijay R. Raman¹•Institutions (1)

Nuance Communications¹

09 Mar 2012

TL;DR: In this article, the authors present an interface that integrates the multiple channels of the customer service provider and recommends a channel based on an identification of a customer service need of the customers.

...read moreread less

Abstract: Customer service and/or care providers generally have multiple communications channels (i.e., modes of communications, such as an Internet webpage, live agent telephones, Interactive Voice Response (IVR) system) of communication with which a customer may interact with the customer service provider. Currently, customers must select the communications channel by guessing which communications channel would best accommodate the customer's purpose/need for communicating with the customer service provider. In some scenarios, the customer may select the wrong communications channel because the selected channel is not able to service the customer's need. In another scenario, the customer may select a channel that is more cumbersome to service the customer's particular need than another channel of the customer service provider. Embodiments of the present invention provide an interface that integrates the multiple channels of the customer service provider and recommends a channel based on an identification of a customer service need of the customer.

...read moreread less

27 citations

Proceedings Article•DOI•

Industrial OCR approaches: architecture, algorithms, and adaptation techniques

[...]

István Marosi¹•Institutions (1)

Nuance Communications¹

28 Jan 2007

TL;DR: The architecture of a modern OCR system is described with an emphasis on the adaptation process, where systems try to adapt themselves to the actual features of the image or document to be recognized.

...read moreread less

Abstract: Optical Character Recognition is much more than character classification. An industrial OCR application combines algorithms studied in detail by different researchers in the area of image processing, pattern recognition, machine learning, language analysis, document understanding, data mining, and other, artificial intelligence domains. There is no single perfect algorithm for any of the OCR problems, so modern systems try to adapt themselves to the actual features of the image or document to be recognized. This paper describes the architecture of a modern OCR system with an emphasis on this adaptation process.

...read moreread less

26 citations

Patent•

Signal noise reduction using magnitude-domain spectral subtraction

[...]

Mitchel Weintraub¹, Francoise Beaufays¹•Institutions (1)

Nuance Communications¹

29 Feb 2000

TL;DR: In this article, a method and apparatus for generating a noise-reduced feature vector representing human speech is presented, where speech data representing an input speech waveform are first input and filtered, and a noise reduction process is then performed.

...read moreread less

Abstract: A method and apparatus for generating a noise-reduced feature vector representing human speech are provided. Speech data representing an input speech waveform are first input and filtered. Spectral energies of the filtered speech data are determined, and a noise reduction process is then performed. In the noise reduction process, a spectral magnitude is computed for a frequency index of multiple frequency indexes. A noise magnitude estimate is then determined for the frequency index by updating a histogram of spectral magnitude, and then determining the noise magnitude estimate as a predetermined percentile of the histogram. A signal-to-noise ratio is then determined for the frequency index. A scale factor is computed for the frequency index, as a function of the signal-to-noise ratio and the noise magnitude estimate. The noise magnitude estimate is then scaled by the scale factor. The scaled noise magnitude estimate is subtracted from the spectral magnitudes of the filtered speech data, to produce cleaned speech data, based on which a feature vector is generated.

...read moreread less

26 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
…
102
103
104
105
106
107
108
…
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Authors

Showing all 1521 results

Name	H-index	Papers	Citations
Vinayak P. Dravid	103	817	43612
Mehryar Mohri	75	320	22868
Jinsong Wu	70	566	16282
Horacio D. Espinosa	67	315	16270
Shumin Zhai	67	200	13447
Shang-Hua Teng	66	265	16647
Dimitri Kanevsky	62	362	14072
Marilyn A. Walker	62	309	13429
Tara N. Sainath	61	274	25183
Kenneth Church	61	295	21179
John B Ketterson	60	814	16929
Pascal Frossard	59	637	22749
Michael Picheny	57	244	11759
G. R. Scott Budinger	56	196	12063
Jun Wu	53	359	12110

Network Information

Related Institutions (5)

Google

39.8K papers, 2.1M citations

82% related

Microsoft

86.9K papers, 4.1M citations

82% related

Carnegie Mellon University

104.3K papers, 5.9M citations

80% related

Nokia

28.3K papers, 695.7K citations

38.6K papers, 1.3M citations

79% related

Performance

Metrics

1,704

Papers

56,595

Citations

No. of papers from the Institution in previous years
Year	Papers
2022	3
2021	24
2020	42
2019	55
2018	41
2017	53